You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000000250_02148

Basic Information

Species: TF01-11 sp001414325
Lineage: Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; TF01-11; TF01-11 sp001414325
CAZyme ID: MGYG000000250_02148
CAZy Family: GH5
CAZyme Description: hypothetical protein

CAZyme Property
Protein Length: 1068
CGC: (no value listed)
Molecular Weight: 116197.33
Isoelectric Point: 6.9923

Genome Property
Genome Assembly ID: MGYG000000250
Genome Size: 3613289
Genome Type: Isolate
Country: China
Continent: Asia
Gene Location: Start: 5596; End: 8802; Strand: +
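
As a quick consistency check, the gene span can be compared with the reported protein length. The sketch below assumes the Start/End coordinates are 1-based and inclusive and that the span ends with one untranslated stop codon; neither assumption is stated in this record.

```python
# Values as reported in this record.
gene_start, gene_end = 5596, 8802
protein_length = 1068

# Assumption: coordinates are 1-based and inclusive.
gene_length_nt = gene_end - gene_start + 1

# Assumption: the span includes a single stop codon that is not translated.
codons, remainder = divmod(gene_length_nt, 3)
expected_protein_length = codons - 1

print(f"gene length: {gene_length_nt} nt")
print(f"codons: {codons} (remainder {remainder})")
print(f"expected protein length: {expected_protein_length} aa")
print("matches reported length:", expected_protein_length == protein_length)
```

Under these assumptions the span is a whole number of codons and agrees with the reported length of 1068 residues.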

Enzyme Prediction

No EC number is predicted for MGYG000000250_02148.

CAZyme Signature Domains

Family  Start  End  E-value  Family coverage
GH5     545    852  3.6e-80  0.9891304347826086

CDD Domains

CDD ID     Domain     E-value   qStart  qEnd  sStart  sEnd
pfam00150  Cellulase  4.27e-38  543     849   14      266
    Description: Cellulase (glycosyl hydrolase family 5).
COG2730    BglC       9.24e-18  494     825   30      328
    Description: Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
pfam13306  LRR_5      7.52e-08  273     335   3       66
    Description: Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
sd00036    LRR_3      3.54e-07  284     335   17      70
    Description: leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.
pfam13306  LRR_5      6.06e-07  269     335   44      111
    Description: Leucine rich repeats (6 copies). This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.
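
Several of the CDD hits above cover overlapping stretches of the query. One way to summarize them is to merge hits whose query ranges overlap into combined regions. The sketch below is only an illustration: the helper name `merge_overlapping` is hypothetical, and treating any overlap as "same region" is an assumption rather than something this page does.

```python
# CDD hits from the table above: (domain, q_start, q_end, e_value).
cdd_hits = [
    ("Cellulase", 543, 849, 4.27e-38),
    ("BglC", 494, 825, 9.24e-18),
    ("LRR_5", 273, 335, 7.52e-08),
    ("LRR_3", 284, 335, 3.54e-07),
    ("LRR_5", 269, 335, 6.06e-07),
]


def merge_overlapping(hits):
    """Merge hits whose query ranges overlap into combined regions."""
    regions = []
    # Process hits in order of query start so overlaps are adjacent.
    for name, start, end, _evalue in sorted(hits, key=lambda h: h[1]):
        if regions and start <= regions[-1]["end"]:
            # Overlaps the current region: extend it and record the domain name.
            regions[-1]["end"] = max(regions[-1]["end"], end)
            regions[-1]["names"].add(name)
        else:
            regions.append({"start": start, "end": end, "names": {name}})
    return regions


regions = merge_overlapping(cdd_hits)
for region in regions:
    print(region["start"], region["end"], sorted(region["names"]))
```

With the hits listed here, this yields one region spanning the leucine-rich repeat hits and one spanning the Cellulase and BglC hits.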

CAZyme Hits

Hit ID      E-value   Query Start  Query End  Hit Start  Hit End
CCO05195.1  4.81e-98  497          871        43         393
VEU81114.1  4.02e-91  511          871        37         375
CBK96866.1  1.18e-87  467          871        3          380
AHF25528.1  2.57e-74  511          871        19         367
QTE66996.1  1.65e-72  511          871        34         382

PDB Hits

Hit ID  E-value   Query Start  Query End  Hit Start  Hit End
2JEP_A  1.85e-48  511          850        38         362
    Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli]
    2JEP_B: Native family 5 xyloglucanase from Paenibacillus pabuli [Paenibacillus pabuli]
    2JEQ_A: Family 5 xyloglucanase from Paenibacillus pabuli in complex with ligand [Paenibacillus pabuli]
6WQP_A  1.06e-47  500          854        9          326
    GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis]
    6WQP_B: GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis]
    6WQV_A: GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
    6WQV_B: GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
    6WQV_C: GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
    6WQV_D: GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
6XRK_A  7.76e-45  513          853        29         342
    GH5-4 broad specificity endoglucanase from an uncultured bovine rumen ciliate [uncultured bovine rumen ciliate]
    6XRK_B: GH5-4 broad specificity endoglucanase from an uncultured bovine rumen ciliate [uncultured bovine rumen ciliate]
1EDG_A  1.16e-42  518          867        32         367
    Single Crystal Structure Determination Of The Catalytic Domain Of Celcca Carried Out At 15 Degree C [Ruminiclostridium cellulolyticum H10]
4W8A_A  1.80e-42  511          852        6          336
    Crystal structure of XEG5B, a GH5 xyloglucan-specific beta-1,4-glucanase from ruminal metagenomic library, in the native form [uncultured bacterium]
    4W8B_A: Crystal structure of XEG5B, a GH5 xyloglucan-specific beta-1,4-glucanase from ruminal metagenomic library, in complex with XXLG [uncultured bacterium]

Swiss-Prot Hits

Hit ID  E-value   Query Start  Query End  Hit Start  Hit End
O08342  3.25e-46  511          850        43         367
    Endoglucanase A OS=Paenibacillus barcinonensis OX=198119 GN=celA PE=1 SV=1
P17901  4.51e-41  518          867        57         392
    Endoglucanase A OS=Ruminiclostridium cellulolyticum (strain ATCC 35319 / DSM 5812 / JCM 6584 / H10) OX=394503 GN=celCCA PE=1 SV=1
P28623  7.81e-39  503          854        39         340
    Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P54937  1.08e-38  511          856        42         348
    Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
A7LXT7  5.08e-38  497          857        141        479
    Xyloglucan-specific endo-beta-1,4-glucanase BoGH5A OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / BCRC 10623 / CCUG 4943 / NCTC 11153) OX=411476 GN=BACOVA_02653 PE=1 SV=1

SignalP and Lipop Annotations

This protein is predicted to carry a signal peptide (SP).

Other     SP_Sec_SPI  LIPO_Sec_SPII  TAT_Tat_SPI  TATLIP_Sec_SPII  PILIN_Sec_SPIII
0.000419  0.998529    0.000376       0.000280     0.000191         0.000161
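
The SP call presumably follows from the highest-scoring class in the row above. A minimal sketch of that selection, assuming the call is simply the label with the largest probability (the page does not state the rule it uses):

```python
# Class labels and probabilities as listed in the table above.
labels = [
    "Other",
    "SP_Sec_SPI",
    "LIPO_Sec_SPII",
    "TAT_Tat_SPI",
    "TATLIP_Sec_SPII",
    "PILIN_Sec_SPIII",
]
probs = [0.000419, 0.998529, 0.000376, 0.000280, 0.000191, 0.000161]

scores = dict(zip(labels, probs))

# Assumption: the reported call is the label with the highest probability.
best_label = max(scores, key=scores.get)

print(best_label, scores[best_label])
```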

TMHMM Annotations

start end
9 31