logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003925_02137

You are here: Home > Sequence: MGYG000003925_02137

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species CAG-882 sp900545175
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; CAG-882; CAG-882 sp900545175
CAZyme ID MGYG000003925_02137
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
413 MGYG000003925_30|CGC2 45758.09 4.1372
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003925 3268447 MAG China Asia
Gene Location Start: 9252;  End: 10493  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 131 371 7.4e-92 0.9873417721518988

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 9.68e-74 129 379 2 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 1.39e-12 154 316 76 278
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
NF033849 ser_rich_anae_1 0.001 29 112 418 501
serine-rich protein. This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
NF033849 ser_rich_anae_1 0.001 30 119 425 512
serine-rich protein. This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
COG4547 CobT2 0.008 35 134 234 330
Cobalamin biosynthesis protein CobT (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole phosphorib... [Coenzyme transport and metabolism].

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QNM00780.1 1.94e-132 112 411 213 512
QWT53734.1 7.77e-132 112 411 213 512
AFA47670.1 4.26e-130 110 413 195 500
QNM02472.1 2.19e-126 110 411 84 384
CBK83877.1 4.13e-124 111 411 225 527

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6GJF_A 2.41e-108 112 413 5 302
Ancestralendocellulase Cel5A [synthetic construct],6GJF_B Ancestral endocellulase Cel5A [synthetic construct],6GJF_C Ancestral endocellulase Cel5A [synthetic construct],6GJF_D Ancestral endocellulase Cel5A [synthetic construct],6GJF_E Ancestral endocellulase Cel5A [synthetic construct],6GJF_F Ancestral endocellulase Cel5A [synthetic construct]
4XZB_A 4.52e-98 112 411 4 303
endo-glucanaseGsCelA P1 [Geobacillus sp. 70PC53]
4XZW_A 1.16e-92 112 411 4 302
Endo-glucanasechimera C10 [uncultured bacterium]
1H11_A 1.19e-92 110 411 2 300
2-DEOXY-2-FLURO-B-D-CELLOTRIOSYL/ENZYMEINTERMEDIATE COMPLEX OF THE ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHEARANS AT 1.08 ANGSTROM RESOLUTION [Salipaludibacillus agaradhaerens],1H2J_A ENDOGLUCANASE CEL5A IN COMPLEX WITH UNHYDROLYSED AND COVALENTLY LINKED 2,4-DINITROPHENYL-2-DEOXY-2-FLUORO-CELLOBIOSIDE AT 1.15 A RESOLUTION [Salipaludibacillus agaradhaerens],1HF6_A ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHAERENS IN THE ORTHORHOMBIC CRYSTAL FORM IN COMPLEX WITH CELLOTRIOSE [Salipaludibacillus agaradhaerens],1OCQ_A COMPLEX OF THE ENDOGLUCANASE CEL5A FROM BACILLUS AGARADHEARANS AT 1.08 ANGSTROM RESOLUTION with cellobio-derived isofagomine [Salipaludibacillus agaradhaerens],1W3K_A Endoglucanase Cel5a From Bacillus Agaradhaerens In Complex With Cellobio Derived-tetrahydrooxazine [Salipaludibacillus agaradhaerens],1W3L_A Endoglucanase Cel5a From Bacillus Agaradhaerens In Complex With Cellotri Derived-Tetrahydrooxazine [Salipaludibacillus agaradhaerens],4A3H_A 2',4' Dinitrophenyl-2-Deoxy-2-Fluro-B-D-Cellobioside Complex Of The Endoglucanase Cel5a From Bacillus Agaradhaerens At 1.6 A Resolution [Salipaludibacillus agaradhaerens],5A3H_A 2-Deoxy-2-Fluro-B-D-CellobiosylENZYME INTERMEDIATE COMPLEX Of The Endoglucanase Cel5a From Bacillus Agaradhearans At 1.8 Angstroms Resolution [Salipaludibacillus agaradhaerens],6A3H_A 2-Deoxy-2-Fluro-B-D-CellotriosylENZYME INTERMEDIATE COMPLEX OF THE Endoglucanase Cel5a From Bacillus Agaradhearans At 1.6 Angstrom Resolution [Salipaludibacillus agaradhaerens],7A3H_A Native Endoglucanase Cel5a Catalytic Core Domain At 0.95 Angstroms Resolution [Salipaludibacillus agaradhaerens],8A3H_A Cellobiose-derived imidazole complex of the endoglucanase cel5A from Bacillus agaradhaerens at 0.97 A resolution [Salipaludibacillus agaradhaerens]
1E5J_A 1.27e-92 110 411 2 300
EndoglucanaseCel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With Methyl-4ii-S-Alpha-Cellobiosyl-4ii-Thio Beta-Cellobioside [Salipaludibacillus agaradhaerens],1QHZ_A Native Tetragonal Structure Of The Endoglucanase Cel5a From Bacillus Agaradhaerens [Salipaludibacillus agaradhaerens],1QI0_A Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With Cellobiose [Salipaludibacillus agaradhaerens],1QI2_A Endoglucanase Cel5a From Bacillus Agaradhaerens In The Tetragonal Crystal Form In Complex With 2',4'-Dinitrophenyl 2-Deoxy-2-Fluoro-B- D-Cellotrioside [Salipaludibacillus agaradhaerens],2V38_A Family 5 endoglucanase Cel5A from Bacillus agaradhaerens in complex with cellobio-derived noeuromycin [Salipaludibacillus agaradhaerens]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P22541 8.99e-102 110 413 106 409
Endoglucanase A OS=Butyrivibrio fibrisolvens OX=831 GN=celA PE=1 SV=1
Q07940 5.08e-99 119 408 4 291
Endoglucanase 4 OS=Ruminococcus albus OX=1264 GN=Eg IV PE=1 SV=1
P15704 5.10e-94 106 411 33 335
Endoglucanase OS=Clostridium saccharobutylicum OX=169679 GN=eglA PE=3 SV=1
P06565 2.01e-92 106 411 23 326
Endoglucanase B OS=Evansella cellulosilytica (strain ATCC 21833 / DSM 2522 / FERM P-1141 / JCM 9156 / N-4) OX=649639 GN=celB PE=3 SV=1
O85465 4.85e-91 106 411 24 326
Endoglucanase 5A OS=Salipaludibacillus agaradhaerens OX=76935 GN=cel5A PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000001 0.000003 1.000026 0.000000 0.000000 0.000000

TMHMM  Annotations      download full data without filtering help

start end
7 24