logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004429_01107

You are here: Home > Sequence: MGYG000004429_01107

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; CAG-274; ;
CAZyme ID MGYG000004429_01107
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
556 60903.35 4.2618
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004429 2352873 MAG Israel Asia
Gene Location Start: 2914;  End: 4584  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 68 343 4.3e-100 0.9891304347826086

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 4.24e-65 54 346 1 272
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 9.46e-44 30 362 37 380
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
cd14256 Dockerin_I 5.84e-13 504 556 2 55
Type I dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I dockerins, which are responsible for anchoring a variety of enzymatic domains to the complex.
pfam00404 Dockerin_1 2.22e-08 504 544 1 41
Dockerin type I repeat. The dockerin repeat is the binding partner of the cohesin domain pfam00963. The cohesin-dockerin interaction is the crucial interaction for complex formation in the cellulosome. The dockerin repeats, each bearing homology to the EF-hand calcium-binding loop bind calcium.
cd14255 Dockerin_III 2.18e-05 504 555 1 58
Type III dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. Two specific calcium-dependent interactions between cohesin and dockerin appear to be essential for cellulosome assembly, type I and type II. This subfamily represents the atypical type III dockerins and related domains.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
CDM69884.1 1.34e-113 36 373 39 376
AGF55279.1 1.46e-112 35 377 37 378
CBL16772.1 1.97e-112 35 375 97 439
AAC37035.1 3.19e-110 39 386 41 389
ADZ19876.1 1.59e-109 17 373 21 367

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6WQP_A 3.84e-113 35 372 14 353
GH5-4broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
6Q1I_A 3.38e-112 39 374 16 352
GH5-4broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum],6Q1I_B GH5-4 broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum]
6MQ4_A 1.18e-111 35 379 10 353
ChainA, cellulase [Acetivibrio cellulolyticus]
6PZ7_A 1.31e-108 35 373 7 335
GH5-4broad specificity endoglucanase from Clostridium acetobutylicum [Clostridium acetobutylicum ATCC 824]
3NDY_A 5.13e-108 38 378 13 344
Thestructure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDY_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans [Clostridium cellulovorans],3NDZ_A The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_B The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_C The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans],3NDZ_D The structure of the catalytic and carbohydrate binding domain of endoglucanase D from Clostridium cellulovorans bound to cellotriose [Clostridium cellulovorans]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P54937 6.38e-111 39 386 41 389
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P28623 7.16e-106 38 378 44 375
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
P28621 2.53e-101 32 376 37 375
Endoglucanase B OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engB PE=3 SV=1
P10477 1.28e-92 39 383 59 393
Cellulase/esterase CelE OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celE PE=1 SV=2
Q12647 2.97e-84 30 379 18 367
Endoglucanase B OS=Neocallimastix patriciarum OX=4758 GN=CELB PE=2 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000440 0.998484 0.000555 0.000180 0.000167 0.000158

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000004429_01107.