logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001021_00199

You are here: Home > Sequence: MGYG000001021_00199

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Ruminococcus_C sp000980705
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus_C; Ruminococcus_C sp000980705
CAZyme ID MGYG000001021_00199
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
595 MGYG000001021_2|CGC2 66431.4 3.9654
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001021 2360136 MAG Sweden Europe
Gene Location Start: 153398;  End: 155185  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 64 343 1.2e-87 0.9927536231884058

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 3.69e-54 55 344 1 270
Cellulase (glycosyl hydrolase family 5).
COG2730 BglC 5.26e-31 41 366 47 383
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
cd14256 Dockerin_I 1.72e-14 536 592 1 57
Type I dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I dockerins, which are responsible for anchoring a variety of enzymatic domains to the complex.
pfam18259 CBM65_1 2.94e-07 431 511 24 111
Carbohydrate binding module 65 domain 1. This domain is found in the non-catalytic carbohydrate binding module 65B (CMB65B) present in Eubacterium cellulosolvens. CBMs are present in plant cell wall degrading enzymes and are responsible for targeting, which enhances catalysis. CBM65s display higher affinity for oligosaccharides, such as cellohexaose, and particularly polysaccharides than cellotetraose, which fully occupies the core component of the substrate binding cleft. The concave surface presented by beta-sheet 2 comprises the beta-glucan binding site in CBM65s. C6 of all the backbone glucose moieties makes extensive hydrophobic interactions with the surface tryptophans of CBM65s. Three out of the four surface Trp are highly conserved. The conserved metal ion site typical of CBMs is absent in this CBM65 family.
cd14252 Dockerin_like 3.20e-07 537 589 1 54
Dockerin repeat domains and domains resembling dockerin repeats. Dockerins are modules in the cellulosome complex that often anchor catalytic subunits by binding to cohesin domains of scaffolding proteins. Three types of dockerins and their corresponding cohesin have been described in the literature. This alignment models two consecutive dockerin repeats, the functional unit.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QPB75694.1 3.36e-174 7 530 9 531
ADU86907.1 2.14e-149 14 514 13 504
CAB05881.1 4.22e-143 2 524 4 545
AAB19708.1 1.15e-125 31 412 32 434
ADU21113.1 7.19e-102 1 552 1 560

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6XSU_A 2.44e-130 28 380 3 353
GH5-4broad specificity endoglucanase from Ruminococcus flavefaciens [Ruminococcus flavefaciens],6XSU_B GH5-4 broad specificity endoglucanase from Ruminococcus flavefaciens [Ruminococcus flavefaciens]
6Q1I_A 1.00e-81 30 366 6 341
GH5-4broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum],6Q1I_B GH5-4 broad specificity endoglucanase from Clostrdium longisporum [Clostridium longisporum]
6WQP_A 4.04e-75 36 377 14 353
GH5-4broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQP_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis [Ruminococcus champanellensis],6WQV_A GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_B GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_C GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis],6WQV_D GH5-4 broad specificity endoglucanase from Ruminococcus champanellensis with bound cellotriose [Ruminococcus champanellensis]
6PZ7_A 4.85e-74 36 365 7 324
GH5-4broad specificity endoglucanase from Clostridium acetobutylicum [Clostridium acetobutylicum ATCC 824]
4W85_A 2.12e-73 40 375 7 334
Crystalstructure of XEG5A, a GH5 xyloglucan-specific endo-beta-1,4-glucanase from ruminal metagenomic library, in complex with glucose [uncultured bacterium],4W85_B Crystal structure of XEG5A, a GH5 xyloglucan-specific endo-beta-1,4-glucanase from ruminal metagenomic library, in complex with glucose [uncultured bacterium],4W87_A Crystal structure of XEG5A, a GH5 xyloglucan-specific endo-beta-1,4-glucanase from metagenomic library, in complex with a xyloglucan oligosaccharide [uncultured bacterium],4W87_B Crystal structure of XEG5A, a GH5 xyloglucan-specific endo-beta-1,4-glucanase from metagenomic library, in complex with a xyloglucan oligosaccharide [uncultured bacterium],4W89_A Crystal structure of XEG5A, a GH5 xyloglucan-specific endo-beta-1,4-glucanase from metagenomic library, in complex with cellotriose [uncultured bacterium],4W89_B Crystal structure of XEG5A, a GH5 xyloglucan-specific endo-beta-1,4-glucanase from metagenomic library, in complex with cellotriose [uncultured bacterium]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P54937 6.51e-80 30 366 31 366
Endoglucanase A OS=Clostridium longisporum OX=1523 GN=celA PE=1 SV=1
P23660 3.10e-78 36 377 25 360
Endoglucanase A OS=Ruminococcus albus OX=1264 GN=celA PE=1 SV=1
P28623 9.81e-71 38 365 43 359
Endoglucanase D OS=Clostridium cellulovorans (strain ATCC 35296 / DSM 3052 / OCM 3 / 743B) OX=573061 GN=engD PE=1 SV=2
Q12647 1.85e-70 29 361 16 346
Endoglucanase B OS=Neocallimastix patriciarum OX=4758 GN=CELB PE=2 SV=1
P23661 5.00e-70 35 366 68 388
Endoglucanase B OS=Ruminococcus albus OX=1264 GN=celB PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.001066 0.998079 0.000204 0.000271 0.000192 0.000172

TMHMM  Annotations      download full data without filtering help

start end
7 29