You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000000605_00334

Basic Information

Species: (blank)
Lineage: Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus_C;
CAZyme ID: MGYG000000605_00334
CAZy Family: CBM30
CAZyme Description: hypothetical protein

CAZyme Property
Protein Length: 842
CGC: (blank)
Molecular Weight (Da): 94113.12
Isoelectric Point: 4.3169

Genome Property
Genome Assembly ID: MGYG000000605
Genome Size (bp): 2224694
Genome Type: MAG
Country: Madagascar
Continent: Africa
Gene Location: Start: 1144; End: 3672; Strand: -
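
The gene coordinates and the reported protein length can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon is an untranslated stop codon; neither convention is stated on this page.

```python
# Values copied from the Basic Information block above.
start, end = 1144, 3672
reported_protein_length = 842

# Assumption: coordinates are 1-based and inclusive.
gene_length_nt = end - start + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_nt // 3
expected_residues = codons - 1

print("gene length (nt):", gene_length_nt)
print("remainder mod 3:", gene_length_nt % 3)
print("expected residues:", expected_residues)
print("matches reported length:", expected_residues == reported_protein_length)
```

Under those assumptions the gene span and the 842-residue length agree.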

Enzyme Prediction

No EC number prediction in MGYG000000605_00334.

CAZyme Signature Domains

Family Start End E-Value Family Coverage
GH9 300 771 2.1e-72 0.9952153110047847
CBM30 54 200 6.7e-21 0.8529411764705882

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 2.31e-52 303 769 2 373
Glycosyl hydrolase family 9.
pfam02927 CelD_N 2.31e-20 183 268 1 82
Cellulase N-terminal ig-like domain.
cd02850 E_set_Cellulase_N 8.79e-17 185 267 2 79
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
cd14256 Dockerin_I 2.50e-12 787 839 2 56
Type I dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesin modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I dockerins, which are responsible for anchoring a variety of enzymatic domains to the complex.
PLN03009 PLN03009 2.58e-08 291 764 20 479
cellulase
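
The domain coordinates in the two tables above can be combined into a linear domain architecture. The sketch below is only illustrative: the choice of which hits to keep (GH9 and CBM30 from the signature table, CelD_N and Dockerin_I from the CDD table, dropping the redundant GH9-like CDD hits) is an assumption made here, not something the page specifies.

```python
# (name, start, end) copied from the signature-domain and CDD tables above.
# Assumption: one representative hit per region is kept by hand.
domains = [
    ("GH9", 300, 771),
    ("CBM30", 54, 200),
    ("CelD_N", 183, 268),
    ("Dockerin_I", 787, 839),
]

# Order domains from N-terminus to C-terminus by start coordinate.
ordered = sorted(domains, key=lambda d: d[1])
architecture = " - ".join(name for name, _, _ in ordered)
print(architecture)

# Report overlaps between neighbouring domains (inclusive coordinates).
overlaps = []
for (n1, s1, e1), (n2, s2, e2) in zip(ordered, ordered[1:]):
    shared = e1 - s2 + 1
    if shared > 0:
        overlaps.append((n1, n2, shared))
        print(f"{n1} and {n2} overlap by {shared} residues")
```

The overlap between the CBM30 and CelD_N predictions comes from two different annotation tools assigning slightly different boundaries to adjacent regions.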

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
ADL51370.1 1.31e-94 68 817 101 794
AEV69564.1 3.88e-91 68 808 101 784
AWV33854.1 1.17e-90 40 841 78 839
QNU68902.1 7.79e-89 54 820 89 798
AEY66182.1 2.24e-88 68 814 102 792

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4CJ0_A 8.63e-38 185 810 30 583
Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus], 4CJ1_A Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus]
1CLC_A 9.85e-38 185 810 44 597
Chain A, ENDOGLUCANASE CELD; EC: 3.2.1.4 [Acetivibrio thermocellus]
3X17_A 3.79e-34 173 771 6 554
Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium], 3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
1UT9_A 1.10e-30 183 771 5 602
Chain A, CELLULOSE 1,4-BETA-CELLOBIOSIDASE [Acetivibrio thermocellus]
5U0H_A 1.15e-30 209 769 13 535
Crystal structure of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P23658 2.91e-47 185 774 4 546
Cellodextrinase OS=Butyrivibrio fibrisolvens OX=831 GN=ced1 PE=1 SV=1
P0C2S4 4.73e-37 185 810 30 583
Endoglucanase D (Fragment) OS=Acetivibrio thermocellus OX=1515 GN=celD PE=1 SV=1
A3DCH1 8.76e-37 183 811 212 857
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celK PE=3 SV=1
P0C2S1 4.75e-36 183 811 212 857
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus OX=1515 GN=celK PE=1 SV=1
A3DDN1 6.02e-36 185 810 54 607
Endoglucanase D OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celD PE=1 SV=1

SignalP and Lipop Annotations

This protein is predicted to have a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000297 0.998958 0.000215 0.000208 0.000164 0.000138
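
The SP call is consistent with simply taking the highest-scoring class in the table above. The sketch below assumes that rule (the page does not state how the call is made) and that the six values are probabilities that sum to roughly 1.

```python
# Class scores copied from the table above.
scores = {
    "Other": 0.000297,
    "SP_Sec_SPI": 0.998958,
    "LIPO_Sec_SPII": 0.000215,
    "TAT_Tat_SPI": 0.000208,
    "TATLIP_Sec_SPII": 0.000164,
    "PILIN_Sec_SPIII": 0.000138,
}

# Assumption: the predicted class is the one with the highest score.
best_class = max(scores, key=scores.get)
total = sum(scores.values())

print("top class:", best_class)
print("top score:", scores[best_class])
print("sum of scores:", round(total, 6))
```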

TMHMM Annotations

There are no transmembrane helices in MGYG000000605_00334.