logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004384_00862

You are here: Home > Sequence: MGYG000004384_00862

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus;
CAZyme ID MGYG000004384_00862
CAZy Family CBM4
CAZyme Description Cellulose 1,4-beta-cellobiosidase
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1016 MGYG000004384_41|CGC1 112015.77 4.496
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004384 2222403 MAG Israel Asia
Gene Location Start: 9681;  End: 12731  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4 3.2.1.- 3.2.1.91

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH9 354 913 2.5e-100 0.9880382775119617
CBM4 37 178 1e-20 0.9603174603174603

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 1.82e-53 354 912 1 373
Glycosyl hydrolase family 9.
cd02850 E_set_Cellulase_N 1.06e-23 244 343 2 86
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
pfam02927 CelD_N 3.25e-22 244 335 3 82
Cellulase N-terminal ig-like domain.
pfam02018 CBM_4_9 1.41e-07 39 172 6 123
Carbohydrate binding domain. This family includes diverse carbohydrate binding domains.
cd14256 Dockerin_I 1.19e-05 935 1002 2 57
Type I dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I dockerins, which are responsible for anchoring a variety of enzymatic domains to the complex.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
CBL17684.1 6.33e-246 1 980 2 932
AAR01216.1 1.87e-222 3 963 2 944
CAS03458.1 1.67e-221 1 963 1 948
ADU22927.1 9.02e-219 3 963 6 941
BAJ05815.1 3.96e-217 3 963 6 941

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
1UT9_A 9.61e-123 244 916 7 604
ChainA, CELLULOSE 1,4-BETA-CELLOBIOSIDASE [Acetivibrio thermocellus]
1RQ5_A 9.89e-123 244 922 7 610
ChainA, Cellobiohydrolase [Acetivibrio thermocellus]
3X17_A 3.81e-54 244 914 18 554
Crystalstructure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium],3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
6DHT_A 9.00e-48 244 916 18 564
Bacteroidesovatus GH9 Bacova_02649 [Bacteroides ovatus ATCC 8483]
4CJ0_A 2.33e-39 266 951 36 576
ChainA, ENDOGLUCANASE D [Acetivibrio thermocellus],4CJ1_A Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A3DCH1 1.51e-135 36 955 42 853
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celK PE=3 SV=1
P0C2S1 1.51e-135 36 955 42 853
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus OX=1515 GN=celK PE=1 SV=1
Q05156 2.59e-103 41 919 39 745
Cellulase 1 OS=Streptomyces reticuli OX=1926 GN=cel1 PE=1 SV=1
P14090 1.17e-95 244 921 341 912
Endoglucanase C OS=Cellulomonas fimi (strain ATCC 484 / DSM 20113 / JCM 1341 / NBRC 15513 / NCIMB 8980 / NCTC 7547) OX=590998 GN=cenC PE=1 SV=2
P10476 3.55e-92 246 921 39 603
Endoglucanase A OS=Cellvibrio japonicus (strain Ueda107) OX=498211 GN=celA PE=3 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000371 0.998845 0.000167 0.000245 0.000174 0.000156

TMHMM  Annotations      download full data without filtering help

start end
9 31