logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000959_01590

You are here: Home > Sequence: MGYG000000959_01590

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Ruminococcus sp900752785
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Ruminococcus; Ruminococcus sp900752785
CAZyme ID MGYG000000959_01590
CAZy Family CBM4
CAZyme Description Cellulose 1,4-beta-cellobiosidase
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1019 MGYG000000959_46|CGC1 111791.39 4.4985
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000959 2365658 MAG Denmark Europe
Gene Location Start: 7477;  End: 10536  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4 3.2.1.- 3.2.1.91

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH9 353 922 8.3e-99 0.9880382775119617
CBM4 37 179 3.4e-21 0.9603174603174603

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 7.61e-53 353 922 1 374
Glycosyl hydrolase family 9.
cd02850 E_set_Cellulase_N 2.01e-26 244 342 2 86
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
pfam02927 CelD_N 1.85e-23 244 332 3 80
Cellulase N-terminal ig-like domain.
pfam02018 CBM_4_9 2.80e-07 41 179 8 129
Carbohydrate binding domain. This family includes diverse carbohydrate binding domains.
cd14256 Dockerin_I 3.30e-07 950 1008 2 57
Type I dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I dockerins, which are responsible for anchoring a variety of enzymatic domains to the complex.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
CBL17684.1 1.39e-242 1 987 2 932
CAS03458.1 1.88e-224 1 930 1 907
ADU22927.1 1.64e-223 3 932 6 905
AAR01216.1 5.19e-223 13 930 8 903
BAJ05815.1 7.21e-222 3 932 6 905

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
1UT9_A 2.39e-125 244 925 7 604
ChainA, CELLULOSE 1,4-BETA-CELLOBIOSIDASE [Acetivibrio thermocellus]
1RQ5_A 2.46e-125 244 930 7 609
ChainA, Cellobiohydrolase [Acetivibrio thermocellus]
3X17_A 5.36e-56 244 923 18 554
Crystalstructure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium],3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
6DHT_A 6.52e-56 244 925 18 564
Bacteroidesovatus GH9 Bacova_02649 [Bacteroides ovatus ATCC 8483]
5U0H_A 8.37e-47 283 931 20 546
Crystalstructure of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P0C2S1 1.11e-136 36 966 42 849
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus OX=1515 GN=celK PE=1 SV=1
A3DCH1 2.17e-136 36 966 42 849
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celK PE=3 SV=1
Q05156 5.04e-108 41 928 39 745
Cellulase 1 OS=Streptomyces reticuli OX=1926 GN=cel1 PE=1 SV=1
P10476 4.63e-96 230 929 23 602
Endoglucanase A OS=Cellvibrio japonicus (strain Ueda107) OX=498211 GN=celA PE=3 SV=2
P14090 3.21e-95 244 937 341 918
Endoglucanase C OS=Cellulomonas fimi (strain ATCC 484 / DSM 20113 / JCM 1341 / NBRC 15513 / NCIMB 8980 / NCTC 7547) OX=590998 GN=cenC PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000475 0.998681 0.000181 0.000277 0.000189 0.000165

TMHMM  Annotations      download full data without filtering help

start end
9 31