You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000000072_00046



Basic Information

Species UBA1394 sp900066845
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; UBA1394; UBA1394 sp900066845
CAZyme ID MGYG000000072_00046
CAZy Family CBM4
CAZyme Description Cellulose 1,4-beta-cellobiosidase
CAZyme Property
Protein Length 888
CGC (none listed)
Molecular Weight 97886.12 Da
Isoelectric Point 4.2981
Genome Property
Genome Assembly ID MGYG000000072
Genome Size 2648835 bp
Genome Type Isolate
Country United Kingdom
Continent Europe
Gene Location Start: 43108;  End: 45774  Strand: +
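
As a quick consistency check, the gene coordinates can be compared with the reported protein length. The sketch below assumes the coordinates are 1-based and inclusive, and that the open reading frame ends in a single untranslated stop codon; neither assumption is stated on this page.

```python
# Values copied from this record.
start, end = 43108, 45774
protein_length = 888

# Assumption: coordinates are 1-based and inclusive.
gene_length_nt = end - start + 1
codons, remainder = divmod(gene_length_nt, 3)

# Assumption: the final codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print(gene_length_nt, codons, remainder, expected_protein_length)
```

Under these assumptions the gene spans 2667 nt, which is 889 codons with no remainder, consistent with an 888-residue protein plus a stop codon.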

Enzyme Prediction

EC 3.2.1.4 3.2.1.78

CAZyme Signature Domains

Family Start End E-value Family coverage
GH9 309 795 1.4e-122 0.9952153110047847
CBM4 40 181 1.9e-28 0.9761904761904762
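
The domain table can be turned into an N- to C-terminal layout with a few lines of code. This is a minimal sketch, assuming the Start and End columns are 1-based inclusive residue positions; the function name `domain_layout` is hypothetical and not part of any database tooling.

```python
# (family, start, end) copied from the signature-domain table above.
domains = [("GH9", 309, 795), ("CBM4", 40, 181)]
protein_length = 888

def domain_layout(domains, protein_length):
    """Order domains from N- to C-terminus and report their size.

    Returns tuples of (family, start, end, residues, fraction_of_protein).
    Assumes inclusive coordinates.
    """
    ordered = sorted(domains, key=lambda d: d[1])
    layout = []
    for family, start, end in ordered:
        residues = end - start + 1
        layout.append((family, start, end, residues,
                       round(residues / protein_length, 3)))
    return layout

layout = domain_layout(domains, protein_length)
for row in layout:
    print(row)
```

With these coordinates the CBM4 module (142 residues) precedes the GH9 catalytic domain (487 residues). The fraction computed here is relative to the full protein and is a different quantity from the "Family coverage" column in the table.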

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 3.22e-99 312 793 2 373
Glycosyl hydrolase family 9.
cd02850 E_set_Cellulase_N 2.05e-29 219 303 2 86
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
pfam02927 CelD_N 7.63e-25 217 298 1 83
Cellulase N-terminal ig-like domain.
PLN02266 PLN02266 2.32e-13 363 802 83 508
endoglucanase
PLN02613 PLN02613 5.56e-13 363 750 63 450
endoglucanase

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
ADU21576.1 0.0 15 870 9 885
CBL17684.1 5.90e-271 1 800 1 861
QTL97152.1 1.44e-262 37 822 38 829
CUH93326.1 7.09e-260 29 800 35 793
QNU67291.1 4.58e-258 33 800 36 791

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
1UT9_A 7.85e-183 218 797 6 604
Chain A, CELLULOSE 1,4-BETA-CELLOBIOSIDASE [Acetivibrio thermocellus]
1RQ5_A 6.44e-182 218 800 6 607
Chain A, Cellobiohydrolase [Acetivibrio thermocellus]
3X17_A 1.43e-70 217 798 16 557
Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium], 3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
6DHT_A 2.74e-66 219 797 18 564
Bacteroides ovatus GH9 Bacova_02649 [Bacteroides ovatus ATCC 8483]
4CJ0_A 2.97e-61 217 801 28 551
Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus], 4CJ1_A Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A3DCH1 1.20e-210 40 797 43 811
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celK PE=3 SV=1
P0C2S1 2.39e-210 40 797 43 811
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus OX=1515 GN=celK PE=1 SV=1
Q05156 7.27e-163 19 800 10 745
Cellulase 1 OS=Streptomyces reticuli OX=1926 GN=cel1 PE=1 SV=1
P14090 3.09e-158 20 800 158 910
Endoglucanase C OS=Cellulomonas fimi (strain ATCC 484 / DSM 20113 / JCM 1341 / NBRC 15513 / NCIMB 8980 / NCTC 7547) OX=590998 GN=cenC PE=1 SV=2
P10476 1.69e-128 222 799 40 600
Endoglucanase A OS=Cellvibrio japonicus (strain Ueda107) OX=498211 GN=celA PE=3 SV=2

SignalP and Lipop Annotations

This protein is predicted to have a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.001339 0.997590 0.000252 0.000410 0.000206 0.000190
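
The class scores above can be read programmatically by pairing each label with its value and taking the maximum. This is only an illustrative sketch: it assumes the reported call corresponds to the highest-scoring class, which is not necessarily how the prediction tool itself decides.

```python
# Labels and scores copied from the table above.
labels = ["Other", "SP_Sec_SPI", "LIPO_Sec_SPII",
          "TAT_Tat_SPI", "TATLIP_Sec_SPII", "PILIN_Sec_SPIII"]
scores = [0.001339, 0.997590, 0.000252, 0.000410, 0.000206, 0.000190]

# Assumption: the predicted class is simply the one with the largest score.
best_label, best_score = max(zip(labels, scores), key=lambda pair: pair[1])

# The scores should sum to roughly 1 if they are class probabilities.
total = sum(scores)

print(best_label, best_score, round(total, 6))
```

Here the top class is SP_Sec_SPI, which agrees with the "SP" call reported for this protein.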

TMHMM Annotations

start end
12 34
855 877
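
One simple way to sanity-check the predicted transmembrane segments is to compare them with the signature-domain coordinates listed earlier in this record. The sketch below assumes all coordinates are 1-based and inclusive; the helper `overlaps` is hypothetical.

```python
# Coordinates copied from this record (assumed 1-based, inclusive).
tm_segments = [(12, 34), (855, 877)]
domains = {"CBM4": (40, 181), "GH9": (309, 795)}

def overlaps(a, b):
    """Return True if two inclusive intervals share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]

# Length of each predicted transmembrane segment in residues.
tm_lengths = [end - start + 1 for start, end in tm_segments]

# Any (segment, family) pairs where a segment falls inside a domain.
clashes = [(seg, family)
           for seg in tm_segments
           for family, span in domains.items()
           if overlaps(seg, span)]

print(tm_lengths, clashes)
```

Both segments are 23 residues long and neither overlaps the CBM4 or GH9 domain; one lies near the N-terminus and the other near the C-terminus of the 888-residue protein.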