logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003618_01381

You are here: Home > Sequence: MGYG000003618_01381

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Muribaculaceae; CAG-873;
CAZyme ID MGYG000003618_01381
CAZy Family GH9
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
599 66436.43 4.5182
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003618 1816095 MAG Fiji Oceania
Gene Location Start: 1960;  End: 3759  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000003618_01381.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH9 124 590 1.3e-73 0.9736842105263158

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 3.33e-63 124 590 7 374
Glycosyl hydrolase family 9.
pfam02927 CelD_N 1.32e-09 23 95 4 78
Cellulase N-terminal ig-like domain.
PLN00119 PLN00119 9.50e-08 124 388 40 277
endoglucanase
PLN02613 PLN02613 2.46e-07 118 374 30 252
endoglucanase
cd02850 E_set_Cellulase_N 5.76e-07 21 93 1 74
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
BAR47645.1 3.48e-214 1 593 1 587
BAR50387.1 1.40e-213 1 593 1 587
AEW22711.1 6.99e-213 1 593 8 594
CEA15951.1 2.83e-209 22 593 24 588
SCD20825.1 6.40e-209 18 593 27 597

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
3X17_A 5.22e-28 79 589 85 552
Crystalstructure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium],3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
5U2O_A 2.37e-20 82 592 54 538
Crystalstructure of Zn-binding triple mutant of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]
5U0H_A 1.20e-17 155 592 131 538
Crystalstructure of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]
1JS4_A 1.69e-17 124 593 14 440
EndoEXOCELLULASE:CELLOBIOSEFROM THERMOMONOSPORA [Thermobifida fusca],1JS4_B EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA [Thermobifida fusca],1TF4_A EndoEXOCELLULASE FROM THERMOMONOSPORA [Thermobifida fusca],1TF4_B EndoEXOCELLULASE FROM THERMOMONOSPORA [Thermobifida fusca],3TF4_A EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA [Thermobifida fusca],3TF4_B EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA [Thermobifida fusca],4TF4_A EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA [Thermobifida fusca],4TF4_B EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA [Thermobifida fusca]
1UT9_A 8.65e-16 161 509 169 531
ChainA, CELLULOSE 1,4-BETA-CELLOBIOSIDASE [Acetivibrio thermocellus]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P14090 3.68e-17 22 589 341 903
Endoglucanase C OS=Cellulomonas fimi (strain ATCC 484 / DSM 20113 / JCM 1341 / NBRC 15513 / NCIMB 8980 / NCTC 7547) OX=590998 GN=cenC PE=1 SV=2
P26221 1.31e-16 124 593 60 486
Endoglucanase E-4 OS=Thermobifida fusca OX=2021 GN=celD PE=1 SV=2
P0C2S1 7.81e-14 161 509 376 738
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus OX=1515 GN=celK PE=1 SV=1
A3DCH1 1.36e-13 161 509 376 738
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celK PE=3 SV=1
P28622 5.70e-13 104 587 20 457
Endoglucanase 4 OS=Bacillus sp. (strain KSM-522) OX=120046 PE=3 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000301 0.998894 0.000298 0.000154 0.000158 0.000148

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000003618_01381.