You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000003546_00169



Basic Information

Species UBA1181 sp900769555
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Paludibacteraceae; UBA1181; UBA1181 sp900769555
CAZyme ID MGYG000003546_00169
CAZy Family GH9
CAZyme Description Cellulose 1,4-beta-cellobiosidase
CAZyme Property
Protein Length 720 aa
CGC (no value listed)
Molecular Weight 81031.08 Da
Isoelectric Point 4.4603
Genome Property
Genome Assembly ID MGYG000003546
Genome Size 3032937 bp
Genome Type MAG
Country Fiji
Continent Oceania
Gene Location Start: 258; End: 2420; Strand: +
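The gene coordinates and the protein length can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the span includes the stop codon; neither convention is stated in the record, and the function name is illustrative.

```python
# Values copied from the record above.
gene_start = 258
gene_end = 2420
protein_length = 720


def coding_length_nt(start: int, end: int) -> int:
    """Gene length in nucleotides, assuming 1-based inclusive coordinates."""
    return end - start + 1


nt_len = coding_length_nt(gene_start, gene_end)
codons, remainder = divmod(nt_len, 3)

# Assumption: the last codon in the span is the stop codon,
# so the translated protein is one residue shorter than the codon count.
expected_aa = codons - 1

print(nt_len, codons, remainder, expected_aa == protein_length)
# prints: 2163 721 0 True
```

Under these assumptions the 2163-nt span is an exact multiple of three and matches the listed 720-residue protein.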

Enzyme Prediction

EC 3.2.1.4

CAZyme Signature Domains

Family Start End E-Value Family Coverage
GH9 133 628 3.7e-66 0.9952153110047847

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 4.67e-42 135 561 1 345
Glycosyl hydrolase family 9.
cd02850 E_set_Cellulase_N 3.04e-22 29 128 2 86
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
pfam02927 CelD_N 4.27e-18 28 114 2 76
Cellulase N-terminal ig-like domain.
pfam18962 Por_Secre_tail 1.23e-09 652 715 1 69
Secretion system C-terminal sorting domain. Species that include Porphyromonas gingivalis, Fibrobacter succinogenes, Flavobacterium johnsoniae, Cytophaga hutchinsonii, Gramella forsetii, Prevotella intermedia, and Salinibacter ruber have on average twenty or more copies of this C-terminal domain, associated with sorting to the outer membrane and covalent modification. This domain targets proteins to type IX secretion systems; the protein is secreted and the domain is then cleaved off by a C-terminal signal peptidase. Based on similarity to other families, it is likely that this domain adopts an immunoglobulin-like fold.
TIGR04183 Por_Secre_tail 5.94e-08 652 715 1 68
Por secretion system C-terminal sorting domain. Species that include Porphyromonas gingivalis, Fibrobacter succinogenes, Flavobacterium johnsoniae, Cytophaga hutchinsonii, Gramella forsetii, Prevotella intermedia, and Salinibacter ruber average twenty or more copies of a C-terminal domain, represented by this model, associated with sorting to the outer membrane and covalent modification.
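The CDD rows above are listed by E-value rather than by position, which makes the domain layout hard to read. A minimal sketch that orders the hits along the query and merges overlapping query ranges is shown below; the `DomainHit` type and `merge_overlaps` helper are hypothetical names introduced for illustration, and merging on any overlap is an assumed rule, not the database's own method.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    model: str
    name: str
    q_start: int
    q_end: int


# qStart/qEnd values copied from the CDD Domains table above.
hits = [
    DomainHit("pfam00759", "Glyco_hydro_9", 135, 561),
    DomainHit("cd02850", "E_set_Cellulase_N", 29, 128),
    DomainHit("pfam02927", "CelD_N", 28, 114),
    DomainHit("pfam18962", "Por_Secre_tail", 652, 715),
    DomainHit("TIGR04183", "Por_Secre_tail", 652, 715),
]


def merge_overlaps(domain_hits):
    """Group hits whose query ranges overlap, ordered from N- to C-terminus."""
    groups = []
    for hit in sorted(domain_hits, key=lambda h: (h.q_start, h.q_end)):
        if groups and hit.q_start <= groups[-1]["end"]:
            # Overlaps the previous region: extend it and record the name.
            groups[-1]["end"] = max(groups[-1]["end"], hit.q_end)
            groups[-1]["names"].append(hit.name)
        else:
            groups.append(
                {"start": hit.q_start, "end": hit.q_end, "names": [hit.name]}
            )
    return groups


regions = merge_overlaps(hits)
for region in regions:
    # dict.fromkeys drops duplicate names while keeping their order.
    names = ", ".join(dict.fromkeys(region["names"]))
    print(f'{region["start"]}-{region["end"]}: {names}')
```

With the values in this record, the hits collapse into three regions: an N-terminal Ig-like region (28-128), the GH9 catalytic domain (135-561), and the C-terminal sorting domain (652-715).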

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
ABG58065.1 3.15e-211 1 709 14 728
AGP39523.1 1.45e-203 24 631 63 673
QCX37653.1 4.49e-201 3 705 2 697
ABD79822.1 5.27e-196 29 634 54 662
AJQ92180.1 9.77e-192 29 631 72 678

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
3X17_A 1.68e-38 29 632 18 558
Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium], 3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
5U0H_A 4.40e-30 65 630 19 539
Crystal structure of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]
1RQ5_A 1.30e-28 90 605 54 584
Chain A, Cellobiohydrolase [Acetivibrio thermocellus]
1UT9_A 1.30e-28 90 605 54 584
Chain A, CELLULOSE 1,4-BETA-CELLOBIOSIDASE [Acetivibrio thermocellus]
5U2O_A 2.50e-28 65 630 19 539
Crystal structure of Zn-binding triple mutant of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A3DCH1 8.48e-32 29 605 214 791
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celK PE=3 SV=1
P0C2S1 1.13e-31 29 605 214 791
Cellulose 1,4-beta-cellobiosidase OS=Acetivibrio thermocellus OX=1515 GN=celK PE=1 SV=1
P23658 1.97e-27 29 561 4 496
Cellodextrinase OS=Butyrivibrio fibrisolvens OX=831 GN=ced1 PE=1 SV=1
P0C2S4 4.21e-26 24 559 25 512
Endoglucanase D (Fragment) OS=Acetivibrio thermocellus OX=1515 GN=celD PE=1 SV=1
A3DDN1 4.67e-26 24 559 49 536
Endoglucanase D OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celD PE=1 SV=1

SignalP and Lipop Annotations

This protein is predicted to carry a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000354 0.998841 0.000245 0.000179 0.000177 0.000172
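The SP call can be read straight off the probability row. The sketch below pairs each class label with its score and picks the highest; choosing the class by maximum probability is an assumption made for illustration and may not be exactly how the prediction tool makes its call.

```python
# Header and values copied from the table above.
header = "Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII".split()
values = [float(v) for v in "0.000354 0.998841 0.000245 0.000179 0.000177 0.000172".split()]

scores = dict(zip(header, values))

# Assumed decision rule: take the class with the highest probability.
best_label = max(scores, key=scores.get)
total = sum(values)

print(best_label, scores[best_label])
# prints: SP_Sec_SPI 0.998841
```

The six scores sum to roughly 1, and the SP_Sec_SPI class (a Sec-translocated signal peptide cleaved by signal peptidase I) accounts for almost all of it, which is consistent with the SP prediction stated above.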

TMHMM Annotations

There are no transmembrane helices in MGYG000003546_00169.