logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003564_01511

You are here: Home > Sequence: MGYG000003564_01511

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species UBA1232 sp004561775
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; UBA932; UBA1232; UBA1232 sp004561775
CAZyme ID MGYG000003564_01511
CAZy Family GH9
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
876 MGYG000003564_312|CGC1 99188.61 5.6187
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003564 1730489 MAG Fiji Oceania
Gene Location Start: 1867;  End: 4497  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000003564_01511.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH9 155 626 1.5e-73 0.9952153110047847

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 1.87e-63 163 623 7 372
Glycosyl hydrolase family 9.
cd10917 CE4_NodB_like_6s_7s 4.42e-26 661 861 1 171
Catalytic NodB homology domain of rhizobial NodB-like proteins. This family belongs to the large and functionally diverse carbohydrate esterase 4 (CE4) superfamily, whose members show strong sequence similarity with some variability due to their distinct carbohydrate substrates. It includes many rhizobial NodB chitooligosaccharide N-deacetylase (EC 3.5.1.-)-like proteins, mainly from bacteria and eukaryotes, such as chitin deacetylases (EC 3.5.1.41), bacterial peptidoglycan N-acetylglucosamine deacetylases (EC 3.5.1.-), and acetylxylan esterases (EC 3.1.1.72), which catalyze the N- or O-deacetylation of substrates such as acetylated chitin, peptidoglycan, and acetylated xylan. All members of this family contain a catalytic NodB homology domain with the same overall topology and a deformed (beta/alpha)8 barrel fold with 6- or 7 strands. Their catalytic activity is dependent on the presence of a divalent cation, preferably cobalt or zinc, and they employ a conserved His-His-Asp zinc-binding triad closely associated with the conserved catalytic base (aspartic acid) and acid (histidine) to carry out acid/base catalysis. Several family members show diversity both in metal ion specificities and in the residues that coordinate the metal.
cd10950 CE4_BsYlxY_like 9.77e-22 657 873 2 187
Putative catalytic NodB homology domain of uncharacterized protein YlxY from Bacillus subtilis and its bacterial homologs. The Bacillus subtilis genome contains six polysaccharide deacetylase gene homologs: pdaA, pdaB (previously known as ybaN), yheN, yjeA, yxkH and ylxY. This family is represented by Bacillus subtilis putative polysaccharide deacetylase BsYlxY, encoded by the ylxY gene, which is a member of the carbohydrate esterase 4 (CE4) superfamily. Although its biological function still remains unknown, BsYlxY shows high sequence homology to the catalytic domain of Bacillus subtilis pdaB gene encoding a putative polysaccharide deacetylase (BsPdaB), which is essential for the maintenance of spores after the late stage of sporulation and is highly conserved in spore-forming bacteria. However, disruption of the ylxY gene in B. subtilis did not cause any sporulation defect. Moreover, the Asp residue in the classical His-His-Asp zinc-binding motif of CE4 esterases is mutated to a Val residue in this family. Other catalytically relevant residues of CE4 esterases are also not conserved, which suggest that members of this family may be inactive.
cd02850 E_set_Cellulase_N 5.11e-21 59 150 1 86
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
pfam02927 CelD_N 1.24e-19 58 145 1 83
Cellulase N-terminal ig-like domain.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AGY53377.1 0.0 56 870 22 831
QIK60909.1 0.0 57 867 23 823
QIK55492.1 0.0 56 867 22 823
BBE19991.1 3.62e-306 65 865 1 787
AHW60805.1 6.35e-303 60 870 24 815

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
3X17_A 1.33e-33 60 626 18 554
Crystalstructure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium],3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
4CJ0_A 2.12e-26 58 552 28 492
ChainA, ENDOGLUCANASE D [Acetivibrio thermocellus],4CJ1_A Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus]
1CLC_A 2.27e-26 58 552 42 506
ChainA, ENDOGLUCANASE CELD; EC: 3.2.1.4 [Acetivibrio thermocellus]
5U2O_A 5.00e-26 68 623 2 534
Crystalstructure of Zn-binding triple mutant of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]
5U0H_A 8.07e-23 68 623 2 534
Crystalstructure of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P0C2S4 1.16e-25 58 552 28 492
Endoglucanase D (Fragment) OS=Acetivibrio thermocellus OX=1515 GN=celD PE=1 SV=1
A3DDN1 1.30e-25 58 552 52 516
Endoglucanase D OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celD PE=1 SV=1
P23658 8.19e-20 60 623 4 540
Cellodextrinase OS=Butyrivibrio fibrisolvens OX=831 GN=ced1 PE=1 SV=1
A7LXT3 1.21e-18 60 621 32 571
Xyloglucan-specific endo-beta-1,4-glucanase BoGH9A OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / BCRC 10623 / CCUG 4943 / NCTC 11153) OX=411476 GN=BACOVA_02649 PE=1 SV=1
P14090 9.32e-18 60 623 341 902
Endoglucanase C OS=Cellulomonas fimi (strain ATCC 484 / DSM 20113 / JCM 1341 / NBRC 15513 / NCIMB 8980 / NCTC 7547) OX=590998 GN=cenC PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000297 0.999052 0.000162 0.000167 0.000153 0.000143

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000003564_01511.