You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000003645_00473

Basic Information

Species CAG-831 sp900771335
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; UBA932; CAG-831; CAG-831 sp900771335
CAZyme ID MGYG000003645_00473
CAZy Family GH9
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 827
CGC (no value listed)
Molecular Weight 93304.82 Da
Isoelectric Point 5.1585
Genome Property
Genome Assembly ID MGYG000003645
Genome Size 1892545 bp
Genome Type MAG
Country Fiji
Continent Oceania
Gene Location Start: 5406;  End: 7889  Strand: -
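
The gene coordinates and the reported protein length can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene span includes the stop codon; neither convention is stated in the record, so treat both as assumptions.

```python
# Values copied from the record above.
start, end = 5406, 7889
reported_protein_length = 827

# Assumption: coordinates are 1-based and inclusive,
# and the gene span includes the stop codon.
gene_length_nt = end - start + 1
codons, remainder = divmod(gene_length_nt, 3)

# Drop the stop codon to get the number of amino acids.
predicted_protein_length = codons - 1

print(gene_length_nt, remainder, predicted_protein_length)
```

Under these assumptions the span is a whole number of codons and the predicted length agrees with the reported protein length of 827.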

Enzyme Prediction

No EC number was predicted for MGYG000003645_00473.

CAZyme Signature Domains

Family Start End E-Value Family Coverage
GH9 112 581 1.8e-70 0.9880382775119617

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00759 Glyco_hydro_9 4.15e-57 120 581 7 372
Glycosyl hydrolase family 9.
cd02850 E_set_Cellulase_N 4.50e-25 22 107 1 86
N-terminal Early set domain associated with the catalytic domain of cellulase. E or "early" set domains are associated with the catalytic domain of cellulases at the N-terminal end. Cellulases are O-glycosyl hydrolases (GHs) that hydrolyze beta 1-4 glucosidic bonds in cellulose. They are usually categorized into either exoglucanases, which sequentially release terminal sugar units from the cellulose chain, or endoglucanases, which also attack the chain internally. The N-terminal domain of cellulase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others.
pfam02927 CelD_N 5.56e-25 21 102 1 83
Cellulase N-terminal ig-like domain.
cd10917 CE4_NodB_like_6s_7s 2.60e-24 626 814 7 171
Catalytic NodB homology domain of rhizobial NodB-like proteins. This family belongs to the large and functionally diverse carbohydrate esterase 4 (CE4) superfamily, whose members show strong sequence similarity with some variability due to their distinct carbohydrate substrates. It includes many rhizobial NodB chitooligosaccharide N-deacetylase (EC 3.5.1.-)-like proteins, mainly from bacteria and eukaryotes, such as chitin deacetylases (EC, bacterial peptidoglycan N-acetylglucosamine deacetylases (EC 3.5.1.-), and acetylxylan esterases (EC, which catalyze the N- or O-deacetylation of substrates such as acetylated chitin, peptidoglycan, and acetylated xylan. All members of this family contain a catalytic NodB homology domain with the same overall topology and a deformed (beta/alpha)8 barrel fold with 6- or 7 strands. Their catalytic activity is dependent on the presence of a divalent cation, preferably cobalt or zinc, and they employ a conserved His-His-Asp zinc-binding triad closely associated with the conserved catalytic base (aspartic acid) and acid (histidine) to carry out acid/base catalysis. Several family members show diversity both in metal ion specificities and in the residues that coordinate the metal.
cd10944 CE4_SmPgdA_like 1.04e-21 614 822 1 189
Catalytic NodB homology domain of Streptococcus mutans polysaccharide deacetylase PgdA, Bacillus subtilis YheN, and similar proteins. This family is represented by a putative polysaccharide deacetylase PgdA from the oral pathogen Streptococcus mutans (SmPgdA) and Bacillus subtilis YheN (BsYheN), which are members of the carbohydrate esterase 4 (CE4) superfamily. SmPgdA is an extracellular metal-dependent polysaccharide deacetylase with a typical CE4 fold, with metal bound to a His-His-Asp triad. It possesses de-N-acetylase activity toward a hexamer of chitooligosaccharide N-acetylglucosamine, but not shorter chitooligosaccharides or a synthetic peptidoglycan tetrasaccharide. SmPgdA plays a role in tuning cell surface properties and in interactions with (salivary) agglutinin, an essential component of the innate immune system, most likely through deacetylation of an as-yet-unidentified polysaccharide. SmPgdA shows significant homology to the catalytic domains of peptidoglycan deacetylases from Streptococcus pneumoniae (SpPgdA) and Listeria monocytogenes (LmPgdA), both of which are involved in the bacterial defense mechanism against human mucosal lysozyme. The Bacillus subtilis genome contains six polysaccharide deacetylase gene homologs: pdaA, pdaB (previously known as ybaN), yheN, yjeA, yxkH and ylxY. The biological function of BsYheN is still unknown. This family also includes many uncharacterized polysaccharide deacetylases mainly found in bacteria.
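
The CDD hits above overlap in places, so the domain architecture is easier to read once overlapping hits are grouped by query position. The following is a minimal sketch that uses only the query coordinates from the table; the function name `group_overlapping` is hypothetical and the grouping rule (merge any hits whose query ranges overlap) is an illustrative choice, not part of the database's pipeline.

```python
# Domain hits copied from the CDD table above: (domain, qStart, qEnd).
hits = [
    ("Glyco_hydro_9", 120, 581),
    ("E_set_Cellulase_N", 22, 107),
    ("CelD_N", 21, 102),
    ("CE4_NodB_like_6s_7s", 626, 814),
    ("CE4_SmPgdA_like", 614, 822),
]


def group_overlapping(hits):
    """Merge hits whose query ranges overlap into (start, end, names) regions."""
    regions = []
    for name, start, end in sorted(hits, key=lambda h: h[1]):
        if regions and start <= regions[-1][1]:
            # Overlaps the previous region: extend it and record the name.
            prev_start, prev_end, names = regions[-1]
            regions[-1] = (prev_start, max(prev_end, end), names + [name])
        else:
            regions.append((start, end, [name]))
    return regions


regions = group_overlapping(hits)
for start, end, names in regions:
    print(f"{start}-{end}: {', '.join(names)}")
```

With these coordinates the hits fall into three non-overlapping regions: an N-terminal Ig-like region, the GH9 catalytic region, and a C-terminal CE4 (NodB homology) region.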

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
AGY53377.1 0.0 1 820 1 828
QIK60909.1 0.0 20 823 23 826
QIK55492.1 0.0 19 823 22 826
AHW60805.1 0.0 4 820 6 812
QIA07887.1 0.0 2 820 4 813

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
5U2O_A 5.59e-35 31 591 2 545
Crystal structure of Zn-binding triple mutant of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]
3X17_A 1.12e-34 23 581 18 551
Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium], 3X17_B Crystal structure of metagenome-derived glycoside hydrolase family 9 endoglucanase [uncultured bacterium]
5U0H_A 1.46e-31 31 591 2 545
Crystal structure of GH family 9 endoglucanase J30 [Thermobacillus composti KWC4]
6DHT_A 5.57e-25 23 509 18 497
Bacteroides ovatus GH9 Bacova_02649 [Bacteroides ovatus ATCC 8483]
4CJ0_A 3.82e-23 20 497 27 482
Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus], 4CJ1_A Chain A, ENDOGLUCANASE D [Acetivibrio thermocellus]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P23658 6.33e-25 23 330 4 285
Cellodextrinase OS=Butyrivibrio fibrisolvens OX=831 GN=ced1 PE=1 SV=1
A7LXT3 3.38e-24 23 509 32 511
Xyloglucan-specific endo-beta-1,4-glucanase BoGH9A OS=Bacteroides ovatus (strain ATCC 8483 / DSM 1896 / JCM 5824 / BCRC 10623 / CCUG 4943 / NCTC 11153) OX=411476 GN=BACOVA_02649 PE=1 SV=1
P14090 6.94e-24 23 592 341 913
Endoglucanase C OS=Cellulomonas fimi (strain ATCC 484 / DSM 20113 / JCM 1341 / NBRC 15513 / NCIMB 8980 / NCTC 7547) OX=590998 GN=cenC PE=1 SV=2
P0C2S4 2.09e-22 20 497 27 482
Endoglucanase D (Fragment) OS=Acetivibrio thermocellus OX=1515 GN=celD PE=1 SV=1
A3DDN1 2.28e-22 20 497 51 506
Endoglucanase D OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celD PE=1 SV=1

SignalP and LipoP Annotations

This protein is predicted to have a signal peptide (SP).

0.001401 0.997662 0.000376 0.000187 0.000174 0.000195

TMHMM Annotations

There are no transmembrane helices in MGYG000003645_00473.