logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003114_01976

You are here: Home > Sequence: MGYG000003114_01976

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Clostridium sp900547475
Lineage Bacteria; Firmicutes_A; Clostridia; Clostridiales; Clostridiaceae; Clostridium; Clostridium sp900547475
CAZyme ID MGYG000003114_01976
CAZy Family GH101
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1917 MGYG000003114_22|CGC1 213102.8 4.4315
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003114 3467094 MAG United States North America
Gene Location Start: 9695;  End: 15448  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.97

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH101 636 1126 4e-114 0.7454031117397454
CBM32 1204 1325 8.3e-25 0.9354838709677419

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam12905 Glyco_hydro_101 1.11e-103 707 974 1 273
Endo-alpha-N-acetylgalactosaminidase. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by the S. pneumoniae protein Endo-alpha-N-acetylgalactosaminidase, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor.
cd14244 GH_101_like 7.44e-98 722 999 4 298
Endo-a-N-acetylgalactosaminidase and related glcyosyl hydrolases. This family contains the enzymatically active domain of cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins (EC:3.2.1.97). It has been classified as glycosyl hydrolase family 101 in the Cazy resource. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae and other commensal human bacteria is largely determined by their ability to degrade host glycoproteins and to metabolize the resultant carbohydrates.
pfam18080 Gal_mutarotas_3 3.68e-47 332 532 1 213
Galactose mutarotase-like fold domain. This domain is found in endo-alpha-N-acetylgalactosaminidase present in Streptococcus pneumoniae. Endo-alpha-N-acetylgalactosaminidase is a cell surface-anchored glycoside hydrolase involved in the breakdown of mucin type O-linked glycans. The domain, known as domain 2, exhibits strong structural similarlity to the galactose mutarotase-like fold but lacks the active site residues. Domains, found in a number of glycoside hydrolases, structurally similar to domain 2 confer stability to the multidomain architectures.
cd02133 PA_C5a_like 1.65e-25 573 684 25 140
PA_C5a_like: Protease-associated domain containing proteins like Streptococcus pyogenes C5a peptidase. This group contains various PA domain-containing proteins similar to S. pyogenes C5a, including, i) Vpr, a minor extracellular serine protease from Bacillus subtilis, ii) a large molecular mass collagenolytic protease from Geobacillus collagenovorans MO-1, and iii) PrtS, a cell envelope protease from Streptococcus thermophilus CNRZ 385. Proteins in this group belong to the peptidase S8 family. C5a peptidase is a cell surface serine protease which specifically inactivates C5a [a chemotactic peptide, which attracts polymorphonuclear leukocytes (PMNs)], by cleaving it to release a 7-residue carboxy-terminal fragment which contains the PMN binding site. The significance of the PA domain to these proteins has not been ascertained. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate.
pfam17451 Glyco_hyd_101C 1.93e-25 984 1100 3 111
Glycosyl hydrolase 101 beta sandwich domain. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by a S. pneumoniae protein, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor. This domain represents C-terminal the beta sandwich domain.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AYE33294.1 0.0 8 1631 12 1484
QAS61465.1 0.0 8 1631 12 1484
ATD54766.1 0.0 4 1631 6 1487
QBJ75049.1 0.0 4 1631 6 1487
SLK16230.1 0.0 4 1631 6 1487

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6QEP_A 2.82e-81 332 1712 12 1199
EngBFDARPin Fusion 4b H14 [Bifidobacterium longum]
6QEV_B 2.88e-81 332 1712 12 1199
EngBFDARPin Fusion 4b B6 [Bifidobacterium longum]
6QFK_A 2.88e-81 332 1712 12 1199
EngBFDARPin Fusion 4b G10 [Bifidobacterium longum]
6SH9_B 2.88e-81 332 1712 12 1199
EngBFDARPin Fusion 4b D12 [Bifidobacterium longum subsp. longum JCM 1217]
2ZXQ_A 4.31e-81 317 1161 11 807
Crystalstructure of endo-alpha-N-acetylgalactosaminidase from Bifidobacterium longum (EngBF) [Bifidobacterium longum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A9WNA0 6.20e-84 640 1158 226 745
Putative endo-alpha-N-acetylgalactosaminidase OS=Renibacterium salmoninarum (strain ATCC 33209 / DSM 20767 / JCM 11484 / NBRC 15589 / NCIMB 2235) OX=288705 GN=RSal33209_1326 PE=3 SV=2
Q8DR60 8.29e-69 92 1627 158 1418
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae (strain ATCC BAA-255 / R6) OX=171101 GN=spr0328 PE=1 SV=1
Q2MGH6 2.47e-68 92 1627 158 1418
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4) OX=170187 GN=SP_0368 PE=1 SV=1
P29767 4.72e-21 1196 1369 50 233
Sialidase OS=Clostridium septicum OX=1504 PE=3 SV=1
Q0TR53 9.37e-18 1193 1358 620 808
O-GlcNAcase NagJ OS=Clostridium perfringens (strain ATCC 13124 / DSM 756 / JCM 1290 / NCIMB 6125 / NCTC 8237 / Type A) OX=195103 GN=nagJ PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000366 0.998921 0.000177 0.000194 0.000166 0.000151

TMHMM  Annotations      download full data without filtering help

start end
13 35
1887 1909