You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000001810_00780

Basic Information

Species Mediterraneibacter sp002314255
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Mediterraneibacter; Mediterraneibacter sp002314255
CAZyme ID MGYG000001810_00780
CAZy Family GH101
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 2176
CGC MGYG000001810_7|CGC1
Molecular Weight 236200.86
Isoelectric Point 3.8631
Genome Property
Genome Assembly ID MGYG000001810
Genome Size 2943094
Genome Type MAG
Country Denmark
Continent Europe
Gene Location Start: 32031;  End: 38561  Strand: -
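The gene coordinates can be cross-checked against the reported protein length. The sketch below is illustrative only: it assumes the Start and End values are 1-based and inclusive, and that the span includes a terminal stop codon. Neither assumption is stated on this page, and the function name is hypothetical.

```python
def protein_length_from_span(start: int, end: int, includes_stop: bool = True) -> int:
    """Estimate protein length (aa) from gene coordinates.

    Assumes 1-based inclusive coordinates (an assumption, not stated in the record).
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    codons = nucleotides // 3
    # Drop the stop codon if the span is assumed to include it.
    return codons - 1 if includes_stop else codons


# Coordinates from the Gene Location line: Start 32031, End 38561.
print(protein_length_from_span(32031, 38561))  # 2176
```

Under these assumptions the span of 6531 nucleotides gives 2177 codons, or 2176 residues once the stop codon is excluded, which agrees with the Protein Length value above.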

Enzyme Prediction

EC 3.2.1.97

CAZyme Signature Domains

Family Start End Evalue family coverage
GH101 534 1244 2.5e-227 0.9957567185289957

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd14244 GH_101_like 2.26e-120 787 1113 3 298
Endo-alpha-N-acetylgalactosaminidase and related glycosyl hydrolases. This family contains the enzymatically active domain of cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins (EC:3.2.1.97). It has been classified as glycosyl hydrolase family 101 in the CAZy resource. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae and other commensal human bacteria is largely determined by their ability to degrade host glycoproteins and to metabolize the resultant carbohydrates.
pfam12905 Glyco_hydro_101 4.05e-120 773 1090 2 273
Endo-alpha-N-acetylgalactosaminidase. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by the S. pneumoniae protein Endo-alpha-N-acetylgalactosaminidase, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor.
pfam17974 GalBD_like 4.37e-62 1403 1583 1 187
Galactose-binding domain-like. Proteins containing a galactose-binding domain-like fold can be found in several different protein families, in both eukaryotes and prokaryotes. The common function of these domains is to bind to specific ligands, such as cell-surface-attached carbohydrate substrates for galactose oxidase and sialidase, phospholipids on the outer side of the mammalian cell membrane for coagulation factor Va, membrane-anchored ephrin for the Eph family of receptor tyrosine kinases, and a complex of broken single-stranded DNA and DNA polymerase beta for XRCC1. The structure of the galactose-binding domain-like members consists of a beta-sandwich, in which the strands making up the sheets exhibit a jellyroll fold.
pfam18080 Gal_mutarotas_3 7.79e-55 531 771 1 243
Galactose mutarotase-like fold domain. This domain is found in endo-alpha-N-acetylgalactosaminidase present in Streptococcus pneumoniae. Endo-alpha-N-acetylgalactosaminidase is a cell surface-anchored glycoside hydrolase involved in the breakdown of mucin type O-linked glycans. The domain, known as domain 2, exhibits strong structural similarity to the galactose mutarotase-like fold but lacks the active site residues. Domains structurally similar to domain 2, found in a number of glycoside hydrolases, confer stability to the multidomain architectures.
pfam17451 Glyco_hyd_101C 5.45e-33 1096 1217 1 110
Glycosyl hydrolase 101 beta sandwich domain. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by a S. pneumoniae protein, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor. This domain represents the C-terminal beta sandwich domain.

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
QNM14139.1 0.0 30 2051 29 2023
APC49531.1 6.18e-280 268 1606 66 1442
QOL33684.1 1.73e-273 251 2101 95 1966
QNM11059.1 2.51e-270 304 2119 120 1868
ASK64334.1 6.54e-270 392 1602 81 1278

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2ZXQ_A 1.68e-233 529 1817 25 1369
Crystal structure of endo-alpha-N-acetylgalactosaminidase from Bifidobacterium longum (EngBF) [Bifidobacterium longum]
6QEP_A 1.98e-222 529 1592 10 1113
EngBFDARPin Fusion 4b H14 [Bifidobacterium longum]
6QFK_A 2.15e-222 529 1592 10 1113
EngBFDARPin Fusion 4b G10 [Bifidobacterium longum]
6QEV_B 2.15e-222 529 1592 10 1113
EngBFDARPin Fusion 4b B6 [Bifidobacterium longum]
6SH9_B 2.15e-222 529 1592 10 1113
EngBFDARPin Fusion 4b D12 [Bifidobacterium longum subsp. longum JCM 1217]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q2MGH6 1.10e-198 365 1869 167 1661
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4) OX=170187 GN=SP_0368 PE=1 SV=1
Q8DR60 5.48e-198 365 1869 167 1661
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae (strain ATCC BAA-255 / R6) OX=171101 GN=spr0328 PE=1 SV=1
A9WNA0 7.73e-131 530 1592 51 1034
Putative endo-alpha-N-acetylgalactosaminidase OS=Renibacterium salmoninarum (strain ATCC 33209 / DSM 20767 / JCM 11484 / NBRC 15589 / NCIMB 2235) OX=288705 GN=RSal33209_1326 PE=3 SV=2
I1S2N3 7.17e-06 1682 1820 54 187
Galactose oxidase OS=Gibberella zeae (strain ATCC MYA-4620 / CBS 123657 / FGSC 9075 / NRRL 31084 / PH-1) OX=229533 GN=GAOA PE=3 SV=1
P0CS93 7.17e-06 1682 1820 54 187
Galactose oxidase OS=Gibberella zeae OX=5518 GN=GAOA PE=1 SV=1

SignalP and Lipop Annotations

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000359 0.998936 0.000171 0.000180 0.000151 0.000144
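The class call can be reproduced from the probability row by picking the highest-scoring class. This is a minimal sketch and not the prediction tool's own decision procedure: the assumption that the reported call is simply the top-scoring class is ours, and the helper name `top_class` is hypothetical.

```python
# Class labels and probabilities copied from the table above.
LABELS = ["Other", "SP_Sec_SPI", "LIPO_Sec_SPII",
          "TAT_Tat_SPI", "TATLIP_Sec_SPII", "PILIN_Sec_SPIII"]
PROBS = [0.000359, 0.998936, 0.000171, 0.000180, 0.000151, 0.000144]


def top_class(labels, probs):
    """Return the (label, probability) pair with the highest probability."""
    if len(labels) != len(probs):
        raise ValueError("labels and probs must have the same length")
    best = max(range(len(probs)), key=lambda i: probs[i])
    return labels[best], probs[best]


label, prob = top_class(LABELS, PROBS)
print(label, prob)  # SP_Sec_SPI 0.998936
```

The top class, SP_Sec_SPI, is consistent with the "predicted as SP" statement above.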

TMHMM Annotations

start end
7 29
2149 2171