logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001442_00219

You are here: Home > Sequence: MGYG000001442_00219

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Streptococcus sp000411475
Lineage Bacteria; Firmicutes; Bacilli; Lactobacillales; Streptococcaceae; Streptococcus; Streptococcus sp000411475
CAZyme ID MGYG000001442_00219
CAZy Family GH101
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2209 MGYG000001442_1|CGC3 245365.05 4.6691
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001442 1753973 Isolate not provided not provided
Gene Location Start: 213354;  End: 219983  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.97

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH101 629 1336 0 0.9971711456859972

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam12905 Glyco_hydro_101 1.42e-154 879 1169 1 273
Endo-alpha-N-acetylgalactosaminidase. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by the S. pneumoniae protein Endo-alpha-N-acetylgalactosaminidase, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor.
cd14244 GH_101_like 4.12e-131 892 1192 1 298
Endo-a-N-acetylgalactosaminidase and related glcyosyl hydrolases. This family contains the enzymatically active domain of cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins (EC:3.2.1.97). It has been classified as glycosyl hydrolase family 101 in the Cazy resource. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae and other commensal human bacteria is largely determined by their ability to degrade host glycoproteins and to metabolize the resultant carbohydrates.
pfam17974 GalBD_like 5.58e-104 1500 1692 1 190
Galactose-binding domain-like. Proteins containing a galactose-binding domain-like fold can be found in several different protein families, in both eukaryotes and prokaryotes. The common function of these domains is to bind to specific ligands, such as cell-surface-attached carbohydrate substrates for galactose oxidase and sialidase, phospholipids on the outer side of the mammalian cell membrane for coagulation factor Va, membrane-anchored ephrin for the Eph family of receptor tyrosine kinases, and a complex of broken single-stranded DNA and DNA polymerase beta for XRCC1. The structure of the galactose-binding domain-like members consists of a beta-sandwich, in which the strands making up the sheets exhibit a jellyroll fold.
pfam17995 GH101_N 4.15e-94 420 599 1 180
Endo-alpha-N-acetylgalactosaminidase N-terminal. This is the N-terminal domain found in Streptococcus pneumoniae endo-alpha-N-acetylgalactosaminidase (EC:3.2.1.97), a cell surface-anchored glycoside hydrolase from family GH101 involved in the breakdown of mucin type O-linked glycans. This is a twisted beta-sandwich domain composed of two sheets of six and seven antiparallel beta-strands. The domain appears to be missing the extended metal and carbohydrate-binding loops.
pfam18080 Gal_mutarotas_3 1.25e-87 627 878 1 243
Galactose mutarotase-like fold domain. This domain is found in endo-alpha-N-acetylgalactosaminidase present in Streptococcus pneumoniae. Endo-alpha-N-acetylgalactosaminidase is a cell surface-anchored glycoside hydrolase involved in the breakdown of mucin type O-linked glycans. The domain, known as domain 2, exhibits strong structural similarlity to the galactose mutarotase-like fold but lacks the active site residues. Domains, found in a number of glycoside hydrolases, structurally similar to domain 2 confer stability to the multidomain architectures.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QLF55289.1 0.0 1 2209 1 2248
CBJ22778.1 0.0 1 2076 1 2042
VTS69771.1 0.0 1 2078 1 1990
QXW61151.1 0.0 1 2209 1 2121
QLL96372.1 0.0 1 2209 1 2121

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
5A55_A 0.0 614 1713 8 1112
Thenative structure of GH101 from Streptococcus pneumoniae TIGR4 [Streptococcus pneumoniae TIGR4]
5A58_A 0.0 614 1713 6 1110
Thestructure of GH101 D764N mutant from Streptococcus pneumoniae TIGR4 in complex with serinyl T-antigen [Streptococcus pneumoniae TIGR4]
5A57_A 0.0 614 1718 6 1115
Thestructure of GH101 from Streptococcus pneumoniae TIGR4 in complex with PUGT [Streptococcus pneumoniae TIGR4]
5A56_A 0.0 614 1713 6 1110
Thestructure of GH101 from Streptococcus pneumoniae TIGR4 in complex with 1-O-methyl-T-antigen [Streptococcus pneumoniae TIGR4]
3ECQ_A 0.0 420 1853 89 1530
Endo-alpha-N-acetylgalactosaminidasefrom Streptococcus pneumoniae: SeMet structure [Streptococcus pneumoniae R6],3ECQ_B Endo-alpha-N-acetylgalactosaminidase from Streptococcus pneumoniae: SeMet structure [Streptococcus pneumoniae R6]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q2MGH6 0.0 1 1982 1 1699
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4) OX=170187 GN=SP_0368 PE=1 SV=1
Q8DR60 0.0 1 1982 1 1699
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae (strain ATCC BAA-255 / R6) OX=171101 GN=spr0328 PE=1 SV=1
A9WNA0 6.61e-108 626 1706 51 1037
Putative endo-alpha-N-acetylgalactosaminidase OS=Renibacterium salmoninarum (strain ATCC 33209 / DSM 20767 / JCM 11484 / NBRC 15589 / NCIMB 2235) OX=288705 GN=RSal33209_1326 PE=3 SV=2
P09331 1.06e-07 166 330 94 253
Exfoliative toxin A OS=Staphylococcus aureus OX=1280 GN=eta PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000324 0.998874 0.000199 0.000211 0.000187 0.000160

TMHMM  Annotations      download full data without filtering help

start end
17 39