logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000003147_00163

You are here: Home > Sequence: MGYG000003147_00163

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Robinsoniella sp900555455
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Robinsoniella; Robinsoniella sp900555455
CAZyme ID MGYG000003147_00163
CAZy Family GH101
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2559 MGYG000003147_1|CGC5 280751.86 4.7711
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000003147 4940098 MAG United States North America
Gene Location Start: 196948;  End: 204627  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.97

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH101 1029 1689 5.6e-127 0.9929278642149929

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam12905 Glyco_hydro_101 3.68e-82 1252 1500 1 240
Endo-alpha-N-acetylgalactosaminidase. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by the S. pneumoniae protein Endo-alpha-N-acetylgalactosaminidase, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor.
cd14244 GH_101_like 2.28e-72 1267 1555 2 298
Endo-a-N-acetylgalactosaminidase and related glcyosyl hydrolases. This family contains the enzymatically active domain of cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins (EC:3.2.1.97). It has been classified as glycosyl hydrolase family 101 in the Cazy resource. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae and other commensal human bacteria is largely determined by their ability to degrade host glycoproteins and to metabolize the resultant carbohydrates.
pfam18080 Gal_mutarotas_3 2.52e-52 1024 1251 1 243
Galactose mutarotase-like fold domain. This domain is found in endo-alpha-N-acetylgalactosaminidase present in Streptococcus pneumoniae. Endo-alpha-N-acetylgalactosaminidase is a cell surface-anchored glycoside hydrolase involved in the breakdown of mucin type O-linked glycans. The domain, known as domain 2, exhibits strong structural similarlity to the galactose mutarotase-like fold but lacks the active site residues. Domains, found in a number of glycoside hydrolases, structurally similar to domain 2 confer stability to the multidomain architectures.
pfam17451 Glyco_hyd_101C 3.64e-20 1538 1663 1 111
Glycosyl hydrolase 101 beta sandwich domain. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by a S. pneumoniae protein, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor. This domain represents C-terminal the beta sandwich domain.
COG5492 YjdB 1.34e-14 2399 2549 181 325
Uncharacterized conserved protein YjdB, contains Ig-like domain [General function prediction only].

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AIQ34688.1 0.0 106 2132 128 2156
QAS61722.1 4.26e-204 468 2397 23 1866
AYE33559.1 4.26e-204 468 2397 23 1866
ATD54516.1 2.85e-195 468 2390 20 1790
ATD57802.1 2.85e-195 468 2390 20 1790

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
5A55_A 1.30e-96 1024 2022 21 1056
Thenative structure of GH101 from Streptococcus pneumoniae TIGR4 [Streptococcus pneumoniae TIGR4]
5A57_A 1.40e-96 1024 2022 19 1054
Thestructure of GH101 from Streptococcus pneumoniae TIGR4 in complex with PUGT [Streptococcus pneumoniae TIGR4]
5A56_A 1.40e-96 1024 2022 19 1054
Thestructure of GH101 from Streptococcus pneumoniae TIGR4 in complex with 1-O-methyl-T-antigen [Streptococcus pneumoniae TIGR4]
5A59_A 3.39e-96 1024 2022 19 1054
Thestructure of GH101 E796Q mutant from Streptococcus pneumoniae TIGR4 in complex with T-antigen [Streptococcus pneumoniae TIGR4],5A5A_A The structure of GH101 E796Q mutant from Streptococcus pneumoniae TIGR4 in complex with PNP-T-antigen [Streptococcus pneumoniae TIGR4]
5A58_A 6.10e-96 1024 2022 19 1054
Thestructure of GH101 D764N mutant from Streptococcus pneumoniae TIGR4 in complex with serinyl T-antigen [Streptococcus pneumoniae TIGR4]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q2MGH6 9.49e-94 1024 2022 334 1369
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4) OX=170187 GN=SP_0368 PE=1 SV=1
Q8DR60 3.78e-93 1024 2022 334 1369
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae (strain ATCC BAA-255 / R6) OX=171101 GN=spr0328 PE=1 SV=1
A9WNA0 1.20e-87 1022 1971 50 963
Putative endo-alpha-N-acetylgalactosaminidase OS=Renibacterium salmoninarum (strain ATCC 33209 / DSM 20767 / JCM 11484 / NBRC 15589 / NCIMB 2235) OX=288705 GN=RSal33209_1326 PE=3 SV=2
P33747 4.35e-06 2419 2542 57 184
Uncharacterized protein CA_P0160 OS=Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787) OX=272562 GN=CA_P0160 PE=3 SV=2

SignalP and Lipop Annotations help

This protein is predicted as LIPO

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.001790 0.356205 0.641344 0.000266 0.000211 0.000167

TMHMM  Annotations      download full data without filtering help

start end
7 29