logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001018_00307

You are here: Home > Sequence: MGYG000001018_00307

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Mediterraneibacter sp900751785
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Mediterraneibacter; Mediterraneibacter sp900751785
CAZyme ID MGYG000001018_00307
CAZy Family GH101
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2292 MGYG000001018_22|CGC1 251095.36 4.0408
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001018 2841260 MAG Sweden Europe
Gene Location Start: 22448;  End: 29326  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000001018_00307.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH101 513 1036 3.8e-78 0.8048090523338048

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam12905 Glyco_hydro_101 8.26e-58 743 1000 1 270
Endo-alpha-N-acetylgalactosaminidase. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by the S. pneumoniae protein Endo-alpha-N-acetylgalactosaminidase, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor.
cd14244 GH_101_like 5.27e-56 769 1026 14 298
Endo-a-N-acetylgalactosaminidase and related glcyosyl hydrolases. This family contains the enzymatically active domain of cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins (EC:3.2.1.97). It has been classified as glycosyl hydrolase family 101 in the Cazy resource. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae and other commensal human bacteria is largely determined by their ability to degrade host glycoproteins and to metabolize the resultant carbohydrates.
pfam18080 Gal_mutarotas_3 3.69e-35 509 742 1 243
Galactose mutarotase-like fold domain. This domain is found in endo-alpha-N-acetylgalactosaminidase present in Streptococcus pneumoniae. Endo-alpha-N-acetylgalactosaminidase is a cell surface-anchored glycoside hydrolase involved in the breakdown of mucin type O-linked glycans. The domain, known as domain 2, exhibits strong structural similarlity to the galactose mutarotase-like fold but lacks the active site residues. Domains, found in a number of glycoside hydrolases, structurally similar to domain 2 confer stability to the multidomain architectures.
pfam13385 Laminin_G_3 1.49e-24 1680 1836 1 150
Concanavalin A-like lectin/glucanases superfamily. This domain belongs to the Concanavalin A-like lectin/glucanases superfamily.
pfam17451 Glyco_hyd_101C 5.83e-24 1011 1110 3 110
Glycosyl hydrolase 101 beta sandwich domain. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae is largely determined by the ability to degrade host glycoproteins and to metabolize the resultant carbohydrates. This family is the enzymatic region, EC:3.2.1.97, of the cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins. This reaction is exemplified by a S. pneumoniae protein, where Asp764 is the catalytic nucleophile-base and Glu796 the catalytic proton donor. This domain represents C-terminal the beta sandwich domain.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
ATD58436.1 4.80e-176 110 1460 292 1641
ATD54117.1 4.80e-176 110 1460 292 1641
QBJ76357.1 4.80e-176 110 1460 292 1641
SLK22612.1 4.80e-176 110 1460 292 1641
BBK62189.1 4.27e-150 1 2158 1 2114

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6QEP_A 3.04e-88 514 1468 17 1101
EngBFDARPin Fusion 4b H14 [Bifidobacterium longum]
6SH9_B 3.12e-88 514 1468 17 1101
EngBFDARPin Fusion 4b D12 [Bifidobacterium longum subsp. longum JCM 1217]
6QEV_B 3.12e-88 514 1468 17 1101
EngBFDARPin Fusion 4b B6 [Bifidobacterium longum]
6QFK_A 3.12e-88 514 1468 17 1101
EngBFDARPin Fusion 4b G10 [Bifidobacterium longum]
2ZXQ_A 3.66e-88 514 1468 32 1116
Crystalstructure of endo-alpha-N-acetylgalactosaminidase from Bifidobacterium longum (EngBF) [Bifidobacterium longum]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A9WNA0 1.95e-85 508 1389 51 962
Putative endo-alpha-N-acetylgalactosaminidase OS=Renibacterium salmoninarum (strain ATCC 33209 / DSM 20767 / JCM 11484 / NBRC 15589 / NCIMB 2235) OX=288705 GN=RSal33209_1326 PE=3 SV=2
Q2MGH6 9.14e-80 340 1389 151 1319
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae serotype 4 (strain ATCC BAA-334 / TIGR4) OX=170187 GN=SP_0368 PE=1 SV=1
Q8DR60 2.73e-79 340 1389 151 1319
Endo-alpha-N-acetylgalactosaminidase OS=Streptococcus pneumoniae (strain ATCC BAA-255 / R6) OX=171101 GN=spr0328 PE=1 SV=1
E8MGH9 6.38e-12 1943 2143 1667 1871
Beta-L-arabinobiosidase OS=Bifidobacterium longum subsp. longum (strain ATCC 15707 / DSM 20219 / JCM 1217 / NCTC 11818 / E194b) OX=565042 GN=hypBA2 PE=1 SV=1
P50899 2.04e-07 1450 1614 700 864
Exoglucanase B OS=Cellulomonas fimi (strain ATCC 484 / DSM 20113 / JCM 1341 / NBRC 15513 / NCIMB 8980 / NCTC 7547) OX=590998 GN=cbhB PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000326 0.998845 0.000207 0.000215 0.000193 0.000174

TMHMM  Annotations      download full data without filtering help

start end
12 34
2263 2285