logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002403_05726

You are here: Home > Sequence: MGYG000002403_05726

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Robinsoniella peoriensis
Lineage Bacteria; Firmicutes_A; Clostridia; Lachnospirales; Lachnospiraceae; Robinsoniella; Robinsoniella peoriensis
CAZyme ID MGYG000002403_05726
CAZy Family CBM51
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2198 MGYG000002403_150|CGC1 244173.4 4.825
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002403 7202103 Isolate not provided not provided
Gene Location Start: 36110;  End: 42706  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000002403_05726.

CAZyme Signature Domains help

Family Start End Evalue family coverage
CBM51 1017 1156 5.9e-30 0.9850746268656716
CBM51 256 387 1.3e-20 0.9850746268656716

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd14244 GH_101_like 1.59e-60 576 889 1 298
Endo-a-N-acetylgalactosaminidase and related glcyosyl hydrolases. This family contains the enzymatically active domain of cell surface proteins that specifically cleave Gal-beta-1,3-GalNAc-alpha-Ser/Thr (T-antigen, galacto-N-biose), the core 1 type O-linked glycan common to mucin glycoproteins (EC:3.2.1.97). It has been classified as glycosyl hydrolase family 101 in the Cazy resource. Virulence of pathogenic organisms such as the Gram-positive Streptococcus pneumoniae and other commensal human bacteria is largely determined by their ability to degrade host glycoproteins and to metabolize the resultant carbohydrates.
pfam08305 NPCBM 1.13e-33 1017 1157 4 136
NPCBM/NEW2 domain. This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins. This domain has also been called the NEW2 domain (Naumoff DG. Phylogenetic analysis of alpha-galactosidases of the GH27 family. Molecular Biology (Engl Transl). (2004)38:388-399.)
smart00776 NPCBM 3.61e-26 1012 1157 1 145
This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins.
pfam08305 NPCBM 1.98e-21 255 387 3 135
NPCBM/NEW2 domain. This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins. This domain has also been called the NEW2 domain (Naumoff DG. Phylogenetic analysis of alpha-galactosidases of the GH27 family. Molecular Biology (Engl Transl). (2004)38:388-399.)
sd00036 LRR_3 5.52e-20 2084 2188 1 104
leucine-rich repeats. A leucine-rich repeat (LRR) is a structural protein motif of 20-30 amino acids that is unusually rich in the hydrophobic amino acid leucine. The conserved eleven-residue sequence motif (LxxLxLxxN/CxL) within the LRRs corresponds to the beta-strand and adjacent loop regions, whereas the remaining parts of the repeats are variable. LRRs fold together to form a solenoid protein domain, termed leucine-rich repeat domain. Leucine-rich repeats are usually involved in protein-protein interactions.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QJD87425.1 0.0 14 1778 8 1842
AWB96632.1 3.24e-131 1134 1729 196 788
QTH41000.1 5.26e-118 367 1003 131 778
SYX82446.1 3.26e-106 516 1006 68 569
SMF83630.1 1.79e-105 491 1006 40 569

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
7JS4_A 3.03e-13 1016 1253 612 835
ChainA, F5/8 type C domain protein [Clostridium perfringens ATCC 13124]
2VMH_A 1.79e-11 1008 1157 1 151
Thestructure of CBM51 from Clostridium perfringens GH95 [Clostridium perfringens]
2VMG_A 2.06e-11 1008 1157 7 157
Thestructure of CBM51 from Clostridium perfringens GH95 in complex with methyl-galactose [Clostridium perfringens]
2VMI_A 1.55e-10 1008 1157 1 151
Thestructure of seleno-methionine labelled CBM51 from Clostridium perfringens GH95 [Clostridium perfringens]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A9WNA0 2.76e-07 985 1155 1123 1302
Putative endo-alpha-N-acetylgalactosaminidase OS=Renibacterium salmoninarum (strain ATCC 33209 / DSM 20767 / JCM 11484 / NBRC 15589 / NCIMB 2235) OX=288705 GN=RSal33209_1326 PE=3 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000282 0.998976 0.000207 0.000212 0.000157 0.000134

TMHMM  Annotations      download full data without filtering help

start end
7 29