logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000004230_01687

You are here: Home > Sequence: MGYG000004230_01687

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Clostridium sp900759995
Lineage Bacteria; Firmicutes_A; Clostridia; Clostridiales; Clostridiaceae; Clostridium; Clostridium sp900759995
CAZyme ID MGYG000004230_01687
CAZy Family CBM51
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1460 162715.65 4.0854
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000004230 3516057 MAG United Kingdom Europe
Gene Location Start: 25433;  End: 29815  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000004230_01687.

CAZyme Signature Domains help

Family Start End Evalue family coverage
CBM51 54 200 4.5e-25 0.9850746268656716

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam08305 NPCBM 2.84e-27 51 200 1 135
NPCBM/NEW2 domain. This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins. This domain has also been called the NEW2 domain (Naumoff DG. Phylogenetic analysis of alpha-galactosidases of the GH27 family. Molecular Biology (Engl Transl). (2004)38:388-399.)
smart00776 NPCBM 7.43e-21 49 200 1 144
This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins.
pfam00754 F5_F8_type_C 2.91e-14 889 1003 2 118
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
NF033839 PspC_subgroup_2 1.62e-09 1055 1280 365 525
pneumococcal surface protein PspC, LPXTG-anchored form. The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
NF033845 MSCRAMM_ClfB 1.49e-08 1063 1267 629 820
MSCRAMM family adhesin clumping factor ClfB. Clumping factor B is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AQW23377.1 1.78e-253 1 1022 1 988
ATD49073.1 4.14e-253 50 1022 51 993
AMN35280.1 4.96e-253 1 1022 1 988
BCL58565.1 1.15e-183 52 1032 80 1059
QUF62688.1 6.13e-152 31 1022 120 1097

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2VMH_A 3.11e-18 35 200 2 150
Thestructure of CBM51 from Clostridium perfringens GH95 [Clostridium perfringens]
2VMG_A 3.65e-18 35 200 8 156
Thestructure of CBM51 from Clostridium perfringens GH95 in complex with methyl-galactose [Clostridium perfringens]
2VMI_A 7.86e-18 35 200 2 150
Thestructure of seleno-methionine labelled CBM51 from Clostridium perfringens GH95 [Clostridium perfringens]
7JS4_A 2.41e-13 40 200 603 752
ChainA, F5/8 type C domain protein [Clostridium perfringens ATCC 13124]

Swiss-Prot Hits      help

has no Swissprot hit.

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000481 0.998662 0.000257 0.000211 0.000181 0.000169

TMHMM  Annotations      download full data without filtering help

start end
1431 1453