logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001525_03343

You are here: Home > Sequence: MGYG000001525_03343

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Paenibacillus_A rubinfantis
Lineage Bacteria; Firmicutes; Bacilli; Paenibacillales; Paenibacillaceae; Paenibacillus_A; Paenibacillus_A rubinfantis
CAZyme ID MGYG000001525_03343
CAZy Family CBM46
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
982 104884.12 4.5566
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001525 5369528 Isolate not provided not provided
Gene Location Start: 1121565;  End: 1124513  Strand: -

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000001525_03343.

CAZyme Signature Domains help

Family Start End Evalue family coverage
CBM46 145 219 1.3e-16 0.9080459770114943

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam03442 CBM_X2 1.86e-30 139 221 1 83
Carbohydrate binding domain X2. This domain binds to cellulose and to bacterial cell walls. It is found in glycosyl hydrolases and in scaffolding proteins of cellulosomes (multiprotein glycosyl hydrolase complexes). In the cellulosome it may aid cellulose degradation by anchoring the cellulosome to the bacterial cell wall and by binding it to its substrate. This domain has an Ig-like fold.
NF033190 inl_like_NEAT_1 1.16e-24 817 957 574 712
NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.
pfam03442 CBM_X2 2.34e-21 50 127 4 83
Carbohydrate binding domain X2. This domain binds to cellulose and to bacterial cell walls. It is found in glycosyl hydrolases and in scaffolding proteins of cellulosomes (multiprotein glycosyl hydrolase complexes). In the cellulosome it may aid cellulose degradation by anchoring the cellulosome to the bacterial cell wall and by binding it to its substrate. This domain has an Ig-like fold.
pfam03442 CBM_X2 6.83e-21 354 424 1 71
Carbohydrate binding domain X2. This domain binds to cellulose and to bacterial cell walls. It is found in glycosyl hydrolases and in scaffolding proteins of cellulosomes (multiprotein glycosyl hydrolase complexes). In the cellulosome it may aid cellulose degradation by anchoring the cellulosome to the bacterial cell wall and by binding it to its substrate. This domain has an Ig-like fold.
NF033190 inl_like_NEAT_1 3.05e-15 781 935 598 754
NEAT domain-containing leucine-rich repeat protein. Members of this family have an N-terminal NEAT (near transporter) domain often associated with iron transport, followed by a leucine-rich repeat region with significant sequence similarity to the internalins of Listeria monocytogenes. However, since Bacillus cereus (from which this protein was described, in PMID:16978259) is not considered an intracellular pathogen, and the function may be iron transport rather than internalization, applying the name "internalin" to this family probably would be misleading.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QHW31849.1 9.46e-311 11 979 624 1589
AZN39276.1 1.02e-303 2 980 616 1590
ANY69031.1 1.31e-223 43 967 897 1878
QSF46401.1 8.22e-181 8 965 599 1378
QLG41554.1 3.60e-174 136 982 665 1311

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6BT4_A 1.90e-07 824 932 25 133
Crystalstructure of the SLH domain of Sap from Bacillus anthracis in complex with a pyruvylated SCWP unit [Bacillus anthracis]
3PYW_A 1.93e-07 824 932 4 112
Thestructure of the SLH domain from B. anthracis surface array protein at 1.8A [Bacillus anthracis]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q06853 9.00e-41 783 980 480 682
Cell surface glycoprotein 2 OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=Cthe_3079 PE=1 SV=1
Q06848 4.64e-37 783 968 239 430
Cellulosome-anchoring protein OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=ancA PE=1 SV=1
Q06852 5.02e-36 783 968 2100 2296
Cell surface glycoprotein 1 OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=olpB PE=3 SV=2
P38058 3.87e-32 41 224 1516 1698
Cellulose-binding protein A OS=Clostridium cellulovorans OX=1493 GN=cbpA PE=3 SV=1
P23659 2.42e-27 3 226 621 832
Endoglucanase Z OS=Thermoclostridium stercorarium OX=1510 GN=celZ PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as OTHER

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
1.000065 0.000000 0.000000 0.000000 0.000000 0.000000

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000001525_03343.