logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000001423_01603

You are here: Home > Sequence: MGYG000001423_01603

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Clostridium celatum
Lineage Bacteria; Firmicutes_A; Clostridia; Clostridiales; Clostridiaceae; Clostridium; Clostridium celatum
CAZyme ID MGYG000001423_01603
CAZy Family GH110
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1769 197821.28 4.3689
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000001423 3553421 Isolate not provided not provided
Gene Location Start: 63013;  End: 68322  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000001423_01603.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH110 43 620 2.6e-155 0.9653284671532847
CBM51 1565 1705 1.3e-34 0.9925373134328358

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam08305 NPCBM 1.63e-37 1565 1705 3 135
NPCBM/NEW2 domain. This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins. This domain has also been called the NEW2 domain (Naumoff DG. Phylogenetic analysis of alpha-galactosidases of the GH27 family. Molecular Biology (Engl Transl). (2004)38:388-399.)
smart00776 NPCBM 5.90e-29 1565 1705 5 144
This novel putative carbohydrate binding module (NPCBM) domain is found at the N-terminus of glycosyl hydrolase family 98 proteins.
cd14254 Dockerin_II 1.60e-06 1719 1767 1 54
Type II dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type II dockerins, which are responsible for mediating attachment of the cellulosome complex to the bacterial cell wall.
pfam12708 Pectate_lyase_3 0.002 43 88 1 46
Pectate lyase superfamily protein. This family of proteins possesses a beta helical structure like Pectate lyase. This family is most closely related to glycosyl hydrolase family 28.
pfam12733 Cadherin-like 0.007 1479 1560 6 87
Cadherin-like beta sandwich domain. This domain is found in several bacterial, metazoan and chlorophyte algal proteins. A profile-profile comparison recovered the cadherin domain and a comparison of the predicted structure of this domain with the crystal structure of the cadherin showed a congruent seven stranded secondary structure. The domain is widespread in bacteria and seen in the firmicutes, actinobacteria, certain proteobacteria, bacteroides and chlamydiae with an expansion in Clostridium. In contrast, it is limited in its distribution in eukaryotes suggesting that it was derived through lateral transfer from bacteria. In prokaryotes, this domain is widely fused to other domains such as FNIII (Fibronectin Type III), TIG, SLH (S-layer homology), discoidin, cell-wall-binding repeat domain and alpha-amylase-like glycohydrolases. These associations are suggestive of a carbohydrate-binding function for this cadherin-like domain. In animal proteins it is associated with an ATP-grasp domain.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
AYE35188.1 0.0 3 1393 7 1396
QAS60591.1 0.0 3 1393 7 1396
ATD55583.1 9.61e-285 44 693 3 616
ATD56740.1 9.61e-285 44 693 3 616
QBJ75880.1 1.03e-284 44 693 5 618

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
7JW4_A 1.79e-45 27 607 11 564
Crystalstructure of PdGH110B in complex with D-galactose [Pseudoalteromonas distincta],7JW4_B Crystal structure of PdGH110B in complex with D-galactose [Pseudoalteromonas distincta]
7JWF_A 7.79e-45 27 607 11 564
Crystalstructure of PdGH110B D344N in complex with alpha-(1,3)-galactobiose [Pseudoalteromonas distincta],7JWF_B Crystal structure of PdGH110B D344N in complex with alpha-(1,3)-galactobiose [Pseudoalteromonas distincta],7JWF_C Crystal structure of PdGH110B D344N in complex with alpha-(1,3)-galactobiose [Pseudoalteromonas distincta],7JWF_D Crystal structure of PdGH110B D344N in complex with alpha-(1,3)-galactobiose [Pseudoalteromonas distincta]
7JS4_A 5.47e-26 1507 1760 552 797
ChainA, F5/8 type C domain protein [Clostridium perfringens ATCC 13124]
2VMH_A 4.37e-19 1559 1704 3 149
Thestructure of CBM51 from Clostridium perfringens GH95 [Clostridium perfringens]
2VMG_A 5.14e-19 1559 1704 9 155
Thestructure of CBM51 from Clostridium perfringens GH95 in complex with methyl-galactose [Clostridium perfringens]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
B1V8K7 4.43e-122 33 757 24 726
Alpha-1,3-galactosidase A OS=Streptacidiphilus griseoplanus OX=66896 GN=glaA PE=1 SV=1
Q826C5 1.43e-111 18 611 14 585
Alpha-1,3-galactosidase A OS=Streptomyces avermitilis (strain ATCC 31267 / DSM 46492 / JCM 5070 / NBRC 14893 / NCIMB 12804 / NRRL 8165 / MA-4680) OX=227882 GN=glaA PE=3 SV=1
Q5L7M8 1.07e-85 46 678 28 592
Alpha-1,3-galactosidase A OS=Bacteroides fragilis (strain ATCC 25285 / DSM 2151 / CCUG 4856 / JCM 11019 / NCTC 9343 / Onslow) OX=272559 GN=glaA PE=3 SV=1
Q64MU6 1.07e-85 46 678 28 592
Alpha-1,3-galactosidase A OS=Bacteroides fragilis (strain YCH46) OX=295405 GN=glaA PE=3 SV=1
Q8A2Z5 1.09e-80 77 610 46 533
Alpha-1,3-galactosidase A OS=Bacteroides thetaiotaomicron (strain ATCC 29148 / DSM 2079 / JCM 5827 / CCUG 10774 / NCTC 10582 / VPI-5482 / E50) OX=226186 GN=glaA PE=3 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000343 0.998877 0.000262 0.000174 0.000166 0.000145

TMHMM  Annotations      download full data without filtering help

start end
13 35