logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000013_04509

You are here: Home > Sequence: MGYG000000013_04509

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Bacteroides sp902362375
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae; Bacteroides; Bacteroides sp902362375
CAZyme ID MGYG000000013_04509
CAZy Family GH5
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
508 MGYG000000013_14|CGC4 59017.75 6.505
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000013 6368149 Isolate United Kingdom Europe
Gene Location Start: 141601;  End: 143127  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

EC 3.2.1.4

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH5 44 351 6.2e-119 0.9935897435897436

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam00150 Cellulase 1.91e-26 71 349 31 268
Cellulase (glycosyl hydrolase family 5).
cd04081 CBM35_galactosidase-like 5.34e-09 385 502 3 125
Carbohydrate Binding Module family 35 (CBM35); appended mainly to enzymes that bind alpha-D-galactose (CBM35-Gal), including glycoside hydrolase (GH) families GH27 and GH43. This family includes carbohydrate binding module family 35 (CBM35); these are non-catalytic carbohydrate binding domains that are appended mainly to enzymes that bind alpha-D-galactose (CBM35-Gal), including glycoside hydrolase (GH) families GH27 and GH43. Examples of proteins which contain CBM35s belonging to this family includes the CBM35 of an exo-beta-1,3-galactanase from Phanerochaete chrysosporium 9 (Pc1,3Gal43A) which is appended to a GH43 domain, and the CBM35 domain of two bifunctional proteins with beta-L-arabinopyranosidase/alpha-D-galactopyranosidase activities from Fusarium oxysporum 12S, Foap1 and Foap2 (Fo/AP1 and Fo/AP2), that are appended to GH27 domains. CBM35s are unique in that they display conserved specificity through extensive sequence similarity but divergent function through their appended catalytic modules. They are known to bind alpha-D-galactose (Gal), mannan (Man), xylan, glucuronic acid (GlcA), a beta-polymer of mannose, and possibly glucans, forming four subfamilies based on general ligand specificities (galacto, urono, manno, and gluco configurations). Some CBM35s bind their ligands in a calcium-dependent manner. In contrast to most CBMs that are generally rigid proteins, CBM35 undergoes significant conformational change upon ligand binding. GH43 includes beta-xylosidases and beta-xylanases, using aryl-glycosides as substrates, while family GH27 includes alpha-galactosidases, alpha-N-acetylgalactosaminidases, and isomaltodextranases.
COG2730 BglC 2.75e-05 42 175 52 191
Aryl-phospho-beta-D-glucosidase BglC, GH1 family [Carbohydrate transport and metabolism].
cd02795 CBM6-CBM35-CBM36_like 3.91e-04 385 502 2 124
Carbohydrate Binding Module 6 (CBM6) and CBM35_like superfamily. Carbohydrate binding module family 6 (CBM6, family 6 CBM), also known as cellulose binding domain family VI (CBD VI), and related CBMs (CBM35 and CBM36). These are non-catalytic carbohydrate binding domains found in a range of enzymes that display activities against a diverse range of carbohydrate targets, including mannan, xylan, beta-glucans, cellulose, agarose, and arabinans. These domains facilitate the strong binding of the appended catalytic modules to their dedicated, insoluble substrates. Many of these CBMs are associated with glycoside hydrolase (GH) domains. CBM6 is an unusual CBM as it represents a chimera of two distinct binding sites with different modes of binding: binding site I within the loop regions and binding site II on the concave face of the beta-sandwich fold. CBM36s are calcium-dependent xylan binding domains. CBM35s display conserved specificity through extensive sequence similarity, but divergent function through their appended catalytic modules. This alignment model also contains the C-terminal domains of bacterial insecticidal toxins, where they may be involved in determining insect specificity through carbohydrate binding functionality.
cd04083 CBM35_Lmo2446-like 0.001 404 481 23 102
Carbohydrate Binding Module 35 (CBM35) domains similar to Lmo2446. This family includes carbohydrate binding module 35 (CBM35) domains that are appended to several carbohydrate binding enzymes. Some CBM35 domains belonging to this family are appended to glycoside hydrolase (GH) family domains, including glycoside hydrolase family 31 (GH31), for example the CBM35 domain of Lmo2446, an uncharacterized protein from Listeria monocytogenes EGD-e. These CBM35s are non-catalytic carbohydrate binding domains that facilitate the strong binding of the GH catalytic modules with their dedicated, insoluble substrates. GH31 has a wide range of hydrolytic activities such as alpha-glucosidase, alpha-xylosidase, 6-alpha-glucosyltransferase, or alpha-1,4-glucan lyase, cleaving a terminal carbohydrate moiety from a substrate that may be a starch or a glycoprotein. Most characterized GH31 enzymes are alpha-glucosidases.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QNL38165.1 0.0 1 508 1 508
QNL38167.1 3.74e-183 37 504 136 604
SMD43874.1 1.17e-98 39 364 34 359
AXT60837.1 3.83e-98 33 375 18 360
AUP81187.1 8.92e-95 28 374 26 376

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
1CEC_A 2.59e-46 44 371 7 338
ChainA, ENDOGLUCANASE CELC [Acetivibrio thermocellus]
1CEN_A 6.98e-46 44 371 7 338
ChainA, CELLULASE CELC [Acetivibrio thermocellus],1CEO_A Chain A, CELLULASE CELC [Acetivibrio thermocellus]
3AMC_A 1.44e-28 32 376 3 316
Crystalstructures of Thermotoga maritima Cel5A, apo form and dimer/au [Thermotoga maritima MSB8],3AMC_B Crystal structures of Thermotoga maritima Cel5A, apo form and dimer/au [Thermotoga maritima MSB8],3AMD_A Crystal structures of Thermotoga maritima Cel5A, apo form and tetramer/au [Thermotoga maritima MSB8],3AMD_B Crystal structures of Thermotoga maritima Cel5A, apo form and tetramer/au [Thermotoga maritima MSB8],3AMD_C Crystal structures of Thermotoga maritima Cel5A, apo form and tetramer/au [Thermotoga maritima MSB8],3AMD_D Crystal structures of Thermotoga maritima Cel5A, apo form and tetramer/au [Thermotoga maritima MSB8],3MMU_A Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_B Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_C Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_D Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_E Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_F Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_G Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMU_H Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMW_A Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMW_B Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMW_C Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima],3MMW_D Crystal structure of endoglucanase Cel5A from the hyperthermophilic Thermotoga maritima [Thermotoga maritima]
3AMG_A 9.49e-28 32 376 3 316
Crystalstructures of Thermotoga maritima Cel5A in complex with Cellobiose substrate, mutant form [Thermotoga maritima MSB8],3AMG_B Crystal structures of Thermotoga maritima Cel5A in complex with Cellobiose substrate, mutant form [Thermotoga maritima MSB8]
3AZR_A 9.49e-28 32 376 3 316
DiverseSubstrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellobiose [Thermotoga maritima MSB8],3AZR_B Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellobiose [Thermotoga maritima MSB8],3AZS_A Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Mannotriose [Thermotoga maritima MSB8],3AZS_B Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Mannotriose [Thermotoga maritima MSB8],3AZT_A Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellotetraose [Thermotoga maritima MSB8],3AZT_B Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellotetraose [Thermotoga maritima MSB8],3AZT_C Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellotetraose [Thermotoga maritima MSB8],3AZT_D Diverse Substrates Recognition Mechanism Revealed by Thermotoga maritima Cel5A Structures in Complex with Cellotetraose [Thermotoga maritima MSB8]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
A3DJ77 2.71e-46 44 371 7 338
Endoglucanase C OS=Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) OX=203119 GN=celC PE=3 SV=1
P23340 2.71e-46 44 371 7 338
Endoglucanase C307 OS=Clostridium sp. (strain F1) OX=1508 GN=celC307 PE=1 SV=1
P0C2S3 1.42e-45 44 371 7 338
Endoglucanase C OS=Acetivibrio thermocellus OX=1515 GN=celC PE=1 SV=1
P16169 1.02e-34 41 364 7 311
Cellodextrinase A OS=Ruminococcus flavefaciens OX=1265 GN=celA PE=3 SV=3
P14250 9.16e-26 41 363 310 647
Endoglucanase 3 OS=Fibrobacter succinogenes (strain ATCC 19169 / S85) OX=59374 GN=cel-3 PE=1 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.129813 0.867663 0.001712 0.000251 0.000247 0.000297

TMHMM  Annotations      download full data without filtering help

start end
5 27