Environment: Human Gut

CAZyme Information: MGYG000001549_00521

Basic Information

Species Gracilibacillus phocaeensis
Lineage Bacteria; Firmicutes; Bacilli; Bacillales_D; Amphibacillaceae; Gracilibacillus; Gracilibacillus phocaeensis
CAZyme ID MGYG000001549_00521
CAZy Family GH43
CAZyme Description hypothetical protein
CAZyme Property
Protein Length 1374 aa
CGC MGYG000001549_9|CGC4
Molecular Weight 154251.68 Da
Isoelectric Point 4.2907
Genome Property
Genome Assembly ID MGYG000001549
Genome Size 4547476 bp
Genome Type Isolate
Country not provided
Continent not provided
Gene Location Start: 71415;  End: 75539  Strand: +
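
As a quick consistency check, the gene coordinates and the protein length should agree. The sketch below assumes the Start and End values are 1-based and inclusive and that the span ends with a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a gene span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(start: int, end: int) -> int:
    """Codons in the span minus one, assuming one terminal stop codon."""
    length = gene_length_bp(start, end)
    if length % 3 != 0:
        raise ValueError("gene span is not a whole number of codons")
    return length // 3 - 1


# Coordinates copied from the Gene Location line of this record.
start, end = 71415, 75539
print(gene_length_bp(start, end))           # 4125
print(expected_protein_length(start, end))  # 1374
```

Under those assumptions the span of 4125 bp corresponds to 1375 codons, which matches the reported protein length of 1374 residues plus a stop codon.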

Enzyme Prediction

No EC number is predicted for MGYG000001549_00521.

CAZyme Signature Domains

Family Start End E-value Family-Coverage
GH43 54 286 1.3e-62 0.9962546816479401
CBM6 690 823 3.2e-31 0.9637681159420289
CBM66 1134 1283 4.3e-29 0.8774193548387097
CBM6 989 1123 2.5e-26 0.9710144927536232
CBM6 840 972 7.2e-25 0.9637681159420289
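
The table above is ordered by E-value rather than by position, so the domain order along the protein is not obvious at a glance. A minimal sketch that sorts the rows by start residue and reports the gaps between consecutive domains; the `Domain` type and function names are hypothetical helpers, and the values are copied from the table.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    family: str
    start: int
    end: int
    evalue: float


# Rows copied from the signature-domain table (family, start, end, E-value).
DOMAINS = [
    Domain("GH43", 54, 286, 1.3e-62),
    Domain("CBM6", 690, 823, 3.2e-31),
    Domain("CBM66", 1134, 1283, 4.3e-29),
    Domain("CBM6", 989, 1123, 2.5e-26),
    Domain("CBM6", 840, 972, 7.2e-25),
]


def architecture(domains):
    """Family names joined in N- to C-terminal order."""
    ordered = sorted(domains, key=lambda d: d.start)
    return "-".join(d.family for d in ordered)


def linker_lengths(domains):
    """Number of residues between each pair of consecutive domains."""
    ordered = sorted(domains, key=lambda d: d.start)
    return [nxt.start - cur.end - 1 for cur, nxt in zip(ordered, ordered[1:])]


print(architecture(DOMAINS))    # GH43-CBM6-CBM6-CBM6-CBM66
print(linker_lengths(DOMAINS))  # [403, 16, 16, 10]
```

Read this way, the record describes an N-terminal GH43 catalytic domain followed, after an unannotated stretch of about 400 residues, by three CBM6 modules and a C-terminal CBM66.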

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd08991 GH43_HoAraf43-like 7.01e-96 45 293 1 283
Glycosyl hydrolase family 43 protein such as Halothermothrix orenii H 168 alpha-L-arabinofuranosidase (HoAraf43; Hore_20580). This glycosyl hydrolase family 43 (GH43) subgroup includes Halothermothrix orenii H 168 alpha-L-arabinofuranosidase (EC 3.2.1.55) (HoAraf43; Hore_20580). It belongs to glycosyl hydrolase clan F (according to the Carbohydrate-Active enZymes database, CAZy), which includes families 43 (GH43) and 62 (GH62). This GH43_HoAraf43-like subgroup includes enzymes that have been annotated as having xylan-digesting beta-xylosidase (EC 3.2.1.37) and xylanase (endo-alpha-L-arabinanase, EC 3.2.1.8) activities. GH43 enzymes are inverting enzymes (i.e. they invert the stereochemistry of the anomeric carbon atom of the substrate) that have an aspartate as the catalytic general base, a glutamate as the catalytic general acid, and another aspartate that is responsible for pKa modulation and for orienting the catalytic acid. Many GH43 enzymes display both alpha-L-arabinofuranosidase and beta-D-xylosidase activity using aryl-glycosides as substrates. A common structural feature of GH43 enzymes is a 5-bladed beta-propeller domain that contains the catalytic acid and catalytic base. A long V-shaped groove, partially enclosed at one end, forms a single extended substrate-binding surface across the face of the propeller.
cd04080 CBM6_cellulase-like 7.31e-50 682 823 1 143
Carbohydrate Binding Module 6 (CBM6); appended to glycoside hydrolase (GH) domains, including GH5 (cellulase). This family includes carbohydrate binding module 6 (CBM6) domains that are appended to several glycoside hydrolase (GH) domains, including GH5 (cellulase) and GH16, as well as to coagulation factor 5/8 carbohydrate-binding domains. CBM6s are non-catalytic carbohydrate binding domains that facilitate the strong binding of the GH catalytic modules to their dedicated, insoluble substrates. The CBM6s are appended to GHs that display a diversity of substrate specificities. For some members of this family, information is available about the specific substrates of the appended GH domains. It includes the CBM domains of various enzymes involved in cell wall degradation, including: an extracellular beta-1,3-glucanase from Lysobacter enzymogenes encoded by the gluC gene (its catalytic domain belongs to the GH16 family); the tandem CBM domains of Pseudomonas sp. PE2 beta-1,3(4)-glucanase A (its catalytic domain also belongs to GH16); a family 6 CBM from Cellvibrio mixtus endoglucanase 5A (CmCBM6), which binds to the beta-1,4/beta-1,3 mixed-linkage glucans lichenan and barley beta-glucan, cello-oligosaccharides, insoluble forms of cellulose, the beta-1,3-glucan laminarin, and xylooligosaccharides; the CBM6 of Fibrobacter succinogenes S85 XynD xylanase, appended to a GH10 domain; and the CBM6 of Cellvibrio japonicus Cel5G, appended to a GH5 (cellulase) domain. The GH5 (cellulase) family includes enzymes with several known activities such as endoglucanase, beta-mannanase, and xylanase, which are involved in the degradation of cellulose and xylans. The GH16 family includes enzymes with lichenase, xyloglucan endotransglycosylase (XET), and beta-agarase activities. CBM6 is an unusual CBM as it represents a chimera of two distinct binding sites with different modes of binding: binding site I within the loop regions and binding site II on the concave face of the beta-sandwich fold. For CmCBM6 it has been shown that these two binding sites have different ligand specificities.
cd04080 CBM6_cellulase-like 1.90e-45 981 1122 1 143
Same domain description as for the cd04080 hit at residues 682-823 above.
cd09004 GH43_bXyl-like 3.68e-45 46 285 2 260
Glycosyl hydrolase family 43 protein such as Bacteroides thetaiotaomicron VPI-5482 alpha-L-arabinofuranosidases (BT3675; BT_3675) and (BT3662; BT_3662); includes mostly xylanases. This glycosyl hydrolase family 43 (GH43) subgroup includes enzymes that have been annotated as having xylan-digesting beta-xylosidase (EC 3.2.1.37) and xylanase (endo-alpha-L-arabinanase, EC 3.2.1.8) activities, as well as the Bacteroides thetaiotaomicron VPI-5482 alpha-L-arabinofuranosidases (EC 3.2.1.55) (BT3675; BT_3675) and (BT3662; BT_3662). It belongs to glycosyl hydrolase clan F (according to the Carbohydrate-Active enZymes database, CAZy), which includes families 43 (GH43) and 62 (GH62). GH43 enzymes are inverting enzymes (i.e. they invert the stereochemistry of the anomeric carbon atom of the substrate) that have an aspartate as the catalytic general base, a glutamate as the catalytic general acid, and another aspartate that is responsible for pKa modulation and for orienting the catalytic acid. Many GH43 enzymes display both alpha-L-arabinofuranosidase and beta-D-xylosidase activity using aryl-glycosides as substrates. A common structural feature of GH43 enzymes is a 5-bladed beta-propeller domain that contains the catalytic acid and catalytic base. A long V-shaped groove, partially enclosed at one end, forms a single extended substrate-binding surface across the face of the propeller.
cd18608 GH43_F5-8_typeC-like 1.39e-44 46 285 3 270
Glycosyl hydrolase family 43 proteins, most having an F5/8 type C domain C-terminal to the GH43 domain. This glycosyl hydrolase family 43 (GH43) subgroup includes enzymes that have been annotated as having beta-xylosidase (EC 3.2.1.37), xylanase (EC 3.2.1.8), and beta-galactosidase (EC 3.2.1.145) activities, and some as F5/8 type C domain (also known as the discoidin (DS) domain)-containing proteins. Most contain an F5/8 type C domain C-terminal to the GH43 domain. It belongs to glycosyl hydrolase clan F (according to the Carbohydrate-Active enZymes database, CAZy), which includes families 43 (GH43) and 62 (GH62). GH43 enzymes are inverting enzymes (i.e. they invert the stereochemistry of the anomeric carbon atom of the substrate) that have an aspartate as the catalytic general base, a glutamate as the catalytic general acid, and another aspartate that is responsible for pKa modulation and for orienting the catalytic acid. Many GH43 enzymes display both alpha-L-arabinofuranosidase and beta-D-xylosidase activity using aryl-glycosides as substrates. Characterized enzymes belonging to this subgroup include those from Lactobacillus brevis (LbAraf43) and Weissella sp. (WAraf43), which show activity with similar catalytic efficiency on 1,5-alpha-L-arabinooligosaccharides with a degree of polymerization (DP) of 2-3; size is limited by an extended loop at the entrance to the active site. A common structural feature of GH43 enzymes is a 5-bladed beta-propeller domain that contains the catalytic acid and catalytic base. A long V-shaped groove, partially enclosed at one end, forms a single extended substrate-binding surface across the face of the propeller.

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
QGH36945.1 0.0 19 1374 1 1355
QNF31086.1 0.0 2 1331 3 1329
QJX61039.1 0.0 17 1331 8 1322
AYV70057.1 0.0 17 1331 8 1322
QKH60208.1 0.0 17 1331 8 1322
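
All five hits have an E-value of 0.0, so the aligned span is the more informative column. A minimal sketch that converts Query Start and Query End into the fraction of the 1374-residue query covered by each hit; the function name is illustrative, and coordinates are assumed to be 1-based and inclusive.

```python
QUERY_LENGTH = 1374  # protein length from the Basic Information section

# (hit ID, query start, query end) copied from the CAZyme Hits table.
HITS = [
    ("QGH36945.1", 19, 1374),
    ("QNF31086.1", 2, 1331),
    ("QJX61039.1", 17, 1331),
    ("AYV70057.1", 17, 1331),
    ("QKH60208.1", 17, 1331),
]


def query_coverage(q_start: int, q_end: int, q_len: int = QUERY_LENGTH) -> float:
    """Fraction of the query covered, assuming 1-based inclusive coordinates."""
    return (q_end - q_start + 1) / q_len


for hit_id, q_start, q_end in HITS:
    print(f"{hit_id}\t{query_coverage(q_start, q_end):.3f}")
```

Every hit spans more than 95% of the query, which suggests that the full multi-domain architecture, not only the GH43 domain, is shared with these database entries.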

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
4QQS_A 9.31e-40 30 301 1 313
Crystal structure of a thermostable family-43 glycoside hydrolase [Halothermothrix orenii H 168], 4QQS_B Crystal structure of a thermostable family-43 glycoside hydrolase [Halothermothrix orenii H 168]
5A8C_A 3.09e-17 46 288 39 314
Chain A, CARBOHYDRATE BINDING FAMILY 6 [Acetivibrio thermocellus], 5A8D_A Chain A, CARBOHYDRATE BINDING FAMILY 6 [Acetivibrio thermocellus]
5MSX_A 2.67e-16 46 288 36 301
Glycoside hydrolase BT_3662 [Bacteroides thetaiotaomicron VPI-5482], 5MSX_B Glycoside hydrolase BT_3662 [Bacteroides thetaiotaomicron VPI-5482], 5MSX_C Glycoside hydrolase BT_3662 [Bacteroides thetaiotaomicron VPI-5482]
2Y8K_A 3.02e-15 686 819 343 476
Chain A, Carbohydrate Binding Family 6 [Acetivibrio thermocellus], 5LA0_A Chain A, Carbohydrate binding family 6 [Acetivibrio thermocellus JW20], 5LA1_A Chain A, Carbohydrate binding family 6 [Acetivibrio thermocellus JW20]
5LA2_A 3.02e-15 686 819 343 476
Chain A, Carbohydrate binding family 6 [Acetivibrio thermocellus], 5LA2_B Chain B, Carbohydrate binding family 6 [Acetivibrio thermocellus]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
P45982 3.64e-12 46 237 15 232
Xylosidase/arabinosidase OS=Butyrivibrio fibrisolvens OX=831 GN=xylB PE=3 SV=1

SignalP and LipoP Annotations

This protein is predicted to carry a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000321 0.998890 0.000225 0.000184 0.000173 0.000166
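
The predicted class is simply the column with the highest score. A minimal sketch of that selection, using the labels and values from the table above; the function name is hypothetical, and reading the scores as per-class probabilities is an assumption.

```python
# Column labels and scores copied from the table above.
LABELS = [
    "Other",
    "SP_Sec_SPI",
    "LIPO_Sec_SPII",
    "TAT_Tat_SPI",
    "TATLIP_Sec_SPII",
    "PILIN_Sec_SPIII",
]
SCORES = [0.000321, 0.998890, 0.000225, 0.000184, 0.000173, 0.000166]


def top_prediction(labels, scores):
    """Return the (label, score) pair with the highest score."""
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have the same length")
    return max(zip(labels, scores), key=lambda pair: pair[1])


label, score = top_prediction(LABELS, SCORES)
print(label, score)  # SP_Sec_SPI 0.99889
```

The top class, SP_Sec_SPI, agrees with the SP call reported for this protein.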

TMHMM Annotations

There are no predicted transmembrane helices in MGYG000001549_00521.