You are browsing environment: HUMAN GUT

CAZyme Information: MGYG000000084_00470



Basic Information

Species Gemmiger formicilis
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Ruminococcaceae; Gemmiger; Gemmiger formicilis
CAZyme ID MGYG000000084_00470
CAZy Family GH59
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
2994 - 321782.7 4.2228
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000084 3006488 Isolate United Kingdom Europe
Gene Location Start: 525295;  End: 534279  Strand: -
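
The gene coordinates and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene span includes the stop codon; the page does not state either convention, so treat both as assumptions.

```python
# Cross-check gene span against protein length.
# Assumptions (not stated on the page): 1-based inclusive coordinates,
# and the annotated span includes the stop codon.
gene_start = 525295
gene_end = 534279
protein_length = 2994

gene_nt = gene_end - gene_start + 1   # nucleotides in the gene span
codons = gene_nt // 3                 # whole codons, including the stop
predicted_aa = codons - 1             # drop the stop codon

print(f"gene span: {gene_nt} nt, {codons} codons, {predicted_aa} aa")
print("consistent with listed length:", predicted_aa == protein_length)
```

Under those assumptions the span is a multiple of three and yields the listed 2994 residues.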


Enzyme Prediction

No EC number is predicted for MGYG000000084_00470.

CAZyme Signature Domains

Family Start End E-Value Family Coverage
GH59 599 1298 4.5e-146 0.9920760697305864
GH43 2065 2471 2.7e-74 0.975609756097561
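
To put the two signature domains in context, the following sketch computes each domain's length and the size of the region between them. It assumes the Start and End columns are 1-based inclusive residue positions (an assumption; the page does not say), and the helper name `span_length` is purely illustrative.

```python
# Domain spans from the signature-domain table.
# Assumption: Start/End are 1-based, inclusive residue positions.
protein_length = 2994
domains = [("GH59", 599, 1298), ("GH43", 2065, 2471)]

def span_length(start, end):
    """Number of residues in an inclusive [start, end] span."""
    return end - start + 1

lengths = {family: span_length(start, end) for family, start, end in domains}

# Residues lying strictly between the end of GH59 and the start of GH43.
inter_domain = domains[1][1] - domains[0][2] - 1

for family, n in lengths.items():
    print(f"{family}: {n} residues ({n / protein_length:.1%} of the protein)")
print(f"region between domains: {inter_domain} residues")
```

The two catalytic domains together account for well under half of the 2994-residue protein, which is consistent with a large multi-domain architecture.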

CDD Domains

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd18825 GH43_CtGH43-like 1.30e-97 2033 2486 2 284
Glycosyl hydrolase family 43 protein similar to Clostridium thermocellum exo-beta-1,3-galactanase CtGH43 and Ruminococcus champanellensis arabinanase Ara43A. This uncharacterized glycosyl hydrolase family 43 (GH43) subgroup belongs to a subgroup which includes characterized enzymes with exo-beta-1,3-galactanase (EC 3.2.1.145, also known as galactan 1,3-beta-galactosidase) activity such as Clostridium thermocellum (Ct1,3Gal43A or CtGH43) and Phanerochaete chrysosporium 1,3Gal43A (Pc1,3Gal43A), and arabinanase (EC 3.2.1.99) activity such as Ruminococcus champanellensis Ara43A. GH43 are inverting enzymes (i.e. they invert the stereochemistry of the anomeric carbon atom of the substrate) that have an aspartate as the catalytic general base, a glutamate as the catalytic general acid and another aspartate that is responsible for pKa modulation and orienting the catalytic acid. Many GH43 enzymes display both alpha-L-arabinofuranosidase and beta-D-xylosidase activity using aryl-glycosides as substrates. A common structural feature of GH43 enzymes is a 5-bladed beta-propeller domain that contains the catalytic acid and catalytic base. A long V-shaped groove, partially enclosed at one end, forms a single extended substrate-binding surface across the face of the propeller.
pfam02057 Glyco_hydro_59 9.59e-87 604 935 1 292
Glycosyl hydrolase family 59.
cd08985 GH43_CtGH43-like 1.42e-57 2033 2487 2 273
Glycosyl hydrolase family 43 protein such as Clostridium thermocellum exo-beta-1,3-galactanase CtGH43 and Ruminococcus champanellensis arabinanase Ara43A. This glycosyl hydrolase family 43 (GH43) subgroup includes characterized enzymes with exo-beta-1,3-galactanase (EC 3.2.1.145, also known as galactan 1,3-beta-galactosidase) activity such as Clostridium thermocellum (Ct1,3Gal43A or CtGH43) and Phanerochaete chrysosporium 1,3Gal43A (Pc1,3Gal43A), and arabinanase (EC 3.2.1.99) activity such as Ruminococcus champanellensis Ara43A. GH43 are inverting enzymes (i.e. they invert the stereochemistry of the anomeric carbon atom of the substrate) that have an aspartate as the catalytic general base, a glutamate as the catalytic general acid and another aspartate that is responsible for pKa modulation and orienting the catalytic acid. Many GH43 enzymes display both alpha-L-arabinofuranosidase and beta-D-xylosidase activity using aryl-glycosides as substrates. A common structural feature of GH43 enzymes is a 5-bladed beta-propeller domain that contains the catalytic acid and catalytic base. A long V-shaped groove, partially enclosed at one end, forms a single extended substrate-binding surface across the face of the propeller.
cd18821 GH43_Pc3Gal43A-like 3.52e-51 2059 2487 6 262
Glycosyl hydrolase family 43 protein such as Phanerochaete chrysosporium exo-beta-1,3-galactanase (Pc1,3Gal43A, 1,3Gal43A). This glycosyl hydrolase family 43 (GH43) subgroup includes characterized enzymes with exo-beta-1,3-galactanase (EC 3.2.1.145, also known as galactan 1,3-beta-galactosidase) activity such as Phanerochaete chrysosporium 1,3Gal43A (Pc1,3Gal43A), Fusarium oxysporum 12S Fo/1 (3Gal), and Streptomyces sp. 19(2012) SGalase1 and SGalase2. It belongs to the GH43_CtGH43 subgroup of the glycosyl hydrolase clan F (according to carbohydrate-active enzymes database (CAZY)) which includes family 43 (GH43) and 62 (GH62) families. GH43_CtGH43 includes proteins such as Clostridium thermocellum exo-beta-1,3-galactanase (Ct1,3Gal43A or CtGH43) which is comprised of the GH43 domain, a CBM13 domain, and a dockerin domain, exhibits an unusual ability to hydrolyze beta-1,3-galactan in the presence of a beta-1,6 linked branch, and is missing an essential acidic residue suggesting a mechanism by which it bypasses beta-1,6 linked branches in the substrate. GH43 are inverting enzymes (i.e. they invert the stereochemistry of the anomeric carbon atom of the substrate) that have an aspartate as the catalytic general base, a glutamate as the catalytic general acid and another aspartate that is responsible for pKa modulation and orienting the catalytic acid. Many GH43 enzymes display both alpha-L-arabinofuranosidase and beta-D-xylosidase activity using aryl-glycosides as substrates. A common structural feature of GH43 enzymes is a 5-bladed beta-propeller domain that contains the catalytic acid and catalytic base. A long V-shaped groove, partially enclosed at one end, forms a single extended substrate-binding surface across the face of the propeller.
cd18822 GH43_CtGH43-like 3.25e-50 2218 2487 45 266
Glycosyl hydrolase family 43 protein such as Clostridium thermocellum exo-beta-1,3-galactanase (Ct1,3Gal43A or CtGH43). This glycosyl hydrolase family 43 (GH43) subgroup includes characterized enzymes with exo-beta-1,3-galactanase (EC 3.2.1.145, also known as galactan 1,3-beta-galactosidase) activity such as Clostridium thermocellum exo-beta-1,3-galactanase (Ct1,3Gal43A or CtGH43), Streptomyces avermitilis MA-4680 = NBRC 14893 (Sa1,3Gal43A;SAV2109) (1,3Gal43A), and Ruminiclostridium thermocellum ATCC 27405 (Ct1,3Gal43A;CtGH43;Cthe_0661) (1,3Gal43A). It belongs to the GH43_CtGH43 subgroup of the glycosyl hydrolase clan F (according to carbohydrate-active enzymes database (CAZY)) which includes family 43 (GH43) and 62 (GH62) families. GH43_CtGH43 includes proteins such as Clostridium thermocellum exo-beta-1,3-galactanase (Ct1,3Gal43A or CtGH43) which is comprised of the GH43 domain, a CBM13 domain, and a dockerin domain, exhibits an unusual ability to hydrolyze beta-1,3-galactan in the presence of a beta-1,6 linked branch, and is missing an essential acidic residue suggesting a mechanism by which it bypasses beta-1,6 linked branches in the substrate. GH43 are inverting enzymes (i.e. they invert the stereochemistry of the anomeric carbon atom of the substrate) that have an aspartate as the catalytic general base, a glutamate as the catalytic general acid and another aspartate that is responsible for pKa modulation and orienting the catalytic acid. Many GH43 enzymes display both alpha-L-arabinofuranosidase and beta-D-xylosidase activity using aryl-glycosides as substrates. A common structural feature of GH43 enzymes is a 5-bladed beta-propeller domain that contains the catalytic acid and catalytic base. A long V-shaped groove, partially enclosed at one end, forms a single extended substrate-binding surface across the face of the propeller.

CAZyme Hits

Hit ID E-Value Query Start Query End Hit Start Hit End
QUW95523.1 1.88e-225 579 1432 36 916
QLH25411.1 1.88e-225 583 1426 39 909
AZM74018.1 3.91e-223 579 1432 33 913
QKW59509.1 2.65e-222 579 1432 33 913
AMW11854.1 1.63e-218 583 1432 39 915

PDB Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
3VSF_A 5.47e-31 2235 2495 113 338
Chain A, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSF_B Chain B, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSF_C Chain C, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSF_D Chain D, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSF_E Chain E, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSF_F Chain F, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSZ_A Chain A, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSZ_B Chain B, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSZ_C Chain C, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSZ_D Chain D, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSZ_E Chain E, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VSZ_F Chain F, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT0_A Chain A, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT0_B Chain B, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT0_C Chain C, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT0_D Chain D, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT0_E Chain E, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT0_F Chain F, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT1_A Chain A, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT1_B Chain B, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT1_C Chain C, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT1_D Chain D, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT1_E Chain E, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT1_F Chain F, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT2_A Chain A, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT2_B Chain B, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT2_C Chain C, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT2_D Chain D, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT2_E Chain E, Ricin B lectin [Acetivibrio thermocellus ATCC 27405], 3VT2_F Chain F, Ricin B lectin [Acetivibrio thermocellus ATCC 27405]
6EUG_A 1.15e-28 2237 2504 107 374
The GH43, Beta 1,3 Galactosidase, BT3683 with galactoimidazole [Bacteroides thetaiotaomicron], 6EUH_A The GH43, Beta 1,3 Galactosidase, BT3683 with galactodeoxynojirimycin [Bacteroides thetaiotaomicron VPI-5482], 6EUH_B The GH43, Beta 1,3 Galactosidase, BT3683 with galactodeoxynojirimycin [Bacteroides thetaiotaomicron VPI-5482], 6EUH_C The GH43, Beta 1,3 Galactosidase, BT3683 with galactodeoxynojirimycin [Bacteroides thetaiotaomicron VPI-5482], 6EUI_A The GH43, Beta 1,3 Galactosidase, BT3683 with galactose [Bacteroides thetaiotaomicron VPI-5482]
6EUF_A 3.37e-26 2240 2503 86 302
The GH43, Beta 1,3 Galactosidase, BT0265 [Bacteroides thetaiotaomicron VPI-5482], 6EUF_B The GH43, Beta 1,3 Galactosidase, BT0265 [Bacteroides thetaiotaomicron VPI-5482], 6EUF_C The GH43, Beta 1,3 Galactosidase, BT0265 [Bacteroides thetaiotaomicron VPI-5482], 6EUF_D The GH43, Beta 1,3 Galactosidase, BT0265 [Bacteroides thetaiotaomicron VPI-5482], 6EUJ_A The GH43, Beta 1,3 Galactosidase, BT0265 [Bacteroides thetaiotaomicron VPI-5482], 6EUJ_B The GH43, Beta 1,3 Galactosidase, BT0265 [Bacteroides thetaiotaomicron VPI-5482], 6EUJ_C The GH43, Beta 1,3 Galactosidase, BT0265 [Bacteroides thetaiotaomicron VPI-5482], 6EUJ_D The GH43, Beta 1,3 Galactosidase, BT0265 [Bacteroides thetaiotaomicron VPI-5482]
7BYS_A 6.08e-25 2231 2489 72 279
Chain A, Galactan 1,3-beta-galactosidase [Phanerodontia chrysosporium], 7BYS_B Chain B, Galactan 1,3-beta-galactosidase [Phanerodontia chrysosporium], 7BYT_A Chain A, Galactan 1,3-beta-galactosidase [Phanerodontia chrysosporium]
7BYV_A 1.49e-24 2231 2489 73 280
Chain A, Galactan 1,3-beta-galactosidase [Phanerodontia chrysosporium]

Swiss-Prot Hits

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q5SNX7 4.96e-19 608 942 38 332
Galactocerebrosidase OS=Danio rerio OX=7955 GN=galc PE=2 SV=1
B5X3C1 1.04e-17 601 1061 34 452
Galactocerebrosidase OS=Salmo salar OX=8030 GN=galc PE=2 SV=1
P54803 2.48e-17 601 942 52 355
Galactocerebrosidase OS=Homo sapiens OX=9606 GN=GALC PE=1 SV=3
O02791 1.29e-16 601 942 52 355
Galactocerebrosidase OS=Macaca mulatta OX=9544 GN=GALC PE=1 SV=2
P54818 1.69e-16 601 942 52 355
Galactocerebrosidase OS=Mus musculus OX=10090 GN=Galc PE=1 SV=2

SignalP and Lipop Annotations

This protein is predicted to carry a signal peptide (SP).

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000386 0.998815 0.000236 0.000198 0.000177 0.000149
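
The relationship between the score row and the reported call can be illustrated with a short sketch. It assumes the call corresponds to the highest-scoring class and that the scores are probabilities summing to roughly 1; the page does not document how the call is derived, so this is an interpretation rather than the tool's actual procedure.

```python
# Pick the top-scoring class from the prediction row above.
# Assumption: the reported call is simply the highest-scoring class.
labels = ["Other", "SP_Sec_SPI", "LIPO_Sec_SPII",
          "TAT_Tat_SPI", "TATLIP_Sec_SPII", "PILIN_Sec_SPIII"]
scores = [0.000386, 0.998815, 0.000236, 0.000198, 0.000177, 0.000149]

best_label, best_score = max(zip(labels, scores), key=lambda pair: pair[1])
total = sum(scores)

print(f"top class: {best_label} (score {best_score})")
print(f"sum of scores: {total:.6f}")
```

The top class, SP_Sec_SPI, agrees with the SP call reported for this protein.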

TMHMM Annotations

start end
12 31
2970 2989
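
A quick way to read the two predicted helices is to compute their lengths and their distance from each terminus. The sketch below assumes 1-based inclusive positions and reuses the protein length of 2994 from the Basic Information section; the `describe` helper is illustrative only. Note that a predicted helix this close to the N-terminus may overlap the predicted signal peptide, which transmembrane predictors are known to sometimes report as a helix; the page itself does not resolve this.

```python
# Summarise the predicted transmembrane helices.
# Assumption: start/end are 1-based, inclusive residue positions.
protein_length = 2994
helices = [(12, 31), (2970, 2989)]

def describe(start, end, length):
    """Length of a helix and its distance from each terminus."""
    return {
        "length": end - start + 1,
        "from_n_term": start - 1,     # residues before the helix
        "from_c_term": length - end,  # residues after the helix
    }

summary = [describe(start, end, protein_length) for start, end in helices]

for (start, end), info in zip(helices, summary):
    print(f"helix {start}-{end}: {info}")
```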