logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000129_01194

You are here: Home > Sequence: MGYG000000129_01194

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Marseille-P4683 sp900232885
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Acutalibacteraceae; Marseille-P4683; Marseille-P4683 sp900232885
CAZyme ID MGYG000000129_01194
CAZy Family GH116
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1681 MGYG000000129_4|CGC1 186413.9 4.0316
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000129 2720267 Isolate Canada North America
Gene Location Start: 241789;  End: 246834  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000000129_01194.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH116 428 772 1.2e-20 0.9889807162534435
CBM6 1343 1475 9.1e-19 0.855072463768116
CBM6 1207 1332 1.4e-17 0.8478260869565217
CBM6 1080 1199 4.6e-16 0.7463768115942029

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam12215 Glyco_hydr_116N 7.11e-34 52 384 1 308
beta-glucosidase 2, glycosyl-hydrolase family 116 N-term. This domain is found in bacteria, archaea and eukaryotes. This domain is typically between 320 to 354 amino acids in length. This domain is found associated with pfam04685. It is found just after the extreme N-terminus. The N-terminal is thought to be the luminal domain while the C terminal is the cytosolic domain. The catalytic domain of GBA-2 is unknown. The primary catabolic pathway for glucosylceramide is catalysis by the lysosomal enzyme glucocerebrosidase. In higher eukaryotes, glucosylceramide is the precursor of glycosphingolipids, a complex group of ubiquitous membrane lipids. Mutations in the human protein cause motor-neurone defects in hereditary spastic paraplegia. The catalytic nucleophile, identified in UniProtKB:Q97YG8_SULSO, is a glutamine-335 in the downstream family pfam04685.
226800 COG4354 6.06e-21 42 780 3 721
COG4354 Uncharacterized protein, contains GBA2_N and DUF608 domains [Function unknown].
cd04084 CBM6_xylanase-like 7.02e-20 1203 1332 1 123
Carbohydrate Binding Module 6 (CBM6); many are appended to glycoside hydrolase (GH) family 11 and GH43 xylanase domains. This family includes carbohydrate binding module 6 (CBM6) domains that are appended mainly to glycoside hydrolase (GH) family domains, including GH3, GH11, and GH43 domains. These CBM6s are non-catalytic carbohydrate binding domains that facilitate the strong binding of the GH catalytic modules with their dedicated, insoluble substrates. Examples of proteins having CMB6s belonging to this family are Microbispora bispora GghA, a 1,4-beta-D-glucan glucohydrolase (GH3); Clostridium thermocellum xylanase U (GH11), and Penicillium purpurogenum ABF3, a bifunctional alpha-L-arabinofuranosidase/xylobiohydrolase (GH43). GH3 comprises enzymes with activities including beta-glucosidase (hydrolyzes beta-galactosidase) and beta-xylosidase (hydrolyzes 1,4-beta-D-xylosidase). GH11 family comprises enzymes with xylanase (endo-1,4-beta-xylanase) activity which catalyze the hydrolysis of beta-1,4 bonds of xylan, the major component of hemicelluloses, to generate xylooligosaccharides and xylose. GH43 includes beta-xylosidases and beta-xylanases, using aryl-glycosides as substrates. CBM6 is an unusual CBM as it represents a chimera of two distinct binding sites with different modes of binding: binding site I within the loop regions and binding site II on the concave face of the beta-sandwich fold.
cd04084 CBM6_xylanase-like 1.55e-18 1344 1473 2 123
Carbohydrate Binding Module 6 (CBM6); many are appended to glycoside hydrolase (GH) family 11 and GH43 xylanase domains. This family includes carbohydrate binding module 6 (CBM6) domains that are appended mainly to glycoside hydrolase (GH) family domains, including GH3, GH11, and GH43 domains. These CBM6s are non-catalytic carbohydrate binding domains that facilitate the strong binding of the GH catalytic modules with their dedicated, insoluble substrates. Examples of proteins having CMB6s belonging to this family are Microbispora bispora GghA, a 1,4-beta-D-glucan glucohydrolase (GH3); Clostridium thermocellum xylanase U (GH11), and Penicillium purpurogenum ABF3, a bifunctional alpha-L-arabinofuranosidase/xylobiohydrolase (GH43). GH3 comprises enzymes with activities including beta-glucosidase (hydrolyzes beta-galactosidase) and beta-xylosidase (hydrolyzes 1,4-beta-D-xylosidase). GH11 family comprises enzymes with xylanase (endo-1,4-beta-xylanase) activity which catalyze the hydrolysis of beta-1,4 bonds of xylan, the major component of hemicelluloses, to generate xylooligosaccharides and xylose. GH43 includes beta-xylosidases and beta-xylanases, using aryl-glycosides as substrates. CBM6 is an unusual CBM as it represents a chimera of two distinct binding sites with different modes of binding: binding site I within the loop regions and binding site II on the concave face of the beta-sandwich fold.
smart00606 CBD_IV 1.02e-17 1199 1332 2 129
Cellulose Binding Domain Type IV.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
BCI61147.1 0.0 1 1681 1 1681
ABR44834.1 3.51e-192 38 892 46 887
QRO15046.1 3.51e-192 38 892 46 887
AQQ10297.1 4.56e-192 37 898 46 895
ARN56247.1 9.42e-190 37 898 46 895

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
2V4V_A 3.53e-13 1067 1199 3 129
CrystalStructure of a Family 6 Carbohydrate-Binding Module from Clostridium cellulolyticum in complex with xylose [Ruminiclostridium cellulolyticum]
5M77_A 4.43e-13 1199 1383 387 573
ChainA, Alpha-1,6-mannanase [Niallia circulans],5M77_B Chain B, Alpha-1,6-mannanase [Niallia circulans]
1GMM_A 1.35e-12 1199 1317 1 115
ChainA, CBM6 [Acetivibrio thermocellus],1UXX_X Chain X, Xylanase U [Acetivibrio thermocellus]
1W9S_A 2.02e-11 1199 1332 9 138
ChainA, Bh0236 Protein [Halalkalibacterium halodurans],1W9S_B Chain B, Bh0236 Protein [Halalkalibacterium halodurans],1W9T_A Chain A, Bh0236 Protein [Halalkalibacterium halodurans],1W9T_B Chain B, Bh0236 Protein [Halalkalibacterium halodurans],1W9W_A Chain A, Bh0236 Protein [Halalkalibacterium halodurans]
1UY1_A 1.88e-10 1199 1336 18 145
Bindingsub-site dissection of a family 6 carbohydrate-binding module by X-ray crystallography and isothermal titration calorimetry [Thermoclostridium stercorarium],1UY2_A Binding sub-site dissection of a family 6 carbohydrate-binding module by X-ray crystallography and isothermal titration calorimetry [Thermoclostridium stercorarium],1UY3_A Binding sub-site dissection of a family 6 carbohydrate-binding module by X-ray crystallography and isothermal titration calorimetry [Thermoclostridium stercorarium],1UY4_A Binding sub-site dissection of a family 6 carbohydrate-binding module by X-ray crystallography and isothermal titration calorimetry [Thermoclostridium stercorarium]

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q8GJ44 1.12e-29 1066 1442 247 622
Endo-1,4-beta-xylanase A OS=Thermoclostridium stercorarium OX=1510 GN=xynA PE=1 SV=2
P33558 1.33e-20 957 1317 139 496
Endo-1,4-beta-xylanase A OS=Thermoclostridium stercorarium OX=1510 GN=xynA PE=1 SV=2
Q5M868 5.99e-08 52 738 142 837
Non-lysosomal glucosylceramidase OS=Rattus norvegicus OX=10116 GN=Gba2 PE=2 SV=2

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000289 0.998803 0.000236 0.000238 0.000210 0.000191

TMHMM  Annotations      download full data without filtering help

start end
7 29