logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000002553_00685

You are here: Home > Sequence: MGYG000002553_00685

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species UMGS1484 sp902388225
Lineage Bacteria; Firmicutes_A; Clostridia; Oscillospirales; Acutalibacteraceae; UMGS1484; UMGS1484 sp902388225
CAZyme ID MGYG000002553_00685
CAZy Family GH36
CAZyme Description hypothetical protein
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
1033 115140.41 4.3366
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000002553 1478585 MAG China Asia
Gene Location Start: 1344;  End: 4445  Strand: +

Full Sequence      Download help

Enzyme Prediction      help

No EC number prediction in MGYG000002553_00685.

CAZyme Signature Domains help

Family Start End Evalue family coverage
GH36 720 1014 7.9e-18 0.40843023255813954

CDD Domains      download full data without filtering help

Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
cd14791 GH36 3.50e-07 720 923 100 299
glycosyl hydrolase family 36 (GH36). GH36 enzymes occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-galactosidase, alpha-N-acetylgalactosaminidase, stachyose synthase, and raffinose synthase. All GH36 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. GH36 members are retaining enzymes that cleave their substrates via an acid/base-catalyzed, double-displacement mechanism involving a covalent glycosyl-enzyme intermediate. Two aspartic acid residues have been identified as the catalytic nucleophile and the acid/base, respectively.
pfam16874 Glyco_hydro_36C 2.72e-06 946 1029 1 78
Glycosyl hydrolase family 36 C-terminal domain. This domain is found at the C-terminus of many family 36 glycoside hydrolases. It has a beta-sandwich structure with a Greek key motif.
cd14256 Dockerin_I 1.28e-04 273 326 1 54
Type I dockerin repeat domain. Bacterial cohesin domains bind to a complementary protein domain named dockerin, and this interaction is required for the formation of the cellulosome, a cellulose-degrading complex. The cellulosome consists of scaffoldin, a noncatalytic scaffolding polypeptide, that comprises repeating cohesion modules and a single carbohydrate-binding module (CBM). Specific calcium-dependent interactions between cohesins and dockerins appear to be essential for cellulosome assembly. This subfamily represents type I dockerins, which are responsible for anchoring a variety of enzymatic domains to the complex.
pfam02065 Melibiase 1.90e-04 720 860 139 285
Melibiase. Glycoside hydrolase families GH27, GH31 and GH36 form the glycoside hydrolase clan GH-D. Glycoside hydrolase family 36 can be split into 11 families, GH36A to GH36K. This family includes enzymes from GH36A-B and GH36D-K and from GH27.

CAZyme Hits      help

Hit ID E-Value Query Start Query End Hit Start Hit End
QGQ94273.1 1.29e-123 343 1030 345 1029
AEE97820.1 9.28e-14 690 1026 363 684
ABU59612.1 9.82e-12 499 1029 150 682
BBH24245.1 1.78e-11 40 184 1097 1244
CED94212.1 2.98e-11 720 1031 396 695

PDB Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
6JHP_A 7.53e-07 720 1031 468 757
Crystalstructure of the glycoside hydrolase family 36 alpha-galactosidase from Paecilomyces thermophila [Paecilomyces sp. 'thermophila'],6JHP_B Crystal structure of the glycoside hydrolase family 36 alpha-galactosidase from Paecilomyces thermophila [Paecilomyces sp. 'thermophila'],6JHP_C Crystal structure of the glycoside hydrolase family 36 alpha-galactosidase from Paecilomyces thermophila [Paecilomyces sp. 'thermophila'],6JHP_D Crystal structure of the glycoside hydrolase family 36 alpha-galactosidase from Paecilomyces thermophila [Paecilomyces sp. 'thermophila']

Swiss-Prot Hits      download full data without filtering help

Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q5AU92 4.11e-06 720 1030 460 748
Alpha-galactosidase C OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) OX=227321 GN=aglC PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000666 0.998520 0.000218 0.000222 0.000189 0.000175

TMHMM  Annotations      download full data without filtering help

start end
5 27