logo
sublogo
You are browsing environment: HUMAN GUT
help

CAZyme Information: MGYG000000029_02746

You are here: Home > Sequence: MGYG000000029_02746

Basic Information | Genomic context | Full Sequence | Enzyme annotations |  CAZy signature domains |  CDD domains | CAZyme hits | PDB hits | Swiss-Prot hits | SignalP and Lipop annotations | TMHMM annotations

Basic Information help

Species Bacteroides finegoldii
Lineage Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae; Bacteroides; Bacteroides finegoldii
CAZyme ID MGYG000000029_02746
CAZy Family GH36
CAZyme Description Alpha-galactosidase AgaA
CAZyme Property
Protein Length CGC Molecular Weight Isoelectric Point
735 MGYG000000029_16|CGC1 83603.4 6.9313
Genome Property
Genome Assembly ID Genome Size Genome Type Country Continent
MGYG000000029 4415875 Isolate United Kingdom Europe
Gene Location Start: 14509;  End: 16716  Strand: +

Full Sequence      Download help

MKNLIRICLF  TMLLFVTGDL  AAQNIHLSTP  RTSLVLSAPA  RGELKYVYYG  SKLTENDCRE60
VYNAPTMQHV  AYPVYGMNCP  GESALAVTHA  DGNMTLQMEV  AGTEIKNTDG  AVTAVIKLKD120
KVYPFSVNVC  YKAYSDVDII  ETWTEIAHAE  KKNVMLRKFD  SAFLPVRRGD  VWLSSLYGSW180
ANEGRLVQEP  LEPGMKVIKN  RDGVRNSHTA  HAEVMFSLDG  KPQENYGNVI  GAALCYGGNY240
KLRIDTDDSE  YHQFYAGINE  ENSVYNLKKG  ELFITPALAL  TYSAEGLSGC  SRNFHRWARK300
YKLAHGNSLR  KILLNSWEGV  YFDINENGMN  QMMGDIASMG  GELFVMDDGW  FGDKYKRNSD360
NSSLGDWKVD  TGKLPGGIRK  LVDDARQYKV  KFGIWIEPEM  ANTTSELYEK  HPEWILKAPL420
REPVLGRGGT  QVVLDLGNPQ  VQDFIFNLVD  TLMTNYPEID  YIKWDANMAI  MNHGSDYLPK480
DEQSHLYIAY  HRGFENVCRR  IRAKYPELTI  QACASGGGRA  NYGVLPYFDE  FWVSDNTDAL540
QRIYMQWGAS  YFFPAIAMAS  HISAAPNHQT  FRTIPLKYRI  DVAMSGRLGM  EIQPKNMTGE600
EKELCRKAIA  DYKMIRPVVQ  LGDIYRLLSP  YDRLGAASLM  YVAPEKEKAV  FYWWKTEHFC660
NQHLPRIRMA  GLHPDKIYKV  TELNRIDNQP  LDYEGKAFSG  AYLMANGLEI  PYNHKVDYHK720
LSDYSSRVLY  LEEVK735

Enzyme Prediction      help

No EC number prediction in MGYG000000029_02746.

CAZyme Signature Domains help

Created with Snap367311014718322025729433036740444147751455158862466169825709GH36
Family Start End Evalue family coverage
GH36 25 709 9.2e-212 0.9956395348837209

CDD Domains      download full data without filtering help

Created with Snap3673110147183220257294330367404441477514551588624661698273622Melibiase310613GH3623630GalA42268Glyco_hydro_36N637731Glyco_hydro_36C
Cdd ID Domain E-Value qStart qEnd sStart sEnd Domain Description
pfam02065 Melibiase 7.69e-132 273 622 2 347
Melibiase. Glycoside hydrolase families GH27, GH31 and GH36 form the glycoside hydrolase clan GH-D. Glycoside hydrolase family 36 can be split into 11 families, GH36A to GH36K. This family includes enzymes from GH36A-B and GH36D-K and from GH27.
cd14791 GH36 5.16e-116 310 613 2 298
glycosyl hydrolase family 36 (GH36). GH36 enzymes occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-galactosidase, alpha-N-acetylgalactosaminidase, stachyose synthase, and raffinose synthase. All GH36 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. GH36 members are retaining enzymes that cleave their substrates via an acid/base-catalyzed, double-displacement mechanism involving a covalent glycosyl-enzyme intermediate. Two aspartic acid residues have been identified as the catalytic nucleophile and the acid/base, respectively.
COG3345 GalA 1.38e-109 23 630 2 600
Alpha-galactosidase [Carbohydrate transport and metabolism].
pfam16875 Glyco_hydro_36N 1.45e-43 42 268 1 256
Glycosyl hydrolase family 36 N-terminal domain. This domain is found at the N-terminus of many family 36 glycoside hydrolases. It has a beta-supersandwich fold.
pfam16874 Glyco_hydro_36C 2.27e-21 637 731 2 78
Glycosyl hydrolase family 36 C-terminal domain. This domain is found at the C-terminus of many family 36 glycoside hydrolases. It has a beta-sandwich structure with a Greek key motif.

CAZyme Hits      help

Created with Snap36731101471832202572943303674044414775145515886246616981735EDO12201.1|GH36|3.2.1.221735QRQ59073.1|GH361735SCV06946.1|GH361735ALJ47535.1|GH36|3.2.1.221735QUR45661.1|GH36
Hit ID E-Value Query Start Query End Hit Start Hit End
EDO12201.1 0.0 1 735 2 736
QRQ59073.1 0.0 1 735 1 735
SCV06946.1 0.0 1 735 1 735
ALJ47535.1 0.0 1 735 1 735
QUR45661.1 0.0 1 735 2 736

PDB Hits      download full data without filtering help

Created with Snap3673110147183220257294330367404441477514551588624661698166974FNQ_A236974FNR_A1066862XN0_A236974FNP_A1066862XN2_A
Hit ID E-Value Query Start Query End Hit Start Hit End Description
4FNQ_A 3.15e-112 16 697 3 709
Crystalstructure of GH36 alpha-galactosidase AgaB from Geobacillus stearothermophilus [Geobacillus stearothermophilus]
4FNR_A 9.85e-110 23 697 10 709
Crystalstructure of GH36 alpha-galactosidase AgaA from Geobacillus stearothermophilus [Geobacillus stearothermophilus],4FNR_B Crystal structure of GH36 alpha-galactosidase AgaA from Geobacillus stearothermophilus [Geobacillus stearothermophilus],4FNR_C Crystal structure of GH36 alpha-galactosidase AgaA from Geobacillus stearothermophilus [Geobacillus stearothermophilus],4FNR_D Crystal structure of GH36 alpha-galactosidase AgaA from Geobacillus stearothermophilus [Geobacillus stearothermophilus]
2XN0_A 2.92e-109 106 686 129 701
Structureof alpha-galactosidase from Lactobacillus acidophilus NCFM, PtCl4 derivative [Lactobacillus acidophilus NCFM],2XN0_B Structure of alpha-galactosidase from Lactobacillus acidophilus NCFM, PtCl4 derivative [Lactobacillus acidophilus NCFM],2XN1_A Structure of alpha-galactosidase from Lactobacillus acidophilus NCFM with TRIS [Lactobacillus acidophilus NCFM],2XN1_B Structure of alpha-galactosidase from Lactobacillus acidophilus NCFM with TRIS [Lactobacillus acidophilus NCFM],2XN1_C Structure of alpha-galactosidase from Lactobacillus acidophilus NCFM with TRIS [Lactobacillus acidophilus NCFM],2XN1_D Structure of alpha-galactosidase from Lactobacillus acidophilus NCFM with TRIS [Lactobacillus acidophilus NCFM]
4FNP_A 7.46e-109 23 697 10 709
Crystalstructure of GH36 alpha-galactosidase AgaA A355E from Geobacillus stearothermophilus [Geobacillus stearothermophilus],4FNP_B Crystal structure of GH36 alpha-galactosidase AgaA A355E from Geobacillus stearothermophilus [Geobacillus stearothermophilus],4FNP_C Crystal structure of GH36 alpha-galactosidase AgaA A355E from Geobacillus stearothermophilus [Geobacillus stearothermophilus],4FNP_D Crystal structure of GH36 alpha-galactosidase AgaA A355E from Geobacillus stearothermophilus [Geobacillus stearothermophilus],4FNS_A Crystal structure of GH36 alpha-galactosidase AgaA A355E from Geobacillus stearothermophilus in complex with 1-deoxygalactonojirimycin [Geobacillus stearothermophilus],4FNS_B Crystal structure of GH36 alpha-galactosidase AgaA A355E from Geobacillus stearothermophilus in complex with 1-deoxygalactonojirimycin [Geobacillus stearothermophilus],4FNS_C Crystal structure of GH36 alpha-galactosidase AgaA A355E from Geobacillus stearothermophilus in complex with 1-deoxygalactonojirimycin [Geobacillus stearothermophilus],4FNS_D Crystal structure of GH36 alpha-galactosidase AgaA A355E from Geobacillus stearothermophilus in complex with 1-deoxygalactonojirimycin [Geobacillus stearothermophilus]
2XN2_A 8.04e-109 106 686 129 701
Structureof alpha-galactosidase from Lactobacillus acidophilus NCFM with galactose [Lactobacillus acidophilus NCFM]

Swiss-Prot Hits      download full data without filtering help

Created with Snap3673110147183220257294330367404441477514551588624661698109733sp|Q2TW69|AGALC_ASPOR109733sp|B8NWY6|AGALC_ASPFN109734sp|Q0CVH2|AGALC_ASPTN113732sp|Q5AU92|AGALC_EMENI23697sp|Q9ALJ4|AGAA_GEOSE
Hit ID E-Value Query Start Query End Hit Start Hit End Description
Q2TW69 1.16e-110 109 733 150 750
Probable alpha-galactosidase C OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40) OX=510516 GN=aglC PE=3 SV=1
B8NWY6 1.16e-110 109 733 150 750
Probable alpha-galactosidase C OS=Aspergillus flavus (strain ATCC 200026 / FGSC A1120 / IAM 13836 / NRRL 3357 / JCM 12722 / SRRC 167) OX=332952 GN=aglC PE=3 SV=2
Q0CVH2 2.06e-110 109 734 146 747
Probable alpha-galactosidase C OS=Aspergillus terreus (strain NIH 2624 / FGSC A1156) OX=341663 GN=aglC PE=3 SV=1
Q5AU92 4.63e-109 113 732 154 748
Alpha-galactosidase C OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) OX=227321 GN=aglC PE=1 SV=1
Q9ALJ4 5.39e-109 23 697 10 709
Alpha-galactosidase AgaA OS=Geobacillus stearothermophilus OX=1422 GN=agaA PE=1 SV=1

SignalP and Lipop Annotations help

This protein is predicted as SP

Other SP_Sec_SPI LIPO_Sec_SPII TAT_Tat_SPI TATLIP_Sec_SPII PILIN_Sec_SPIII
0.000238 0.999186 0.000144 0.000140 0.000131 0.000127

TMHMM  Annotations      help

There is no transmembrane helices in MGYG000000029_02746.