CAZyme Information

Basic Information
SpeciesArabidopsis thaliana
Cazyme IDAT1G61820.1
FamilyGH1
Protein PropertiesLength: 517 Molecular Weight: 59112.9 Isoelectric Point: 6.9199
ChromosomeChromosome/Scaffold: 1 Start: 22835078 End: 22838615
DescriptionOs4bglu18 - monolignol beta-glucoside homologue, expressed
View CDS
External Links
TAIR
Geo Profiles
ATTED-II
NCBI Taxonomy
Plaza
SIGnAL
CAZyDB
Entrez Gene
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH1335070
  SPFPSDFLFGTASSAFQYEGAFLTDGKGLNNWDVFAHENPGKIVDGSNGDIATDQYHRYMEDIQSMNFLGVNSYRLSISWSRVLPNGRFGVINYKGIKYY
  NNLIDALIKKGITPFVTLNHFDYPQELENRFKSWLSSEMQKDFGYLADICFKHFGDRVKHWITINEPNQHISLAYRSGLFPPARCSMPYGNCTHGNSETE
  PFIAAHNMILAHAKAIQIYRTKYQREQKGIIGIVVQTSWFEPISDSIADKNAAERAQSFYSNWILDPVVYGKYPEEMVNLLGSALPKFSSNEMNSLMSYK
  SDFLGINHYTSYFIQDCLITACNSGDGASKSEGLALKLDRKGNVSIGELTDVNWQHIDPNGFRKMLNYLKNRYHNIPMYITENGFGQLQKPETTVEELLH
  DTKRIQYLSGYLDALKAAMRDGANVKGYFAWSLLDNFEWLYGYKVRFGLFHVDFTTLKRTPKQSATWYKNFIEQN
Full Sequence
Protein Sequence     Length: 517     Download
MKTFANFAIL FLLQSLLFPL YSSCLHQTSD DSSPFPSDFL FGTASSAFQY EGAFLTDGKG    60
LNNWDVFAHE NPGKIVDGSN GDIATDQYHR YMEDIQSMNF LGVNSYRLSI SWSRVLPNGR    120
FGVINYKGIK YYNNLIDALI KKGITPFVTL NHFDYPQELE NRFKSWLSSE MQKDFGYLAD    180
ICFKHFGDRV KHWITINEPN QHISLAYRSG LFPPARCSMP YGNCTHGNSE TEPFIAAHNM    240
ILAHAKAIQI YRTKYQREQK GIIGIVVQTS WFEPISDSIA DKNAAERAQS FYSNWILDPV    300
VYGKYPEEMV NLLGSALPKF SSNEMNSLMS YKSDFLGINH YTSYFIQDCL ITACNSGDGA    360
SKSEGLALKL DRKGNVSIGE LTDVNWQHID PNGFRKMLNY LKNRYHNIPM YITENGFGQL    420
QKPETTVEEL LHDTKRIQYL SGYLDALKAA MRDGANVKGY FAWSLLDNFE WLYGYKVRFG    480
LFHVDFTTLK RTPKQSATWY KNFIEQNVNI EDQIDK* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
PLN02998PLN029985.0e-12940505470+
PLN02814PLN028141.0e-12940513481+
COG2723BglB3.0e-14640512478+
TIGR03356BGL2.0e-15540500461+
pfam00232Glyco_hydro_12.0e-17040507472+
Gene Ontology
GO TermDescription
GO:0003824catalytic activity
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975carbohydrate metabolic process
GO:0009809lignin biosynthetic process
GO:0012505endomembrane system
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
GenBankAAC28502.101551626527Similar to F4I1.26 putative beta-glucosidase gi
GenBankAAU05454.10925161425At1g61820 [Arabidopsis thaliana]
RefSeqNP_176374.1015151518BGLU45 (BETA-GLUCOSIDASE 45); catalytic/ cation binding / hydrolase, hydrolyzing O-glycosyl compounds [Arabidopsis thaliana]
RefSeqNP_850968.1015161516BGLU46 (BETA GLUCOSIDASE 46); catalytic/ cation binding / hydrolase, hydrolyzing O-glycosyl compounds [Arabidopsis thaliana]
RefSeqNP_974067.101435164377BGLU46 (BETA GLUCOSIDASE 46); catalytic/ cation binding / hydrolase, hydrolyzing O-glycosyl compounds [Arabidopsis thaliana]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB3gnr_A03550517487B Chain B, The Avrptob-Bak1 Complex Reveals Two Structurally Similar Kinaseinteracting Domains In A Single Type Iii Effector
PDB3gnp_A03550517487B Chain B, The Avrptob-Bak1 Complex Reveals Two Structurally Similar Kinaseinteracting Domains In A Single Type Iii Effector
PDB3gno_A03550517487A Chain A, Crystal Structure Of A Rice Os3bglu6 Beta-glucosidase
PDB3ptq_B03550434503A Chain A, Crystal Structure Of A Rice Os3bglu6 Beta-glucosidase
PDB3ptq_A03550434503A Chain A, Crystal Structure Of A Rice Os3bglu6 Beta-glucosidase
Metabolic Pathways
Pathway NameReactionECProtein Name
coumarin biosynthesis (via 2-coumarate)RXN-8036EC-3.2.1.21β-glucosidase
linamarin degradationRXN-5341EC-3.2.1.21β-glucosidase
linustatin bioactivationRXN-13602EC-3.2.1.21β-glucosidase
linustatin bioactivationRXN-5341EC-3.2.1.21β-glucosidase
lotaustralin degradationRXN-9674EC-3.2.1.21β-glucosidase
neolinustatin bioactivationRXN-13603EC-3.2.1.21β-glucosidase
neolinustatin bioactivationRXN-9674EC-3.2.1.21β-glucosidase
taxiphyllin bioactivationRXN-13600EC-3.2.1.21β-glucosidase
Signal Peptide
Cleavage Site
24
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
EX072146267593250
ES902608242843250
DK493108233402720
FG227815475405050
DK5605082352825160
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_012_00117.1Aquca_054_00059.1Aquca_054_00064.2Aquca_054_00061.2Aquca_054_00061.1
Aquca_054_00064.3Aquca_054_00064.1.216.582Aquca_054_00064.1.216.582Aquca_054_00060.1Aquca_054_00064.4
Aquca_054_00061.3Aquca_054_00061.4Aquca_054_00061.5Aquca_054_00061.6
Arabidopsis lyrata475147492699475148
Arabidopsis thalianaAT4G21760.1AT1G61810.1AT1G61810.3AT1G61820.3
Brachypodium distachyonBradi5g15540.1Bradi5g15527.1Bradi5g15540.3Bradi5g15540.2
Brassica rapaBra028383Bra031392Bra038756Bra013557Bra038755
Carica papayaevm.model.supercontig_198.21
Capsella rubellaCarubv10021428mCarubv10020143mCarubv10021331mCarubv10006876m
Citrus clementinaCiclev10007996mCiclev10014887mCiclev10017801m
Citrus sinensisorange1.1g009535morange1.1g014339morange1.1g045534morange1.1g016308morange1.1g012716m
orange1.1g016444morange1.1g016438morange1.1g022079m
Cucumis sativusCucsa.341820.2Cucsa.341820.1Cucsa.123340.1Cucsa.086000.1Cucsa.341820.3
Cucsa.341810.1
Eucalyptus grandisEucgr.H00071.1Eucgr.H02855.1Eucgr.E04103.1Eucgr.H02856.1Eucgr.H02856.2
Fragaria vescamrna19575.1-v1.0-hybrid
Glycine maxGlyma15g11290.2Glyma13g35430.2Glyma12g35125.1Glyma07g38840.1Glyma07g38850.1
Glyma12g35140.2Glyma13g35410.2Glyma13g35410.3Glyma13g35410.4
Gossypium raimondiiGorai.010G178500.1Gorai.010G178800.1Gorai.013G167400.1Gorai.013G167400.2Gorai.013G167400.4
Gorai.011G016100.1.462.724Gorai.011G016100.1.462.724Gorai.013G167400.5Gorai.013G167400.3.150.447Gorai.013G167400.3.150.447
Gorai.013G167400.6
Linum usitatissimumLus10018353Lus10007656Lus10007655Lus10032660Lus10018354
Lus10032659
Malus domesticaMDP0000140817MDP0000213598MDP0000306738MDP0000149337
Manihot esculentacassava4.1_005677mcassava4.1_032622mcassava4.1_031147mcassava4.1_026351m
Medicago truncatulaMedtr4g131800.1Medtr4g131810.1
Mimulus guttatusmgv1a004080m
Oryza sativaLOC_Os04g43360.1LOC_Os04g43410.1LOC_Os04g43390.2LOC_Os04g43360.2LOC_Os04g43390.1
LOC_Os04g43400.1LOC_Os04g43380.1
Panicum virgatumPavirv00042435mPavirv00070823mPavirv00006566mPavirv00021804mPavirv00059033m
Pavirv00006565mPavirv00036311mPavirv00010790m
Physcomitrella patensPp1s22_312V6.2Pp1s22_312V6.1
Phaseolus vulgarisPhvul.005G082900.1Phvul.005G082800.3Phvul.005G082800.2Phvul.005G082800.1Phvul.007G273800.1
Phvul.003G076700.1Phvul.006G151300.1Phvul.003G076800.1Phvul.006G151400.1Phvul.003G076700.2
Picea abiesMA_10431526g0010MA_119280g0010
Populus trichocarpaPotri.004G019800.1Potri.004G019700.1Potri.004G019500.1Potri.004G019300.1Potri.004G019300.2
Potri.001G403900.1Potri.004G019400.1
Prunus persicappa004523mppa025660mppa005194mppa017816m
Ricinus communis29842.m00362929904.m002964
Setaria italicaSi009837mSi009886mSi009882mSi009894mSi009871m
Si009850mSi010127mSi010115mSi010284mSi010346m
Si010332mSi010833m
Sorghum bicolorSb06g022460.1Sb06g022450.1Sb06g022510.1Sb06g022500.1Sb06g022410.1
Sb06g022490.1Sb06g022420.1Sb06g022385.1
Thellungiella halophilaThhalv10024134mThhalv10024920mThhalv10024911mThhalv10023411mThhalv10024896m
Vitis viniferaGSVIVT01014400001GSVIVT01012650001GSVIVT01014399001
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny  (This image is cropped. Click for full image.)