CAZyme Information

Basic Information
SpeciesArabidopsis thaliana
Cazyme IDAT5G48375.1
FamilyGH1
Protein PropertiesLength: 440 Molecular Weight: 50757.8 Isoelectric Point: 8.0464
ChromosomeChromosome/Scaffold: 5 Start: 19601303 End: 19603883
DescriptionOs4bglu12 - beta-glucosidase, exo-beta-glucanase, expressed
View CDS
External Links
TAIR
Geo Profiles
ATTED-II
NCBI Taxonomy
Plaza
SIGnAL
CAZyDB
Entrez Gene
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH1533880
  KGRGLNVWDGFTHRYPEKGGPDLGNGDSTCGSYEHWQKDIDVMTELGVDGYRFSLAWSRIAPRESNQAGVKYYNDLIDGLLAKNITPFVTLFHWDLPQVL
  QDEYEGFLNHEIIDDFKDYANLCFKIFGDRVKKWITINQLYTVPTRGYAMGTDAPEPYIVAHNQLLAHAKVVHLYRKKYKPKQRGQIGVVMITRWFVPYD
  STQANIDATERNKEFFLGWFMEPLTKGKYPDIMRKLVGRRLPKFNKKEAKLVKGSYDFLGINYYQTQYVYAIPANPPNRLTVLNDSLSAFSYENKDGPIG
  PWFNADSYYHPRGILNVLEHFKTKYGNPLVYITENG
Full Sequence
Protein Sequence     Length: 440     Download
MKFRALGLVL LLAVETCKAE EITCEETKPF TCNQTDRFNR KHFDDDFIFE GGKGRGLNVW    60
DGFTHRYPEK GGPDLGNGDS TCGSYEHWQK DIDVMTELGV DGYRFSLAWS RIAPRESNQA    120
GVKYYNDLID GLLAKNITPF VTLFHWDLPQ VLQDEYEGFL NHEIIDDFKD YANLCFKIFG    180
DRVKKWITIN QLYTVPTRGY AMGTDAPEPY IVAHNQLLAH AKVVHLYRKK YKPKQRGQIG    240
VVMITRWFVP YDSTQANIDA TERNKEFFLG WFMEPLTKGK YPDIMRKLVG RRLPKFNKKE    300
AKLVKGSYDF LGINYYQTQY VYAIPANPPN RLTVLNDSLS AFSYENKDGP IGPWFNADSY    360
YHPRGILNVL EHFKTKYGNP LVYITENGEL LILSGCNVKG YFAWCLGDNY ELWPSRSFHV    420
SPFYLLHRKD KGAFPSFEA* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
COG2723BglB6.0e-8643411426+
TIGR03356BGL5.0e-9044413420+
PLN02814PLN028149.0e-9335412438+
PLN02849PLN028495.0e-9335412439+
pfam00232Glyco_hydro_15.0e-13339415431+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975carbohydrate metabolic process
GO:0012505endomembrane system
GO:0019137thioglucosidase activity
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
GenBankAAD40134.1014121476AF149413_15 Arabidopsis thaliana thioglucosidase (GB:X79195); Pfam PF00232, Score=702.5, E=1.9e-207, N=1
GenBankAAV71147.1014121474myrosinase [Armoracia rusticana]
DDBJBAE98479.10141212487myrosinase TGG2 [Arabidopsis thaliana]
RefSeqNP_568479.10141212487TGG2 (GLUCOSIDE GLUCOHYDROLASE 2); hydrolase, hydrolyzing O-glycosyl compounds / thioglucosidase [Arabidopsis thaliana]
RefSeqNP_680406.1014391439TGG3 (THIOGLUCOSIDE GLUCOSIDASE 3); hydrolase, hydrolyzing O-glycosyl compounds / thioglucosidase [Arabidopsis thaliana]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB1myr_A0204122465A Chain A, Myrosinase From Sinapis Alba
PDB2wxd_M0204122465A Chain A, Myrosinase From Sinapis Alba
PDB1w9d_M0204122465A Chain A, Myrosinase From Sinapis Alba
PDB1w9b_M0204122465A Chain A, Myrosinase From Sinapis Alba
PDB1e73_M0204122465A Chain A, Myrosinase From Sinapis Alba
Metabolic Pathways
Pathway NameReactionECProtein Name
glucosinolate breakdownRXN-8134EC-3.2.1.147thioglucosidase
glucosinolate breakdown (via thiocyanate-forming protein)RXN-12024EC-3.2.1.147thioglucosidase
Signal Peptide
Cleavage Site
19
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
EG520295317473400
EG481640293513200
EG453010306202930
EG453007306202930
EG525046319553490
Orthologous Group
SpeciesID
Arabidopsis lyrata917734489446
Arabidopsis thalianaAT5G25980.1AT5G26000.1AT5G25980.2AT1G51490.1AT5G26000.2
AT5G25980.3
Brassica rapaBra016676Bra039823Bra039705Bra039824Bra004012
Bra014287Bra038094Bra023838Bra032343Bra036914
Bra020523.35.161Bra020523.35.161Bra020549
Carica papayaevm.model.supercontig_17.152
Capsella rubellaCarubv10000656mCarubv10007356m.543.827Carubv10007356m.543.827Carubv10006917mCarubv10022237m
Carubv10007356m.45.502Carubv10007356m.45.502
Linum usitatissimumLus10012868
Thellungiella halophilaThhalv10002474mThhalv10004165mThhalv10001184mThhalv10002471mThhalv10002470m
Thhalv10003945mThhalv10003954mThhalv10012086mThhalv10000681mThhalv10011390m
Thhalv10012108m
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny