CAZyme Information

Basic Information
SpeciesBrassica rapa
Cazyme IDBra023838
FamilyGH1
Protein PropertiesLength: 516 Molecular Weight: 58746 Isoelectric Point: 5.869
ChromosomeChromosome/Scaffold: 01 Start: 20334277 End: 20336960
Descriptionthioglucoside glucohydrolase 1
View CDS
External Links
NCBI Taxonomy
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH1424930
  SSFEKDFIFGLASSAYQAYKAGPDHGNGDTTCNSYSYWEKDIEVMDELKATGYRFSIAWSRIIPRGKRSRGVHQGGINYYHGLINGLIDKGITPLVTLFH
  WDLPQVLQDDYEGFLDPQIIDDFRDFADLCFEEYGDKVKHWFTINQLYSVPTRGYGLGSDAPGRCSPKVDSTCYAGNSSTEPYIVAHNQLLAHATVHQGG
  KIGPVMITRWFLPYNDTDPDSIAATERMKEFFLGWYMGPLTNGTYPQIMIDTVGERLPSFTPEESKLVKGSYDFLGLNYYFAQYVQPSPNHVDSDGHTAM
  MDAGTRLTYRNASNHAIGPVFTEHKDDETKNTYYYPKGIYYVMDHFKTNYNDPVIYITENGFSTSGDETREEAKFDYRRIDYLCSHLCFLSKVIKETGVK
  VKGYCAWSLGDNYEFGLGFTVRFGLTYIDWNNVTDRDLKESGKWYKKFIATK
Full Sequence
Protein Sequence     Length: 516     Download
MKHLWLTLAF LLALATCKAD EEITCEENLP FTCGQTDRFN SSSFEKDFIF GLASSAYQAY    60
KAGPDHGNGD TTCNSYSYWE KDIEVMDELK ATGYRFSIAW SRIIPRGKRS RGVHQGGINY    120
YHGLINGLID KGITPLVTLF HWDLPQVLQD DYEGFLDPQI IDDFRDFADL CFEEYGDKVK    180
HWFTINQLYS VPTRGYGLGS DAPGRCSPKV DSTCYAGNSS TEPYIVAHNQ LLAHATVHQG    240
GKIGPVMITR WFLPYNDTDP DSIAATERMK EFFLGWYMGP LTNGTYPQIM IDTVGERLPS    300
FTPEESKLVK GSYDFLGLNY YFAQYVQPSP NHVDSDGHTA MMDAGTRLTY RNASNHAIGP    360
VFTEHKDDET KNTYYYPKGI YYVMDHFKTN YNDPVIYITE NGFSTSGDET REEAKFDYRR    420
IDYLCSHLCF LSKVIKETGV KVKGYCAWSL GDNYEFGLGF TVRFGLTYID WNNVTDRDLK    480
ESGKWYKKFI ATKNLAKPNF LRSSLTFEKK KFADA* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
PLN02849PLN028499.0e-9336503496+
COG2723BglB7.0e-9842492487+
PLN02814PLN028141.0e-9836490486+
TIGR03356BGL1.0e-10446486477+
pfam00232Glyco_hydro_12.0e-15840493483+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975carbohydrate metabolic process
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB1DWA0224911498M Chain M, Study On Radiation Damage On A Cryocooled Crystal. Part 1: Structure Prior To Irradiation
PDB1MYR0204911500A Chain A, Myrosinase From Sinapis Alba
DDBJBAE16356.1015151545myrosinase [Eutrema wasabi]
EMBLCAA79989.2015151527myrosinase, thioglucoside glucohydrolase [Brassica napus]
Swiss-ProtP297360204911500MYRA_SINAL RecName: Full=Myrosinase MA1; AltName: Full=Sinigrinase; AltName: Full=Thioglucosidase
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB1myr_A0204911500A Chain A, Myrosinase From Sinapis Alba
PDB2wxd_M0204911500A Chain A, Myrosinase From Sinapis Alba
PDB1w9d_M0204911500A Chain A, Myrosinase From Sinapis Alba
PDB1w9b_M0204911500A Chain A, Myrosinase From Sinapis Alba
PDB1e73_M0204911500A Chain A, Myrosinase From Sinapis Alba
Metabolic Pathways
Pathway NameReactionECProtein Name
coumarin biosynthesis (via 2-coumarate)RXN-8036EC-3.2.1.21β-glucosidase
glucosinolate breakdownRXN-8134EC-3.2.1.147thioglucosidase
glucosinolate breakdown (via thiocyanate-forming protein)RXN-12024EC-3.2.1.147thioglucosidase
linamarin degradationRXN-5341EC-3.2.1.21β-glucosidase
linustatin bioactivationRXN-13602EC-3.2.1.21β-glucosidase
linustatin bioactivationRXN-5341EC-3.2.1.21β-glucosidase
lotaustralin degradationRXN-9674EC-3.2.1.21β-glucosidase
neolinustatin bioactivationRXN-13603EC-3.2.1.21β-glucosidase
neolinustatin bioactivationRXN-9674EC-3.2.1.21β-glucosidase
taxiphyllin bioactivationRXN-13600EC-3.2.1.21β-glucosidase
Signal Peptide
Cleavage Site
19
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
EW7331232742284920
EY9349622302875160
EG525046307613570
EG453010306212980
EG453007306212980
Orthologous Group
SpeciesID
Arabidopsis lyrata917734489446
Arabidopsis thalianaAT5G25980.1AT5G26000.1AT5G48375.1AT5G25980.2AT1G51490.1
AT5G26000.2AT5G25980.3
Brassica rapaBra016676Bra039823Bra039705Bra039824Bra004012
Bra014287Bra038094Bra032343Bra036914Bra020523.35.161
Bra020523.35.161Bra020549
Carica papayaevm.model.supercontig_17.152
Capsella rubellaCarubv10000656mCarubv10007356m.543.827Carubv10007356m.543.827Carubv10006917mCarubv10022237m
Carubv10007356m.45.502Carubv10007356m.45.502
Linum usitatissimumLus10012868
Thellungiella halophilaThhalv10002474mThhalv10004165mThhalv10001184mThhalv10002471mThhalv10002470m
Thhalv10003945mThhalv10003954mThhalv10012086mThhalv10000681mThhalv10011390m
Thhalv10012108m
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny