CAZyme Information

Basic Information
SpeciesBrassica rapa
Cazyme IDBra004012
FamilyGH1
Protein PropertiesLength: 539 Molecular Weight: 61496.8 Isoelectric Point: 8.9
ChromosomeChromosome/Scaffold: 07 Start: 16290701 End: 16294183
Descriptionthioglucoside glucohydrolase 1
View CDS
External Links
NCBI Taxonomy
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH1435100
  KGFPKDFIFGVASSAYQACGRGRGLNVWDGFTHRYPEKGGPDLGNGDTTCESYTKWQKDIDILDEMNATGYRFSFAWSRIIPEGKVSRGVNKGGLKYYHK
  LIDGLIAKNITPFVTLYHWDIPQTLQDEYQGFLNRQVIDDFRDFADLCFKEFGGKVKNWLTLNQLYTVPTRGYSTAGADAPGRCSPKVDERCYGGNSSTE
  PYIVAHNQLLAHAAVFQRGIIGPVMITRWFLPFNETDRASIDATERMKEFFFGWYMEPLTRGRYPDIMRRMVGNRLPNFTEAEARLVAGSYDFLGLNYYV
  GQYVQPAPNPLPVTSERYTAMMDPGTTLTSVNARGEKIGPLFEDFQGSRIYYYPKGIYYVMDHFRTRYRNPLIYVTENGFTTPASENRPEAVADSKRIDY
  LCSHLCFLRKVIREKRVNIKGYFAWSLGDNYEFAKGFAVRFGLTYVNWTDVSDRNLKDSGRWYQRFIN
Full Sequence
Protein Sequence     Length: 539     Download
MKLRGLVLIV FLLAVVSCKA NGEITCEENE PFTCNNTARL NSKGFPKDFI FGVASSAYQA    60
CGRGRGLNVW DGFTHRYPEK GGPDLGNGDT TCESYTKWQK DIDILDEMNA TGYRFSFAWS    120
RIIPEGKVSR GVNKGGLKYY HKLIDGLIAK NITPFVTLYH WDIPQTLQDE YQGFLNRQVI    180
DDFRDFADLC FKEFGGKVKN WLTLNQLYTV PTRGYSTAGA DAPGRCSPKV DERCYGGNSS    240
TEPYIVAHNQ LLAHAAVFQR GIIGPVMITR WFLPFNETDR ASIDATERMK EFFFGWYMEP    300
LTRGRYPDIM RRMVGNRLPN FTEAEARLVA GSYDFLGLNY YVGQYVQPAP NPLPVTSERY    360
TAMMDPGTTL TSVNARGEKI GPLFEDFQGS RIYYYPKGIY YVMDHFRTRY RNPLIYVTEN    420
GFTTPASENR PEAVADSKRI DYLCSHLCFL RKVIREKRVN IKGYFAWSLG DNYEFAKGFA    480
VRFGLTYVNW TDVSDRNLKD SGRWYQRFIN VTSKNTANED FLRSSLSFKN KMKTLADA*     540
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
PLN02849PLN028496.0e-10345524495+
COG2723BglB3.0e-10843520492+
PLN02814PLN028141.0e-1111512530+
TIGR03356BGL7.0e-11646505475+
pfam00232Glyco_hydro_16.0e-18041510483+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975carbohydrate metabolic process
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
GenBankAAX68547.1015381548myrosinase [Brassica rapa var. parachinensis]
GenBankABS30827.101753817546myrosinase [Brassica oleracea]
DDBJBAE16356.1015381545myrosinase [Eutrema wasabi]
EMBLCAA55685.1015381547myrosinase [Brassica napus]
EMBLCAA79990.101753817544myrosinase, thioglucoside glucohydrolase [Brassica napus]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB1myr_A0235103500A Chain A, Myrosinase From Sinapis Alba
PDB1dwj_M0235101498A Chain A, Myrosinase From Sinapis Alba
PDB1dwi_M0235101498A Chain A, Myrosinase From Sinapis Alba
PDB1dwh_M0235101498A Chain A, Myrosinase From Sinapis Alba
PDB1dwg_M0235101498A Chain A, Myrosinase From Sinapis Alba
Metabolic Pathways
Pathway NameReactionECProtein Name
coumarin biosynthesis (via 2-coumarate)RXN-8036EC-3.2.1.21β-glucosidase
glucosinolate breakdownRXN-8134EC-3.2.1.147thioglucosidase
glucosinolate breakdown (via thiocyanate-forming protein)RXN-12024EC-3.2.1.147thioglucosidase
linamarin degradationRXN-5341EC-3.2.1.21β-glucosidase
linustatin bioactivationRXN-13602EC-3.2.1.21β-glucosidase
linustatin bioactivationRXN-5341EC-3.2.1.21β-glucosidase
lotaustralin degradationRXN-9674EC-3.2.1.21β-glucosidase
neolinustatin bioactivationRXN-13603EC-3.2.1.21β-glucosidase
neolinustatin bioactivationRXN-9674EC-3.2.1.21β-glucosidase
taxiphyllin bioactivationRXN-13600EC-3.2.1.21β-glucosidase
Signal Peptide
Cleavage Site
20
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
CD82752324912480
ES9246552392614990
EG525046324653780
CD83230224382490
BM985570281393090
Orthologous Group
SpeciesID
Arabidopsis lyrata917734489446
Arabidopsis thalianaAT5G25980.1AT5G26000.1AT5G48375.1AT5G25980.2AT1G51490.1
AT5G26000.2AT5G25980.3
Brassica rapaBra016676Bra039823Bra039705Bra039824Bra014287
Bra038094Bra023838Bra032343Bra036914Bra020523.35.161
Bra020523.35.161Bra020549
Carica papayaevm.model.supercontig_17.152
Capsella rubellaCarubv10000656mCarubv10007356m.543.827Carubv10007356m.543.827Carubv10006917mCarubv10022237m
Carubv10007356m.45.502Carubv10007356m.45.502
Linum usitatissimumLus10012868
Thellungiella halophilaThhalv10002474mThhalv10004165mThhalv10001184mThhalv10002471mThhalv10002470m
Thhalv10003945mThhalv10003954mThhalv10012086mThhalv10000681mThhalv10011390m
Thhalv10012108m
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny