CAZyme Information

Basic Information
Species: Arabidopsis lyrata
CAZyme ID: 489446
Family: GH1
Protein Properties: Length: 542; Molecular Weight: 61210.4; Isoelectric Point: 5.0064
Chromosome: Chromosome/Scaffold: 6; Start: 11788904; End: 11801042
Description: thioglucoside glucohydrolase 1
External Links
NCBI Taxonomy
Plaza
CAZyDB
Signature Domain
Family  Start  End  Evalue
GH1     41     512  0
  GSFEKDFIFGVASSAYQVEGGRGRGLNIWDGFTHRYPEKGGADLGNGDTTCDSYTNWQKDIDVMDELNATGYRFSFAWSRILPKGKRSRGVNEGGINYYN
  RLINNTIARNITPFVTLFHWDLPQTLQDEYNGFLNRTIIDDFKDYADLCFELFGDRVKNWITINQLYTVPTRGYALGTDAPGRCSPKIDERCPGGNSSTE
  PYLVAHNQLLAHAAAVDVYRTKYKQDQGGKIGPVMITRWFLPYDDTPESKEATERAKEFFHGWFMGPLTEGKYPDIMREYVGDRLPEFNETEAALVKGSY
  DFLGLNYYVTQYAQNNDTIVPPDVHTALMDSRATLTSTNATGHAPGPPFNAGSYYYPKGIYYVMEYFKNKYGDPLIYITENGISTPGDESFDEAVADYKR
  IDYLCSHLCFLSKVIKEKAVNVKGYFAWALGDNYEFCNGFTVRFGLSYVDFTNVTGDRDLKASGKWYQQFIN
Full Sequence
Protein Sequence     Length: 542
MKLLGFSLAI LLAVVTCKAE EFTCEENEPF TCNQTKLFNS GSFEKDFIFG VASSAYQVEG    60
GRGRGLNIWD GFTHRYPEKG GADLGNGDTT CDSYTNWQKD IDVMDELNAT GYRFSFAWSR    120
ILPKGKRSRG VNEGGINYYN RLINNTIARN ITPFVTLFHW DLPQTLQDEY NGFLNRTIID    180
DFKDYADLCF ELFGDRVKNW ITINQLYTVP TRGYALGTDA PGRCSPKIDE RCPGGNSSTE    240
PYLVAHNQLL AHAAAVDVYR TKYKQDQGGK IGPVMITRWF LPYDDTPESK EATERAKEFF    300
HGWFMGPLTE GKYPDIMREY VGDRLPEFNE TEAALVKGSY DFLGLNYYVT QYAQNNDTIV    360
PPDVHTALMD SRATLTSTNA TGHAPGPPFN AGSYYYPKGI YYVMEYFKNK YGDPLIYITE    420
NGISTPGDES FDEAVADYKR IDYLCSHLCF LSKVIKEKAV NVKGYFAWAL GDNYEFCNGF    480
TVRFGLSYVD FTNVTGDRDL KASGKWYQQF INVTTEDSTN QDLLRSSVSV KNRDRKSLAD    540
A* 
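
The listing above groups residues in blocks of ten and ends each row with a running position count, so it cannot be used as a plain sequence string directly. The following is a minimal sketch, not part of the original record, of how such a listing might be flattened and how a 1-based, inclusive range such as the signature domain (41 to 512) could be cut out. The function names `parse_numbered_sequence` and `slice_1_based` are hypothetical, and the example uses only the first two rows of the sequence.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Join a grouped, position-numbered protein listing into one string.

    Hypothetical helper: keeps only runs of letters, which drops the
    trailing position counters (60, 120, ...) and the '*' stop marker.
    """
    residues = []
    for line in block.splitlines():
        residues.extend(re.findall(r"[A-Za-z]+", line))
    return "".join(residues).upper()


def slice_1_based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive."""
    return seq[start - 1:end]


# First two rows of the listing above, copied verbatim.
example = """\
MKLLGFSLAI LLAVVTCKAE EFTCEENEPF TCNQTKLFNS GSFEKDFIFG VASSAYQVEG    60
GRGRGLNIWD GFTHRYPEKG GADLGNGDTT CDSYTNWQKD IDVMDELNAT GYRFSFAWSR    120
"""

seq = parse_numbered_sequence(example)
print(len(seq))                    # 120
print(slice_1_based(seq, 41, 50))  # GSFEKDFIFG
```

Residues 41 to 50 match the first ten residues of the signature domain sequence, which is consistent with the domain starting at position 41.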
Functional Domains
Cdd ID     Domain         E-Value   Start  End  Length
PLN02849   PLN02849       4.0e-115  38     511  482
PLN02814   PLN02814       1.0e-120  38     525  494
COG2723    BglB           2.0e-123  43     511  480
TIGR03356  BGL            2.0e-131  45     507  468
pfam00232  Glyco_hydro_1  0         39     512  482
Gene Ontology
GO Term     Description
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975  carbohydrate metabolic process
Annotations - NR
Source   Hit ID       E-Value  Query Start  Query End  Hit Start  Hit End  Description
GenBank  AAL06896.1   0        1            541        1          541      AT5g26000/T1N24_7 [Arabidopsis thaliana]
GenBank  AAL25596.1   0        1            541        1          541      AT5g26000/T1N24_7 [Arabidopsis thaliana]
EMBL     CAH40799.1   0        18           497        1          480      thioglucoside glucohydrolase [Arabidopsis thaliana]
EMBL     CAH40827.1   0        18           497        1          479      thioglucoside glucohydrolase [Arabidopsis lyrata subsp. lyrata]
RefSeq   NP_851077.1  0        1            541        1          541      TGG1 (THIOGLUCOSIDE GLUCOHYDROLASE 1); hydrolase, hydrolyzing O-glycosyl compounds / thioglucosidase [Arabidopsis thaliana]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     1myr_A  0        20           512        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     2wxd_M  0        20           512        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     1w9d_M  0        20           512        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     1w9b_M  0        20           512        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     1e73_M  0        20           512        2          500      A Chain A, Myrosinase From Sinapis Alba
Signal Peptide
Cleavage Site: 19
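
As an illustration of what the cleavage site implies for the sequence, here is a minimal sketch that splits the N-terminus into signal peptide and mature chain. It assumes, which the record does not state, that a cleavage site of 19 means cleavage occurs after residue 19, so the mature protein starts at residue 20. The helper name `split_signal_peptide` is hypothetical, and only the first 30 residues of the full sequence are used.

```python
def split_signal_peptide(seq: str, cleavage_site: int) -> tuple[str, str]:
    """Split a protein into (signal peptide, mature chain).

    Assumption: cleavage_site is the 1-based position of the last
    residue of the signal peptide.
    """
    if not 0 < cleavage_site < len(seq):
        raise ValueError("cleavage site must fall inside the sequence")
    return seq[:cleavage_site], seq[cleavage_site:]


# First 30 residues of the full sequence in this record.
n_terminus = "MKLLGFSLAILLAVVTCKAEEFTCEENEPF"

signal, mature = split_signal_peptide(n_terminus, 19)
print(signal)  # MKLLGFSLAILLAVVTCKA
print(mature)  # EEFTCEENEPF
```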
Hydropathy
EST
Hit       Length  Start  End  EValue
EG525046  322     64     385  0
CB264645  305     1      305  0
EG524108  295     234    528  0
EG520295  318     51     368  0
EG453010  310     20     329  0
Orthologous Group
Species                  ID
Arabidopsis lyrata       917734
Arabidopsis thaliana     AT5G25980.1 AT5G26000.1 AT5G48375.1 AT5G25980.2 AT1G51490.1
                         AT5G26000.2 AT5G25980.3
Brassica rapa            Bra016676 Bra039823 Bra039705 Bra039824 Bra004012
                         Bra014287 Bra038094 Bra023838 Bra032343 Bra036914
                         Bra020523.35.161 Bra020523.35.161 Bra020549
Carica papaya            evm.model.supercontig_17.152
Capsella rubella         Carubv10000656m Carubv10007356m.543.827 Carubv10007356m.543.827 Carubv10006917m Carubv10022237m
                         Carubv10007356m.45.502 Carubv10007356m.45.502
Linum usitatissimum      Lus10012868
Thellungiella halophila  Thhalv10002474m Thhalv10004165m Thhalv10001184m Thhalv10002471m Thhalv10002470m
                         Thhalv10003945m Thhalv10003954m Thhalv10012086m Thhalv10000681m Thhalv10011390m
                         Thhalv10012108m
Sequence Alignments
Phylogeny