CAZyme Information

Basic Information
Species: Arabidopsis lyrata
Cazyme ID: 917734
Family: GH1
Protein Properties: Length: 516; Molecular Weight: 59067.2; Isoelectric Point: 8.5748
External Links
NCBI Taxonomy
Plaza
CAZyDB
Signature Domain
Family  Start  End  Evalue
GH1     40     485  0
  RKHFDDDFIFGVASSAYQACAKGRGLNVWDEKGGPDLGNGDNTCGSYEHWQKDIDVMTELGVDGYRFSLAWSRIIPRGKVKRGINEAGVKYYNELIDGLL
  EKNITPFVTLFHWDLPQVLQDEYEGFLHRDIMYVIDVKNWITIKQLYTVPTRGYAMGTGAPGRCSYWLNKDRYAGDSGREPYIVAHNQLLAHAEVVDLYR
  KKYKPKQGGQIGVVMITRWFIPYDSTEANKKATERNKEFFLGWFMEPLTKGKYPDIMRKLVGRRLLNFSEREAKLVKGSYDFLGINYYQTQYVYAIPANP
  PNRLTVMNDSLSAYSYENKDGPIGPWLLPSKRNVNVLEHFETKYGNPLVYITENGYNSPGGNTTAHEVIADSNRTDYICSHLCFLRKAIKESGCNVKGYF
  AWSLGDNYEFGKGFTVRYGLSYVDFTNITADRVLKTSGKWYKQFLN
Full Sequence
Protein Sequence (Length: 516)
MKLLGLCLVL LLAVVTCKAE EITCEETKPF TCNQTDRFNR KHFDDDFIFG VASSAYQACA    60
KGRGLNVWDE KGGPDLGNGD NTCGSYEHWQ KDIDVMTELG VDGYRFSLAW SRIIPRGKVK    120
RGINEAGVKY YNELIDGLLE KNITPFVTLF HWDLPQVLQD EYEGFLHRDI MYVIDVKNWI    180
TIKQLYTVPT RGYAMGTGAP GRCSYWLNKD RYAGDSGREP YIVAHNQLLA HAEVVDLYRK    240
KYKPKQGGQI GVVMITRWFI PYDSTEANKK ATERNKEFFL GWFMEPLTKG KYPDIMRKLV    300
GRRLLNFSER EAKLVKGSYD FLGINYYQTQ YVYAIPANPP NRLTVMNDSL SAYSYENKDG    360
PIGPWLLPSK RNVNVLEHFE TKYGNPLVYI TENGYNSPGG NTTAHEVIAD SNRTDYICSH    420
LCFLRKAIKE SGCNVKGYFA WSLGDNYEFG KGFTVRYGLS YVDFTNITAD RVLKTSGKWY    480
KQFLNGTTKI PDENQNFLRS RLFFENRDQK KVADT* 
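As a rough illustration of how the coordinates on this page relate to the sequence listing above, the following Python sketch strips the spacing, position counters, and terminal `*` from the listing and slices out residues 40 to 485, the range read from the Signature Domain entry. The helper names (`clean_sequence`, `slice_1based`) are hypothetical, and treating Start and End as 1-based inclusive positions is an assumption about how this page reports coordinates.

```python
import re

# Full Sequence listing, copied as shown (spacing and counters included).
RAW = """
MKLLGLCLVL LLAVVTCKAE EITCEETKPF TCNQTDRFNR KHFDDDFIFG VASSAYQACA    60
KGRGLNVWDE KGGPDLGNGD NTCGSYEHWQ KDIDVMTELG VDGYRFSLAW SRIIPRGKVK    120
RGINEAGVKY YNELIDGLLE KNITPFVTLF HWDLPQVLQD EYEGFLHRDI MYVIDVKNWI    180
TIKQLYTVPT RGYAMGTGAP GRCSYWLNKD RYAGDSGREP YIVAHNQLLA HAEVVDLYRK    240
KYKPKQGGQI GVVMITRWFI PYDSTEANKK ATERNKEFFL GWFMEPLTKG KYPDIMRKLV    300
GRRLLNFSER EAKLVKGSYD FLGINYYQTQ YVYAIPANPP NRLTVMNDSL SAYSYENKDG    360
PIGPWLLPSK RNVNVLEHFE TKYGNPLVYI TENGYNSPGG NTTAHEVIAD SNRTDYICSH    420
LCFLRKAIKE SGCNVKGYFA WSLGDNYEFG KGFTVRYGLS YVDFTNITAD RVLKTSGKWY    480
KQFLNGTTKI PDENQNFLRS RLFFENRDQK KVADT*
"""


def clean_sequence(raw: str) -> str:
    """Keep only uppercase residue letters (drops spaces, counters, and '*')."""
    return re.sub(r"[^A-Z]", "", raw)


def slice_1based(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return sequence[start - 1:end]


seq = clean_sequence(RAW)
domain = slice_1based(seq, 40, 485)  # assumed GH1 signature domain range

print(len(seq), len(domain))
print(domain[:11], "...", domain[-5:])
```

Under these assumptions the slice begins with RKHFDDDFIFG and ends with KQFLN, matching the Signature Domain sequence shown on this page. The listing contains 515 residue letters, so the reported Length of 516 appears to count the terminal `*`.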
Functional Domains
Cdd ID     Domain         E-Value   Start  End  Length
COG2723    BglB           8.0e-95   43     482  480
PLN02849   PLN02849       3.0e-103  35     494  487
TIGR03356  BGL            2.0e-104  44     480  461
PLN02814   PLN02814       2.0e-105  35     502  496
pfam00232  Glyco_hydro_1  9.0e-150  39     489  478
Gene Ontology
GO Term     Description
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975  carbohydrate metabolic process
Annotations - NR
Source   Hit ID       E-Value  Query Start  Query End  Hit Start  Hit End  Description
GenBank  AAD40134.1   0        1            513        1          536      AF149413_15 Arabidopsis thaliana thioglucosidase (GB:X79195); Pfam PF00232, Score=702.5, E=1.9e-207, N=1
GenBank  AAK32833.1   0        1            513        1          536      AF361821_1 AT5g25980/T1N24_18 [Arabidopsis thaliana]
GenBank  AAV71147.1   0        13           515        13         538      myrosinase [Armoracia rusticana]
DDBJ     BAE98479.1   0        1            513        12         547      myrosinase TGG2 [Arabidopsis thaliana]
RefSeq   NP_568479.1  0        1            513        12         547      TGG2 (GLUCOSIDE GLUCOHYDROLASE 2); hydrolase, hydrolyzing O-glycosyl compounds / thioglucosidase [Arabidopsis thaliana]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     1myr_A  0        20           485        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     2wxd_M  0        20           485        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     1w9d_M  0        20           485        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     1w9b_M  0        20           485        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     1e73_M  0        20           485        2          500      A Chain A, Myrosinase From Sinapis Alba
Signal Peptide
Cleavage Site: 19
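To make the cleavage site concrete, here is a minimal Python sketch that splits the N-terminus of the protein at position 19. It assumes the reported value means cleavage occurs after residue 19 (1-based), which this page does not state explicitly; `split_signal_peptide` is a hypothetical helper, and `N_TERMINUS` holds only the first 60 residues copied from the Full Sequence listing.

```python
# First 60 residues of the protein, copied from the Full Sequence listing.
N_TERMINUS = "MKLLGLCLVLLLAVVTCKAEEITCEETKPFTCNQTDRFNRKHFDDDFIFGVASSAYQACA"

# Value reported under Signal Peptide on this page.
CLEAVAGE_SITE = 19


def split_signal_peptide(sequence: str, cleavage_site: int) -> tuple[str, str]:
    """Split a sequence into (signal peptide, remainder).

    Assumes cleavage happens after residue `cleavage_site`, counted from 1.
    """
    return sequence[:cleavage_site], sequence[cleavage_site:]


signal, mature_start = split_signal_peptide(N_TERMINUS, CLEAVAGE_SITE)

print("signal peptide:", signal)
print("mature N-terminus:", mature_start[:10])
```

Under this reading the predicted mature protein would begin at residue 20, which is also where the PDB alignments listed on this page start on the query.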
Hydropathy
EST
Hit       Length  Start  End  EValue
EG453010  310     20     308  0
EG453007  310     20     308  0
EG520295  322     51     351  0
EG481640  292     60     331  0
EG525046  319     63     360  0
Orthologous Group
Species                  ID
Arabidopsis lyrata       489446
Arabidopsis thaliana     AT5G25980.1, AT5G26000.1, AT5G48375.1, AT5G25980.2, AT1G51490.1, AT5G26000.2, AT5G25980.3
Brassica rapa            Bra016676, Bra039823, Bra039705, Bra039824, Bra004012, Bra014287, Bra038094, Bra023838, Bra032343, Bra036914, Bra020523.35.161, Bra020523.35.161, Bra020549
Carica papaya            evm.model.supercontig_17.152
Capsella rubella         Carubv10000656m, Carubv10007356m.543.827, Carubv10007356m.543.827, Carubv10006917m, Carubv10022237m, Carubv10007356m.45.502, Carubv10007356m.45.502
Linum usitatissimum      Lus10012868
Thellungiella halophila  Thhalv10002474m, Thhalv10004165m, Thhalv10001184m, Thhalv10002471m, Thhalv10002470m, Thhalv10003945m, Thhalv10003954m, Thhalv10012086m, Thhalv10000681m, Thhalv10011390m, Thhalv10012108m
Sequence Alignments
Phylogeny