CAZyme Information

Basic Information
Species: Arabidopsis thaliana
Cazyme ID: AT5G25980.2
Family: GH1
Protein Properties: Length: 548; Molecular Weight: 62732; Isoelectric Point: 7.4854
Chromosome: Chromosome/Scaffold: 5; Start: 9072727; End: 9075690
Description: Os4bglu12 - beta-glucosidase, exo-beta-glucanase, expressed
External Links
TAIR
Geo Profiles
ATTED-II
NCBI Taxonomy
Plaza
SIGnAL
CAZyDB
Entrez Gene
Signature Domain
Family  Start  End  Evalue
GH1     52     524  0
  KQDFESDFIFGVASSAYQIEGGRGRGLNVWDGFTHRYPEKGGADLGNGDTTCDSYRTWQKDLDVMEELGVKGYRFSFAWSRILPKGKRSRGINEDGINYY
  SGLIDGLIARNITPFVTLFHWDLPQSLQDEYEGFLDRTIIDDFKDYADLCFERFGDRVKHWITINQLFTVPTRGYALGTDAPGRCSQWVDKRCYGGDSST
  EPYIVAHNQLLAHATVVDLYRTRYKYQGGKIGPVMITRWFLPYDDTLESKQATWRAKEFFLGWFMEPLTKGKYPYIMRKLVGNRLPKFNSTEARLLKGSY
  DFLGLNYYVTQYAHALDPSPPEKLTAMTDSLANLTSLDANGQPPGPPFSKGSYYHPRGMLNVMEHFKTKYGDPLIYVTENGFSTSGGPIPFTEAFHDYNR
  IDYLCSHLCFLRKAIKEKRVNVKGYFVWSLGDNYEFCNGYTVRFGLSYVDFNNVTADRDLKASGLWYQSFLRD
Full Sequence
Protein Sequence     Length: 548
MQHNTYIYIL TMKLLGFALA ILLVVATCKP EEEITCEENV PFTCSQTDRF NKQDFESDFI    60
FGVASSAYQI EGGRGRGLNV WDGFTHRYPE KGGADLGNGD TTCDSYRTWQ KDLDVMEELG    120
VKGYRFSFAW SRILPKGKRS RGINEDGINY YSGLIDGLIA RNITPFVTLF HWDLPQSLQD    180
EYEGFLDRTI IDDFKDYADL CFERFGDRVK HWITINQLFT VPTRGYALGT DAPGRCSQWV    240
DKRCYGGDSS TEPYIVAHNQ LLAHATVVDL YRTRYKYQGG KIGPVMITRW FLPYDDTLES    300
KQATWRAKEF FLGWFMEPLT KGKYPYIMRK LVGNRLPKFN STEARLLKGS YDFLGLNYYV    360
TQYAHALDPS PPEKLTAMTD SLANLTSLDA NGQPPGPPFS KGSYYHPRGM LNVMEHFKTK    420
YGDPLIYVTE NGFSTSGGPI PFTEAFHDYN RIDYLCSHLC FLRKAIKEKR VNVKGYFVWS    480
LGDNYEFCNG YTVRFGLSYV DFNNVTADRD LKASGLWYQS FLRDTTKNQD ILRSSLPFKN    540
GDRKSLT* 
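
The numbered sequence block above can be turned into a plain residue string and checked against the Signature Domain coordinates. The following is a minimal sketch, assuming those coordinates read as start 52 and end 524 (1-based, inclusive); the helper names `clean_sequence` and `slice_1based` are hypothetical and not part of any database tooling. Note that the cleaned sequence holds 547 residues, so the listed length of 548 presumably counts the trailing `*` stop symbol.

```python
import re

# Sequence block copied verbatim from the "Full Sequence" listing on this page.
RAW = """
MQHNTYIYIL TMKLLGFALA ILLVVATCKP EEEITCEENV PFTCSQTDRF NKQDFESDFI    60
FGVASSAYQI EGGRGRGLNV WDGFTHRYPE KGGADLGNGD TTCDSYRTWQ KDLDVMEELG    120
VKGYRFSFAW SRILPKGKRS RGINEDGINY YSGLIDGLIA RNITPFVTLF HWDLPQSLQD    180
EYEGFLDRTI IDDFKDYADL CFERFGDRVK HWITINQLFT VPTRGYALGT DAPGRCSQWV    240
DKRCYGGDSS TEPYIVAHNQ LLAHATVVDL YRTRYKYQGG KIGPVMITRW FLPYDDTLES    300
KQATWRAKEF FLGWFMEPLT KGKYPYIMRK LVGNRLPKFN STEARLLKGS YDFLGLNYYV    360
TQYAHALDPS PPEKLTAMTD SLANLTSLDA NGQPPGPPFS KGSYYHPRGM LNVMEHFKTK    420
YGDPLIYVTE NGFSTSGGPI PFTEAFHDYN RIDYLCSHLC FLRKAIKEKR VNVKGYFVWS    480
LGDNYEFCNG YTVRFGLSYV DFNNVTADRD LKASGLWYQS FLRDTTKNQD ILRSSLPFKN    540
GDRKSLT*
"""


def clean_sequence(raw: str) -> str:
    """Keep only uppercase residue letters; drops spaces, counters, and '*'."""
    return re.sub(r"[^A-Z]", "", raw)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


protein = clean_sequence(RAW)
# Assumed GH1 signature domain coordinates: start 52, end 524.
domain = slice_1based(protein, 52, 524)

print(len(protein), len(domain))
print(domain[:10], "...", domain[-10:])
```

Comparing `domain` with the sequence printed under Signature Domain is a quick way to confirm that the coordinates and the listed fragment agree.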
Functional Domains
Cdd ID     Domain         E-Value   Start  End  Length
PLN02849   PLN02849       7.0e-117  44     523  488
COG2723    BglB           4.0e-117  55     519  476
PLN02814   PLN02814       2.0e-117  45     533  496
TIGR03356  BGL            4.0e-127  58     518  465
pfam00232  Glyco_hydro_1  0         55     523  475
Gene Ontology
GO Term     Description
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005773  vacuole
GO:0005777  peroxisome
GO:0005975  carbohydrate metabolic process
GO:0009507  chloroplast
Annotations - NR
Source   Hit ID       E-Value  Query Start  Query End  Hit Start  Hit End  Description
GenBank  AAD40134.1   0        12           547        1          536      AF149413_15 Arabidopsis thaliana thioglucosidase (GB:X79195); Pfam PF00232, Score=702.5, E=1.9e-207, N=1
GenBank  AAK32833.1   0        12           547        1          536      AF361821_1 AT5g25980/T1N24_18 [Arabidopsis thaliana]
GenBank  AAN86072.1   0        13           547        112        646      carboxypeptidase Y/myrosinase fusion protein [synthetic construct]
DDBJ     BAE98479.1   0        1            547        1          547      myrosinase TGG2 [Arabidopsis thaliana]
RefSeq   NP_568479.1  0        1            547        1          547      TGG2 (GLUCOSIDE GLUCOHYDROLASE 2); hydrolase, hydrolyzing O-glycosyl compounds / thioglucosidase [Arabidopsis thaliana]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     1myr_A  0        31           522        1          499      A Chain A, Myrosinase From Sinapis Alba
PDB     2wxd_M  0        31           522        1          499      A Chain A, Myrosinase From Sinapis Alba
PDB     1w9d_M  0        31           522        1          499      A Chain A, Myrosinase From Sinapis Alba
PDB     1w9b_M  0        31           522        1          499      A Chain A, Myrosinase From Sinapis Alba
PDB     1e73_M  0        31           522        1          499      A Chain A, Myrosinase From Sinapis Alba
Transmembrane Domains
Start  End
5      27
Signal Peptide
Cleavage Site
20
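
As an illustration of what the cleavage site implies, the sketch below splits the N-terminus of the protein at position 20. It assumes the site is a 1-based position and that cleavage occurs directly after that residue, which this page does not state; `split_at_cleavage` is a hypothetical helper, and only the first 30 residues of the listed sequence are used.

```python
from typing import Tuple

# First 30 residues of the protein sequence listed on this page.
N_TERMINUS = "MQHNTYIYILTMKLLGFALAILLVVATCKP"


def split_at_cleavage(seq: str, site: int) -> Tuple[str, str]:
    """Split seq after the 1-based residue `site` (assumed convention)."""
    if not 0 < site < len(seq):
        raise ValueError("cleavage site must fall inside the sequence")
    return seq[:site], seq[site:]


# Assumed reading: cleavage after residue 20.
signal_peptide, mature_start = split_at_cleavage(N_TERMINUS, 20)

print("signal peptide:", signal_peptide)
print("mature protein begins:", mature_start)
```

Under a different convention (for example, the site naming the first residue of the mature protein), the split point would shift by one residue.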
Hydropathy
EST
Hit       Length  Start  End  EValue
EG520295  320     63     382  0
EG453010  305     32     336  0
EG453007  305     32     336  0
EG481640  292     72     363  0
EG488896  287     262    548  0
Orthologous Group
Species                  ID
Arabidopsis lyrata       917734, 489446
Arabidopsis thaliana     AT5G25980.1, AT5G26000.1, AT5G48375.1, AT1G51490.1, AT5G26000.2, AT5G25980.3
Brassica rapa            Bra016676, Bra039823, Bra039705, Bra039824, Bra004012, Bra014287, Bra038094, Bra023838, Bra032343, Bra036914, Bra020523.35.161, Bra020523.35.161, Bra020549
Carica papaya            evm.model.supercontig_17.152
Capsella rubella         Carubv10000656m, Carubv10007356m.543.827, Carubv10007356m.543.827, Carubv10006917m, Carubv10022237m, Carubv10007356m.45.502, Carubv10007356m.45.502
Linum usitatissimum      Lus10012868
Thellungiella halophila  Thhalv10002474m, Thhalv10004165m, Thhalv10001184m, Thhalv10002471m, Thhalv10002470m, Thhalv10003945m, Thhalv10003954m, Thhalv10012086m, Thhalv10000681m, Thhalv10011390m, Thhalv10012108m
Sequence Alignments
Phylogeny