CAZyme Information

Basic Information
Species: Capsella rubella
CAZyme ID: Carubv10000656m
Family: GH1
Protein Properties: Length: 542, Molecular Weight: 61809.4, Isoelectric Point: 6.083
Chromosome: Chromosome/Scaffold: 6, Start: 9054521, End: 9057366
Description: thioglucoside glucohydrolase 1
Signature Domain
Family  Start  End  Evalue
GH1     42     512  0
  KSFGDKFIFGVASSAYQIEGGRGRGVNVWDGFTHRYPEKGGADLKNGDTTCDSYTYWQKDIDVMGELNATGYRFSLAWSRIIPKGKRSRGVNQDGIDYYN
  GLIDGLIARNITPFVTLYHWDLPQTLQDEYEGFLDRTIIDDFKDFADLCFEKFGDRVKHWITINQLYTVPTRGYAIATDAPGRCSPKIDKRCYGGNSSTE
  PYIVAHNQLLAHAVAVDLYRTKYKNQGGMIGPVMITRWFLPFDDTQESKDATERSKEFFHGWFMEPLTQGKYPQIMIDLVGDRLPTFNETEAKLVKGSYD
  FLGLNYYVTQYAQNNDTIVPSDVHTAMMDSKATLTAKNSKGEAPGPMFNANTYYYPRGIYYVMEYFKNKYGDPLIYITENGISSPGDQTFDESIADYKRI
  DYLCSHLCFLRKVIREKNVNVRGYFAWSLGDNYEFCNGFTVRFGLSYVDFNNVTADRDLKASGKWFQQFIN
Full Sequence
Protein Sequence     Length: 542
MKFQWLALVF LLAMATCKGK EEYICEENEP FHCNQTSRFN GKSFGDKFIF GVASSAYQIE    60
GGRGRGVNVW DGFTHRYPEK GGADLKNGDT TCDSYTYWQK DIDVMGELNA TGYRFSLAWS    120
RIIPKGKRSR GVNQDGIDYY NGLIDGLIAR NITPFVTLYH WDLPQTLQDE YEGFLDRTII    180
DDFKDFADLC FEKFGDRVKH WITINQLYTV PTRGYAIATD APGRCSPKID KRCYGGNSST    240
EPYIVAHNQL LAHAVAVDLY RTKYKNQGGM IGPVMITRWF LPFDDTQESK DATERSKEFF    300
HGWFMEPLTQ GKYPQIMIDL VGDRLPTFNE TEAKLVKGSY DFLGLNYYVT QYAQNNDTIV    360
PSDVHTAMMD SKATLTAKNS KGEAPGPMFN ANTYYYPRGI YYVMEYFKNK YGDPLIYITE    420
NGISSPGDQT FDESIADYKR IDYLCSHLCF LRKVIREKNV NVRGYFAWSL GDNYEFCNGF    480
TVRFGLSYVD FNNVTADRDL KASGKWFQQF INVSSNDPAD QDLLSSSLSS KNRDRKSLAD    540
A* 
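The numbered block above can be collapsed into a single residue string, which makes it easy to check the Signature Domain coordinates (GH1, 42 to 512) against the full sequence. The Python sketch below is illustrative only. The helper names `parse_sequence_block` and `extract_region` are hypothetical, and the sketch assumes that coordinates are 1-based and inclusive and that the trailing `*` is a stop marker rather than a residue.

```python
import re

# Full protein sequence block, copied from the record above.
SEQUENCE_BLOCK = """
MKFQWLALVF LLAMATCKGK EEYICEENEP FHCNQTSRFN GKSFGDKFIF GVASSAYQIE    60
GGRGRGVNVW DGFTHRYPEK GGADLKNGDT TCDSYTYWQK DIDVMGELNA TGYRFSLAWS    120
RIIPKGKRSR GVNQDGIDYY NGLIDGLIAR NITPFVTLYH WDLPQTLQDE YEGFLDRTII    180
DDFKDFADLC FEKFGDRVKH WITINQLYTV PTRGYAIATD APGRCSPKID KRCYGGNSST    240
EPYIVAHNQL LAHAVAVDLY RTKYKNQGGM IGPVMITRWF LPFDDTQESK DATERSKEFF    300
HGWFMEPLTQ GKYPQIMIDL VGDRLPTFNE TEAKLVKGSY DFLGLNYYVT QYAQNNDTIV    360
PSDVHTAMMD SKATLTAKNS KGEAPGPMFN ANTYYYPRGI YYVMEYFKNK YGDPLIYITE    420
NGISSPGDQT FDESIADYKR IDYLCSHLCF LRKVIREKNV NVRGYFAWSL GDNYEFCNGF    480
TVRFGLSYVD FNNVTADRDL KASGKWFQQF INVSSNDPAD QDLLSSSLSS KNRDRKSLAD    540
A*
"""


def parse_sequence_block(block: str) -> str:
    """Drop position counters, spacing and the '*' marker; keep letters only."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (assumed 1-based, inclusive coordinates)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    return sequence[start - 1:end]


protein = parse_sequence_block(SEQUENCE_BLOCK)
gh1_domain = extract_region(protein, 42, 512)

print(len(protein))                       # residue letters, '*' excluded
print(len(gh1_domain))                    # length of the 42-512 slice
print(gh1_domain[:10], gh1_domain[-8:])   # both ends of the slice
```

Under these assumptions the slice begins with KSFGDKFIFG and ends with KWFQQFIN, which agrees with the Signature Domain sequence listed in this record. The block contains 541 residue letters plus the `*`, so the reported length of 542 presumably counts that marker.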
Functional Domains
Cdd ID     Domain         E-Value   Start  End  Length
PLN02849   PLN02849       7.0e-116  36     521  495
PLN02814   PLN02814       1.0e-117  39     522  493
COG2723    BglB           5.0e-119  42     511  480
TIGR03356  BGL            8.0e-128  46     507  473
pfam00232  Glyco_hydro_1  0         42     512  479
Gene Ontology
GO Term     Description
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975  carbohydrate metabolic process
Annotations - NR
Source   Hit ID       E-Value  Query Start  Query End  Hit Start  Hit End  Description
GenBank  AAL06896.1   0        1            541        1          541      AT5g26000/T1N24_7 [Arabidopsis thaliana]
GenBank  AAL25596.1   0        1            541        1          541      AT5g26000/T1N24_7 [Arabidopsis thaliana]
EMBL     CAH40827.1   0        21           497        3          479      thioglucoside glucohydrolase [Arabidopsis lyrata subsp. lyrata]
RefSeq   NP_568479.1  0        1            539        12         547      TGG2 (GLUCOSIDE GLUCOHYDROLASE 2); hydrolase, hydrolyzing O-glycosyl compounds / thioglucosidase [Arabidopsis thaliana]
RefSeq   NP_851077.1  0        1            541        1          541      TGG1 (THIOGLUCOSIDE GLUCOHYDROLASE 1); hydrolase, hydrolyzing O-glycosyl compounds / thioglucosidase [Arabidopsis thaliana]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     1myr_A  0        21           512        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     2wxd_M  0        21           512        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     1w9d_M  0        21           512        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     1w9b_M  0        21           512        2          500      A Chain A, Myrosinase From Sinapis Alba
PDB     1e73_M  0        21           512        2          500      A Chain A, Myrosinase From Sinapis Alba
Signal Peptide
Cleavage Site: 19
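As a small illustration of what the cleavage site implies, the sketch below splits the N-terminus into the predicted signal peptide and the start of the mature protein. It assumes that a cleavage site of 19 means cleavage after residue 19 (1-based), which this record does not state explicitly. `split_at_cleavage_site` is a hypothetical helper, and only the first 60 residues of the protein sequence are used.

```python
# First 60 residues of the protein sequence from this record.
N_TERMINUS = (
    "MKFQWLALVFLLAMATCKGKEEYICEENEP"
    "FHCNQTSRFNGKSFGDKFIFGVASSAYQIE"
)


def split_at_cleavage_site(sequence: str, cleavage_site: int) -> tuple[str, str]:
    """Split after residue `cleavage_site` (assumed 1-based position)."""
    if not 0 < cleavage_site < len(sequence):
        raise ValueError("cleavage site must fall inside the sequence")
    return sequence[:cleavage_site], sequence[cleavage_site:]


signal_peptide, mature_start = split_at_cleavage_site(N_TERMINUS, 19)

print(signal_peptide)      # residues 1-19
print(mature_start[:10])   # first ten residues after the assumed cleavage
```

If the convention were instead cleavage before residue 19, the split index would shift by one, so the assumption should be checked against the prediction tool that produced the value.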
Hydropathy
EST
Hit       Length  Start  End  EValue
EG525046  322     65     385  0
EG520295  317     52     368  0
EG453010  305     21     325  0
EG453007  305     21     325  0
EG481640  292     61     352  0
Orthologous Group
Species                  ID
Arabidopsis lyrata       917734 489446
Arabidopsis thaliana     AT5G25980.1 AT5G26000.1 AT5G48375.1 AT5G25980.2 AT1G51490.1
                         AT5G26000.2 AT5G25980.3
Brassica rapa            Bra016676 Bra039823 Bra039705 Bra039824 Bra004012
                         Bra014287 Bra038094 Bra023838 Bra032343 Bra036914
                         Bra020523.35.161 Bra020523.35.161 Bra020549
Carica papaya            evm.model.supercontig_17.152
Capsella rubella         Carubv10007356m.543.827 Carubv10007356m.543.827 Carubv10006917m Carubv10022237m Carubv10007356m.45.502
                         Carubv10007356m.45.502
Linum usitatissimum      Lus10012868
Thellungiella halophila  Thhalv10002474m Thhalv10004165m Thhalv10001184m Thhalv10002471m Thhalv10002470m
                         Thhalv10003945m Thhalv10003954m Thhalv10012086m Thhalv10000681m Thhalv10011390m
                         Thhalv10012108m
Sequence Alignments
Phylogeny