CAZyme Information

Basic Information
Species: Thellungiella halophila
CAZyme ID: Thhalv10002474m
Family: GH1
Protein Properties: Length: 541; Molecular Weight: 61473.5; Isoelectric Point: 7.4398
Chromosome/Scaffold: 4; Start: 5621222; End: 5624194
Description: thioglucoside glucohydrolase 1
External Links
CAZyDB
Signature Domain
Family    Start    End    Evalue
GH1       41       512    0
  SKSFGKDFIFGVASSAYQVEGGRGRGLNTWDAFTHRYPEKAGPDLGNGDTTCESYTRWQKDIDVMDELNATGYRFSFAWSRILPKGKVSRGVNPGGLKYY
  HDLIDGLLAKKITPFVTLYHWDLPQTLQDEYEGFLDRRIIDDFRDYADLCFKEFGGKVKNWITINQLYTVPTRGYAIGTDAPGRCSPAVDERCYGGNSST
  EPYIVAHNQLLAHAAVVDLYRTKYKFQGGKIGTVMITRWFLPYDVNDKASIEATERMKEFFFGWFMEPLTKGRYPDIMRQIVGSRLPNFTEAEARSVAGS
  YDFLGLNYYVTQYVQENKNTGPPEKHTAMMDADTNASGHIIGPLFAEDKIGGNSYYYPKGIYYTMDHFKTRYGDPLIYITENGISTPSEETREQAVADSS
  RIDYLCSHLCFLRKVIKEKRVNVKGYFAWALGDNYEFCKGFTVRFGLSYVNWTDLDDRNLKDSGKWFQRFIN
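The domain coordinates can be checked against the protein sequence. The sketch below assumes the signature domain row reads as start 41 and end 512, and that coordinates are 1-based and inclusive (an assumption, not something the record states). The helper name `subsequence` is hypothetical. Only residues 1-60 of the full sequence are embedded, so the check covers the first 20 residues of the domain.

```python
def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end of seq, using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"coordinates {start}-{end} fall outside 1-{len(seq)}")
    return seq[start - 1:end]


# Residues 1-60 of Thhalv10002474m, copied from the Full Sequence listing.
N_TERMINUS = (
    "MKLLGLALVF" "LLAVATCKAN" "EEITCEENLP"
    "FTCSNTDRLN" "SKSFGKDFIF" "GVASSAYQVE"
)

# Assumed reading of the signature domain row: GH1, start 41, end 512.
DOMAIN_START, DOMAIN_END = 41, 512

# Inclusive coordinates give a domain length of end - start + 1.
domain_length = DOMAIN_END - DOMAIN_START + 1

# First 20 residues of the domain, taken from the embedded N-terminal fragment.
domain_head = subsequence(N_TERMINUS, DOMAIN_START, 60)

print(domain_length, domain_head)
```

Under these assumptions the computed length agrees with the 472 residues listed for the domain, and the extracted fragment matches the first 20 residues of that listing.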
Full Sequence
Protein Sequence     Length: 541
MKLLGLALVF LLAVATCKAN EEITCEENLP FTCSNTDRLN SKSFGKDFIF GVASSAYQVE    60
GGRGRGLNTW DAFTHRYPEK AGPDLGNGDT TCESYTRWQK DIDVMDELNA TGYRFSFAWS    120
RILPKGKVSR GVNPGGLKYY HDLIDGLLAK KITPFVTLYH WDLPQTLQDE YEGFLDRRII    180
DDFRDYADLC FKEFGGKVKN WITINQLYTV PTRGYAIGTD APGRCSPAVD ERCYGGNSST    240
EPYIVAHNQL LAHAAVVDLY RTKYKFQGGK IGTVMITRWF LPYDVNDKAS IEATERMKEF    300
FFGWFMEPLT KGRYPDIMRQ IVGSRLPNFT EAEARSVAGS YDFLGLNYYV TQYVQENKNT    360
GPPEKHTAMM DADTNASGHI IGPLFAEDKI GGNSYYYPKG IYYTMDHFKT RYGDPLIYIT    420
ENGISTPSEE TREQAVADSS RIDYLCSHLC FLRKVIKEKR VNVKGYFAWA LGDNYEFCKG    480
FTVRFGLSYV NWTDLDDRNL KDSGKWFQRF INVTTIKPPS AKQEFLRSSL SFQNKKLADA    540

Functional Domains
Cdd ID       Domain           E-Value     Start   End   Length
COG2723      BglB             2.0e-107    42      511   484
PLN02849     PLN02849         6.0e-108    33      531   508
PLN02814     PLN02814         1.0e-110    34      514   493
TIGR03356    BGL              1.0e-122    46      507   474
pfam00232    Glyco_hydro_1    0           40      512   480
Gene Ontology
GO Term       Description
GO:0004553    hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975    carbohydrate metabolic process
Annotations - NR
Source        Hit ID        E-Value   Query Start   Query End   Hit Start   Hit End   Description
GenBank       AAV80207.1    0         1             525         1           531       myrosinase [Brassica rapa subsp. pekinensis]
GenBank       AAX68547.1    0         1             525         1           531       myrosinase [Brassica rapa var. parachinensis]
GenBank       ABG77972.1    0         1             525         1           531       myrosinase [Brassica oleracea var. alboglabra]
DDBJ          BAE16356.1    0         1             540         1           545       myrosinase [Eutrema wasabi]
Swiss-Prot    Q00326        0         1             525         1           531       MYRO_BRANA RecName: Full=Myrosinase; AltName: Full=Sinigrinase; AltName: Full=Thioglucosidase; Flags: Precursor
Annotations - PDB
Source   Hit ID    E-Value   Query Start   Query End   Hit Start   Hit End   Description
PDB      1myr_A    0         20            512         1           500       A Chain A, Myrosinase From Sinapis Alba
PDB      2wxd_M    0         20            512         1           500       A Chain A, Myrosinase From Sinapis Alba
PDB      1w9d_M    0         20            512         1           500       A Chain A, Myrosinase From Sinapis Alba
PDB      1w9b_M    0         20            512         1           500       A Chain A, Myrosinase From Sinapis Alba
PDB      1e73_M    0         20            512         1           500       A Chain A, Myrosinase From Sinapis Alba
Signal Peptide
Cleavage Site: 19
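As a rough illustration of what the cleavage site implies, the sketch below splits the N-terminal region into a signal peptide and the start of the mature protein. It assumes that a cleavage site of 19 means cleavage after residue 19 (1-based); the record does not state the convention, although the PDB hits beginning at query position 20 are consistent with it. The function name `split_signal_peptide` is hypothetical.

```python
from typing import Tuple


def split_signal_peptide(seq: str, cleavage_site: int) -> Tuple[str, str]:
    """Split seq after residue `cleavage_site` (1-based position, assumed convention)."""
    if not (0 < cleavage_site < len(seq)):
        raise ValueError("cleavage site must fall inside the sequence")
    return seq[:cleavage_site], seq[cleavage_site:]


# Residues 1-60 of Thhalv10002474m, copied from the Full Sequence listing.
N_TERMINUS = (
    "MKLLGLALVF" "LLAVATCKAN" "EEITCEENLP"
    "FTCSNTDRLN" "SKSFGKDFIF" "GVASSAYQVE"
)

signal_peptide, mature_start = split_signal_peptide(N_TERMINUS, 19)

print("signal peptide:", signal_peptide)
print("mature protein begins:", mature_start[:10])
```

With this reading, the signal peptide is the first 19 residues and the mature protein begins at residue 20.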
Hydropathy
EST
Hit         Length   Start   End   EValue
BM985570    281      37      317   0
BM986073    266      129     389   0
FY450174    280      16      293   0
EG453010    309      21      329   0
EG453007    309      21      329   0
Orthologous Group
Species                    ID
Arabidopsis lyrata         917734, 489446
Arabidopsis thaliana       AT5G25980.1, AT5G26000.1, AT5G48375.1, AT5G25980.2, AT1G51490.1, AT5G26000.2, AT5G25980.3
Brassica rapa              Bra016676, Bra039823, Bra039705, Bra039824, Bra004012, Bra014287, Bra038094, Bra023838, Bra032343, Bra036914, Bra020523.35.161, Bra020523.35.161, Bra020549
Carica papaya              evm.model.supercontig_17.152
Capsella rubella           Carubv10000656m, Carubv10007356m.543.827, Carubv10007356m.543.827, Carubv10006917m, Carubv10022237m, Carubv10007356m.45.502, Carubv10007356m.45.502
Linum usitatissimum        Lus10012868
Thellungiella halophila    Thhalv10004165m, Thhalv10001184m, Thhalv10002471m, Thhalv10002470m, Thhalv10003945m, Thhalv10003954m, Thhalv10012086m, Thhalv10000681m, Thhalv10011390m, Thhalv10012108m
Sequence Alignments
Phylogeny