CAZyme Information

Basic Information
SpeciesThellungiella halophila
Cazyme IDThhalv10001184m
FamilyGH1
Protein PropertiesLength: 474 Molecular Weight: 54387.6 Isoelectric Point: 6.2055
ChromosomeChromosome/Scaffold: 20 Start: 1345834 End: 1348467
Descriptionthioglucoside glucohydrolase 1
View CDS
External Links
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH1344730
  CSNTDRFNKKAGKDLANGDTTCESYTRWQKDIDVMDELNANGYRFSFAWSRIIPKGKVSRGVNKGGLEYYHKLIDGLIAKKITPFVTLFHWDLPQTLQDE
  YEGFLDRTIIDDFRDYADLCFKEFGGKVKNWITINQLYTVPTRGYAIGTDAPGRCSPAVDERCYEGNSSTEPYIVAHNQLLAHAAVVDLYRTKYKFQGGK
  IGPVMITRWFLPFDETDKASIDATERMKEFFFGWFMEPLTKGRYPDIMRQIVGSRLPNFTEAEAKLVKGSYDFLGLNYYVTQYAQPSNNIVPPEKHTAMM
  DSGTTLTYKNARGELIGPLFSKEEDETKNSYYYPKGIYYVMDHFKTRYGDPLIYVTENGISTPGEETREEAVADSKRIDYLCSHLCFLRKVIKEKRVNVK
  GYFAWSLGDNYEFCKGFTVRFGLSYVNWGDLDDRNLKDSG
Full Sequence
Protein Sequence     Length: 474     Download
MKFYGLAIVF LLAVATTCKA DDEITCEENE PFTCSNTDRF NKKAGKDLAN GDTTCESYTR    60
WQKDIDVMDE LNANGYRFSF AWSRIIPKGK VSRGVNKGGL EYYHKLIDGL IAKKITPFVT    120
LFHWDLPQTL QDEYEGFLDR TIIDDFRDYA DLCFKEFGGK VKNWITINQL YTVPTRGYAI    180
GTDAPGRCSP AVDERCYEGN SSTEPYIVAH NQLLAHAAVV DLYRTKYKFQ GGKIGPVMIT    240
RWFLPFDETD KASIDATERM KEFFFGWFME PLTKGRYPDI MRQIVGSRLP NFTEAEAKLV    300
KGSYDFLGLN YYVTQYAQPS NNIVPPEKHT AMMDSGTTLT YKNARGELIG PLFSKEEDET    360
KNSYYYPKGI YYVMDHFKTR YGDPLIYVTE NGISTPGEET REEAVADSKR IDYLCSHLCF    420
LRKVIKEKRV NVKGYFAWSL GDNYEFCKGF TVRFGLSYVN WGDLDDRNLK DSGK          480
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
PLN02998PLN029985.0e-9738474439+
PLN02814PLN028142.0e-10150463423+
PLN02849PLN028492.0e-10246463419+
TIGR03356BGL4.0e-10938474447+
pfam00232Glyco_hydro_15.0e-17235474446+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975carbohydrate metabolic process
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
GenBankAAV80207.101747417513myrosinase [Brassica rapa subsp. pekinensis]
GenBankAAX68547.1034744513myrosinase [Brassica rapa var. parachinensis]
DDBJBAE16356.1014741511myrosinase [Eutrema wasabi]
EMBLCAA79990.101747417509myrosinase, thioglucoside glucohydrolase [Brassica napus]
Swiss-ProtQ0032601747417513MYRO_BRANA RecName: Full=Myrosinase; AltName: Full=Sinigrinase; AltName: Full=Thioglucosidase; Flags: Precursor
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB2wxd_M0214741493A Chain A, Crystal Structure Of Bermuda Grass Isoallergen Bg60 Provides Insight Into The Various Cross-Allergenicity Of The Pollen Group 4 Allergens
PDB1w9d_M0214741493A Chain A, Crystal Structure Of Bermuda Grass Isoallergen Bg60 Provides Insight Into The Various Cross-Allergenicity Of The Pollen Group 4 Allergens
PDB1w9b_M0214741493A Chain A, Crystal Structure Of Bermuda Grass Isoallergen Bg60 Provides Insight Into The Various Cross-Allergenicity Of The Pollen Group 4 Allergens
PDB1e73_M0214741493A Chain A, Crystal Structure Of Bermuda Grass Isoallergen Bg60 Provides Insight Into The Various Cross-Allergenicity Of The Pollen Group 4 Allergens
PDB1e72_M0214741493A Chain A, Crystal Structure Of Bermuda Grass Isoallergen Bg60 Provides Insight Into The Various Cross-Allergenicity Of The Pollen Group 4 Allergens
Signal Peptide
Cleavage Site
20
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
EG525046314353470
BM986073266923570
CX192297262633220
EG520295296373320
EV094420279473230
Orthologous Group
SpeciesID
Arabidopsis lyrata917734489446
Arabidopsis thalianaAT5G25980.1AT5G26000.1AT5G48375.1AT5G25980.2AT1G51490.1
AT5G26000.2AT5G25980.3
Brassica rapaBra016676Bra039823Bra039705Bra039824Bra004012
Bra014287Bra038094Bra023838Bra032343Bra036914
Bra020523.35.161Bra020523.35.161Bra020549
Carica papayaevm.model.supercontig_17.152
Capsella rubellaCarubv10000656mCarubv10007356m.543.827Carubv10007356m.543.827Carubv10006917mCarubv10022237m
Carubv10007356m.45.502Carubv10007356m.45.502
Linum usitatissimumLus10012868
Thellungiella halophilaThhalv10002474mThhalv10004165mThhalv10002471mThhalv10002470mThhalv10003945m
Thhalv10003954mThhalv10012086mThhalv10000681mThhalv10011390mThhalv10012108m
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny