CAZyme Information

Basic Information
SpeciesThellungiella halophila
Cazyme IDThhalv10011390m
FamilyGH1
Protein PropertiesLength: 522 Molecular Weight: 58966.9 Isoelectric Point: 8.3036
ChromosomeChromosome/Scaffold: 7 Start: 10897600 End: 10900313
Descriptionbeta glucosidase 34
View CDS
External Links
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH1475200
  TGFPKNFTFGAATSAYQISRSFVTRIINLHVHVLKGIQILERSEKVPDRSSGDLACDSYDLYKEDVKLLKRMNAQAYRLSIAWSRVLPKGRLTGGIDENG
  ITYYNNLINELKANGIEPYVTIFHWDVPQTLEDEYGGFLSPRIVEDYKNYAELLFQRFGDRVKFWITLNQPYSLASKGYGDGSYPPGRCTDCEFGGDSGT
  EPYIVAHHQLLAHAEAVSLYRRRYQKFQGGKIGTTLIGRWFTPLNETSILDKAAAKRAFDFFVGWFLDPLVYGDYPKIMQEIVGDRLPKFTRQESALVRG
  SLDFLGLNYYVTQYATDAPPSIPTQPNVLTDPRVTIGFYRNGVPIGVQAPSFVYYPPGFRQILNHIKNNYKNPLTYITENGVADLDLGNLTLANALADKG
  RIQNHCSHLSCLKCSIEDGCNVGGYFAWSLMDNYEFGNGYTLRFGMNWVNFTNPAHRREKDSGKWFSKFIDNNN
Full Sequence
Protein Sequence     Length: 522     Download
MAIPKSHYSL AFLVILFAVT SCQNVCNPAC KAKEPFNCDN ILTFNRTGFP KNFTFGAATS    60
AYQISRSFVT RIINLHVHVL KGIQILERSE KVPDRSSGDL ACDSYDLYKE DVKLLKRMNA    120
QAYRLSIAWS RVLPKGRLTG GIDENGITYY NNLINELKAN GIEPYVTIFH WDVPQTLEDE    180
YGGFLSPRIV EDYKNYAELL FQRFGDRVKF WITLNQPYSL ASKGYGDGSY PPGRCTDCEF    240
GGDSGTEPYI VAHHQLLAHA EAVSLYRRRY QKFQGGKIGT TLIGRWFTPL NETSILDKAA    300
AKRAFDFFVG WFLDPLVYGD YPKIMQEIVG DRLPKFTRQE SALVRGSLDF LGLNYYVTQY    360
ATDAPPSIPT QPNVLTDPRV TIGFYRNGVP IGVQAPSFVY YPPGFRQILN HIKNNYKNPL    420
TYITENGVAD LDLGNLTLAN ALADKGRIQN HCSHLSCLKC SIEDGCNVGG YFAWSLMDNY    480
EFGNGYTLRF GMNWVNFTNP AHRREKDSGK WFSKFIDNNN N* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
COG2723BglB7.0e-10949519488+
TIGR03356BGL3.0e-11550512472+
PLN02814PLN028142.0e-11644517488+
PLN02849PLN028494.0e-11844520492+
pfam00232Glyco_hydro_19.0e-15949519481+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0005975carbohydrate metabolic process
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
GenBankAAD46026.1015161494AC007519_11 Similar to gi
GenBankAAG52628.1015161463AC024261_15 myrosinase precursor, putative; 53323-50499 [Arabidopsis thaliana]
GenBankACO95141.1015161510beta-thioglucoside glucohydrolase [Arabidopsis thaliana]
RefSeqNP_175191.2015161509BGLU34 (BETA GLUCOSIDASE 34); hydrolase, hydrolyzing O-glycosyl compounds / thioglucosidase [Arabidopsis thaliana]
RefSeqNP_175558.3015161509BGLU35 (BETA GLUCOSIDASE 35); catalytic/ cation binding / hydrolase, hydrolyzing O-glycosyl compounds [Arabidopsis thaliana]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB3ptq_B0185161503B Chain B, Reassembly And Co-Crystallization Of A Family 9 Processive Endoglucanase From Separately Expressed Gh9 And Cbm3c Modules
PDB3ptq_A0185161503B Chain B, Reassembly And Co-Crystallization Of A Family 9 Processive Endoglucanase From Separately Expressed Gh9 And Cbm3c Modules
PDB3ptm_B0185161503B Chain B, Reassembly And Co-Crystallization Of A Family 9 Processive Endoglucanase From Separately Expressed Gh9 And Cbm3c Modules
PDB3ptm_A0185161503B Chain B, Reassembly And Co-Crystallization Of A Family 9 Processive Endoglucanase From Separately Expressed Gh9 And Cbm3c Modules
PDB3ptk_B0185161503A Chain A, The Crystal Structure Of Rice (Oryza Sativa L.) Os4bglu12
Signal Peptide
Cleavage Site
22
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
ES900918284323120
ES90779427312700
ES90925226512620
ES90606526712650
EV11754126312610
Orthologous Group
SpeciesID
Arabidopsis lyrata917734489446
Arabidopsis thalianaAT5G25980.1AT5G26000.1AT5G48375.1AT5G25980.2AT1G51490.1
AT5G26000.2AT5G25980.3
Brassica rapaBra016676Bra039823Bra039705Bra039824Bra004012
Bra014287Bra038094Bra023838Bra032343Bra036914
Bra020523.35.161Bra020523.35.161Bra020549
Carica papayaevm.model.supercontig_17.152
Capsella rubellaCarubv10000656mCarubv10007356m.543.827Carubv10007356m.543.827Carubv10006917mCarubv10022237m
Carubv10007356m.45.502Carubv10007356m.45.502
Linum usitatissimumLus10012868
Thellungiella halophilaThhalv10002474mThhalv10004165mThhalv10001184mThhalv10002471mThhalv10002470m
Thhalv10003945mThhalv10003954mThhalv10012086mThhalv10000681mThhalv10012108m
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny