CAZyme Information

Basic Information
SpeciesThellungiella halophila
Cazyme IDThhalv10010080m
FamilyGH2
Protein PropertiesLength: 1108 Molecular Weight: 125592 Isoelectric Point: 5.5453
ChromosomeChromosome/Scaffold: 16 Start: 693944 End: 702138
Descriptionglycoside hydrolase family 2 protein
View CDS
External Links
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH2879730
  VKSLSGFWKFFLAPSPANVPDKFYDAAFPDSDWKSLPVPSNWQCHGFDRPIYTNIVYPFPNDPPHVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSA
  FFAWINGKPVGYSQDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGLHRDVLLLAKPKVFIDDYFFKSKLADDFSYADIQVEVK
  IDNMLETSKDLVLSNFIIEAAVFDTKSWYNSGGFSYELSPKVASLKLNPSPSSSLGFHGYLLEGKLDSPNLWSAEQPNVYILVITLKDKSGKLLDSESSI
  VGVRQVSKAFKQLLVNGHPVMIKGVNRHEHHPRVGKTNIEACMIKDLIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHP
  TKEPSWAAAMLDRVVGMVERDKNHACIISWSLGNEANYGPNHSAMAGWIREKDPSRLVHYEGGGSRTDSTDIVCPMYMRVWDIVKIALDKNESRPLILCE
  YSHAMGNSNGNIDEYWEAIDNTFGLQGGFIWDWVDQGLLKLGSDGIKHWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGTMR
  VANAYFFHTTEELEFSWTIHGDGVELGSGTLSIPVIKPQNIYDMEWKSGPWFSLWNDSNTGESFLTITAKLLNPTRSLQAGHLLSSTQIPLPAKRQIIPQ
  AIKITDAIINCETVGDFIKISQQDSWELMIDVRKGAIEGWKMQGVLLTKEAILPCFWRAPTDNDKGGDDSSYFSRWKAAHMDNVQFLVQSCSVKSITDKS
  VEIEFIYLGSSASDSSKSDALFNVSVTYMIYGSGDIITNWYVVPNSDLPPLPRVGIEFHIEKTLDRVEWYGRGPFECYPDRKSAAHV
Full Sequence
Protein Sequence     Length: 1108     Download
MASLATQMIL PSENGYRVWE DQTLFKWRKR DPHVTLRCHD SVEGSLRYWY QRTNVDLTVS    60
KSAVWNDDAV QGALDSAAFW VEGLPFVKSL SGFWKFFLAP SPANVPDKFY DAAFPDSDWK    120
SLPVPSNWQC HGFDRPIYTN IVYPFPNDPP HVPEDNPTGC YRTYFQIPKE WKDRRILLHF    180
EAVDSAFFAW INGKPVGYSQ DSRLPAEFEI SDYCYPWDSG KQNVLAVQVF RWSDGSYLED    240
QDHWWLSGLH RDVLLLAKPK VFIDDYFFKS KLADDFSYAD IQVEVKIDNM LETSKDLVLS    300
NFIIEAAVFD TKSWYNSGGF SYELSPKVAS LKLNPSPSSS LGFHGYLLEG KLDSPNLWSA    360
EQPNVYILVI TLKDKSGKLL DSESSIVGVR QVSKAFKQLL VNGHPVMIKG VNRHEHHPRV    420
GKTNIEACMI KDLIMMKEYN INAVRNSHYP QHPRWYELCD LFGMYMIDEA NIETHGFDLS    480
GHLKHPTKEP SWAAAMLDRV VGMVERDKNH ACIISWSLGN EANYGPNHSA MAGWIREKDP    540
SRLVHYEGGG SRTDSTDIVC PMYMRVWDIV KIALDKNESR PLILCEYSHA MGNSNGNIDE    600
YWEAIDNTFG LQGGFIWDWV DQGLLKLGSD GIKHWAYGGD FGDQPNDLNF CLNGLIWPDR    660
TPHPALHEVK HCYQPIKVSL TDGTMRVANA YFFHTTEELE FSWTIHGDGV ELGSGTLSIP    720
VIKPQNIYDM EWKSGPWFSL WNDSNTGESF LTITAKLLNP TRSLQAGHLL SSTQIPLPAK    780
RQIIPQAIKI TDAIINCETV GDFIKISQQD SWELMIDVRK GAIEGWKMQG VLLTKEAILP    840
CFWRAPTDND KGGDDSSYFS RWKAAHMDNV QFLVQSCSVK SITDKSVEIE FIYLGSSASD    900
SSKSDALFNV SVTYMIYGSG DIITNWYVVP NSDLPPLPRV GIEFHIEKTL DRVEWYGRGP    960
FECYPDRKSA AHVAIYEDNV GDMHVPYIVP GECGGRTDVR WVTFRNKDGV GIYASTYGSS    1020
SPMQMNASYY TTSELHRATH EEDLIKGQNI EVHLDHKHMG VGGDDSWTPC VHEKYLIPPE    1080
PYSFSIRLCP ITAATSVLDI YKNQLPC* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
smart01038Bgal_small_N3.0e-968101088280+
pfam02836Glyco_hydro_2_C1.0e-110397677291+
COG3250LacZ2.0e-14187734659+
PRK09525lacZ08510901054+
PRK10340ebgA08810941011+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565beta-galactosidase activity
GO:0005975carbohydrate metabolic process
GO:0009341beta-galactosidase complex
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
EMBLCAB77564.101110711075beta Galactosidase-like protein [Arabidopsis thaliana]
RefSeqNP_001030858.101110711108glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeqNP_680128.101110711107glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeqXP_002303929.101110511111predicted protein [Populus trichocarpa]
RefSeqXP_002513059.101110211105beta-galactosidase, putative [Ricinus communis]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB3t2q_D0871089801050A Chain A, Polygalacturonase From Erwinia Carotovora Ssp. Carotovora
PDB3t2q_C0871089801050A Chain A, Polygalacturonase From Erwinia Carotovora Ssp. Carotovora
PDB3t2q_B0871089801050A Chain A, Polygalacturonase From Erwinia Carotovora Ssp. Carotovora
PDB3t2q_A0871089801050A Chain A, Polygalacturonase From Erwinia Carotovora Ssp. Carotovora
PDB3t2p_D0871089801050A Chain A, Polygalacturonase From Erwinia Carotovora Ssp. Carotovora
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
HO8042743851295100
DK49736726012600
DK50207125612560
DK4995622752445180
HO804274475045490.0000002
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_014_00605.1Aquca_014_00605.2
Arabidopsis lyrata485844
Arabidopsis thalianaAT3G54440.1AT3G54440.2AT3G54440.3
Brachypodium distachyonBradi2g61010.1
Brassica rapaBra014816
Carica papayaevm.model.supercontig_540.3
Capsella rubellaCarubv10019396m
Chlamydomonas reinhardtiiCre08.g379450.t1.3
Citrus clementinaCiclev10030068mCiclev10027821m
Citrus sinensisorange1.1g004315morange1.1g004363morange1.1g006933morange1.1g023943morange1.1g023667m
orange1.1g022918morange1.1g007038m
Cucumis sativusCucsa.307940.11Cucsa.307940.1Cucsa.307940.12
Eucalyptus grandisEucgr.J01181.1
Fragaria vescamrna14286.1-v1.0-hybrid
Glycine maxGlyma13g26700.1Glyma15g37670.1
Gossypium raimondiiGorai.010G216500.1
Linum usitatissimumLus10024155Lus10039520Lus10039519
Manihot esculentacassava4.1_002723m
Medicago truncatulaMedtr8g039160.1
Mimulus guttatusmgv1a000801m
Oryza sativaLOC_Os01g72340.1
Panicum virgatumPavirv00010284mPavirv00069653m
Physcomitrella patensPp1s130_43V6.1
Phaseolus vulgarisPhvul.011G206800.1
Picea abiesMA_10429668g0010
Populus trichocarpaPotri.003G196500.1Potri.003G196500.2Potri.001G027400.1
Prunus persicappa000508mppa000532m
Ricinus communis30131.m007196
Setaria italicaSi000118mSi000119m
Selaginella moellendorffii181800
Sorghum bicolorSb03g046050.2Sb03g046050.1
Vitis viniferaGSVIVT01013489001
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny