CAZyme Information

Basic Information
SpeciesLinum usitatissimum
Cazyme IDLus10024155
FamilyGH2
Protein PropertiesLength: 1079 Molecular Weight: 121289 Isoelectric Point: 6.0525
ChromosomeChromosome/Scaffold: 353 Start: 800551 End: 807681
Descriptionglycoside hydrolase family 2 protein
View CDS
External Links
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH2889440
  PYVRSLSGFWKFFLAPCPSKVPPAFFDVAFQDSEWGMLPVPSNWQMHGFDKPIYTNVVYPFPLDPPHVPQDDNPTGCYRTCFQIPTEWQGRRIFLHFEAV
  DSAFFVWVNGVLVGYSQDSRLPAEFEISDHCYPSGSDEMNILAVQVLRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFITDYYFKSSLTEDFSSSDIQV
  EVKIDNTRANPKDDIRASFTMDAALYDTGSWYAGDKSGLISTNVANLTAIPSSDAILGFIGYTLAGKLDMPKLWSAEQPNLYILVVTLKDVSGHVVDCES
  SLVGIRQVSKAPKQLLVNGKPVMIRGVNRHEHHPRIGKTNIESCMIKDLVLMKQYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFHLSGHLK
  PPAMEICWATAILDRVIGMVERDKNHASIISWSLGNESSYGPNHSAAAGWIREKDTSRLLHYEGGGSRTSSTDIICPMYMRVWDILKIAKDPTELRPLIL
  CEYSHAMGNSNGSLDEYWEAIDSTFGLQGGFIWDWVDQGLLKESTDGSKHWAYGGDFGDTPNDLNFCLNGLIWPDRTPHPAMHELKYVYQPIKVSIKDGK
  LKIINTNFFDTTQLLGFTWAAHGDGIELGSGELSLPVIEPQKSYEMKWESAGHIISSTQLQLPGRKEILPHALKSKDATICTEVVGDTIRISHQSLWEIT
  FNKLTGAVASWKVKGVSVLNKGIFPCFWRAPTDNDKGGEDRSYYSKWKAAHLDSIIFETKSCSIQNSANDHVKIEVVYSGIPRDGDGSTTLFQVEMTYII
  HGSGDLIVKCKATPSSNLPPLPRVGVEFHLEKTMDQVSWYGRGPFECYPDRKAAAHV
Full Sequence
Protein Sequence     Length: 1079     Download
MSASLPSPGL VFPFEDVSHR VWEDPSFIKW RKRDPHVPLH CHESVQGSLK YWHQRNRVDV    60
LLSKSAVWGD DAVQEALDSA AFWVKELPYV RSLSGFWKFF LAPCPSKVPP AFFDVAFQDS    120
EWGMLPVPSN WQMHGFDKPI YTNVVYPFPL DPPHVPQDDN PTGCYRTCFQ IPTEWQGRRI    180
FLHFEAVDSA FFVWVNGVLV GYSQDSRLPA EFEISDHCYP SGSDEMNILA VQVLRWSDGS    240
YLEDQDHWWL SGIHRDVLLL SKPQVFITDY YFKSSLTEDF SSSDIQVEVK IDNTRANPKD    300
DIRASFTMDA ALYDTGSWYA GDKSGLISTN VANLTAIPSS DAILGFIGYT LAGKLDMPKL    360
WSAEQPNLYI LVVTLKDVSG HVVDCESSLV GIRQVSKAPK QLLVNGKPVM IRGVNRHEHH    420
PRIGKTNIES CMIKDLVLMK QYNINAVRNS HYPQHPRWYE LCDLFGMYMI DEANIETHGF    480
HLSGHLKPPA MEICWATAIL DRVIGMVERD KNHASIISWS LGNESSYGPN HSAAAGWIRE    540
KDTSRLLHYE GGGSRTSSTD IICPMYMRVW DILKIAKDPT ELRPLILCEY SHAMGNSNGS    600
LDEYWEAIDS TFGLQGGFIW DWVDQGLLKE STDGSKHWAY GGDFGDTPND LNFCLNGLIW    660
PDRTPHPAMH ELKYVYQPIK VSIKDGKLKI INTNFFDTTQ LLGFTWAAHG DGIELGSGEL    720
SLPVIEPQKS YEMKWESAGH IISSTQLQLP GRKEILPHAL KSKDATICTE VVGDTIRISH    780
QSLWEITFNK LTGAVASWKV KGVSVLNKGI FPCFWRAPTD NDKGGEDRSY YSKWKAAHLD    840
SIIFETKSCS IQNSANDHVK IEVVYSGIPR DGDGSTTLFQ VEMTYIIHGS GDLIVKCKAT    900
PSSNLPPLPR VGVEFHLEKT MDQVSWYGRG PFECYPDRKA AAHVGVYEQK VADMHVPYIV    960
PGECGGRTDV RWVTFRDKDG VGIFASSYGN SPPLQVSASY YSTSELDRAT HNERLIQGND    1020
IEVHLDHKHM GLGGDDSWSP CVHEKFLVPA MPYTFSLRMC PVTAQTSGLD TYQSQVQN*     1080
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
smart01038Bgal_small_N2.0e-1017841059277+
pfam02836Glyco_hydro_2_C3.0e-110399680292+
COG3250LacZ2.0e-15590942866+
PRK09525lacZ02010611100+
PRK10340ebgA07610651020+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565beta-galactosidase activity
GO:0005975carbohydrate metabolic process
GO:0009341beta-galactosidase complex
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
RefSeqXP_002266400.1010107881114PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeqXP_002266434.1010107881115PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
RefSeqXP_002299206.1010107881110predicted protein [Populus trichocarpa]
RefSeqXP_002303929.1010107881113predicted protein [Populus trichocarpa]
RefSeqXP_002513059.103107821110beta-galactosidase, putative [Ricinus communis]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB3muy_409010605110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_309010605110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_209010605110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_109010605110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB1jz7_D0901060511021A Chain A, E. Coli (Lacz) Beta-Galactosidase In Complex With Galactose
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
HO8042743861325130
EL4443012984417380
EY8193132693506180
HO804274505045520.000002
EY819313256206446.6
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_014_00605.1Aquca_014_00605.2
Arabidopsis lyrata485844
Arabidopsis thalianaAT3G54440.1AT3G54440.2AT3G54440.3
Brachypodium distachyonBradi2g61010.1
Brassica rapaBra014816
Carica papayaevm.model.supercontig_540.3
Capsella rubellaCarubv10019396m
Chlamydomonas reinhardtiiCre08.g379450.t1.3
Citrus clementinaCiclev10030068mCiclev10027821m
Citrus sinensisorange1.1g004315morange1.1g004363morange1.1g006933morange1.1g023943morange1.1g023667m
orange1.1g022918morange1.1g007038m
Cucumis sativusCucsa.307940.11Cucsa.307940.1Cucsa.307940.12
Eucalyptus grandisEucgr.J01181.1
Fragaria vescamrna14286.1-v1.0-hybrid
Glycine maxGlyma13g26700.1Glyma15g37670.1
Gossypium raimondiiGorai.010G216500.1
Linum usitatissimumLus10039520Lus10039519
Manihot esculentacassava4.1_002723m
Medicago truncatulaMedtr8g039160.1
Mimulus guttatusmgv1a000801m
Oryza sativaLOC_Os01g72340.1
Panicum virgatumPavirv00010284mPavirv00069653m
Physcomitrella patensPp1s130_43V6.1
Phaseolus vulgarisPhvul.011G206800.1
Picea abiesMA_10429668g0010
Populus trichocarpaPotri.003G196500.1Potri.003G196500.2Potri.001G027400.1
Prunus persicappa000508mppa000532m
Ricinus communis30131.m007196
Setaria italicaSi000118mSi000119m
Selaginella moellendorffii181800
Sorghum bicolorSb03g046050.2Sb03g046050.1
Thellungiella halophilaThhalv10010080m
Vitis viniferaGSVIVT01013489001
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny