CAZyme Information

Basic Information
SpeciesCucumis sativus
Cazyme IDCucsa.307940.11
FamilyGH2
Protein PropertiesLength: 1115 Molecular Weight: 125866 Isoelectric Point: 6.1308
ChromosomeChromosome/Scaffold: 02978 Start: 221828 End: 230918
Descriptionglycoside hydrolase family 2 protein
View CDS
External Links
NCBI Taxonomy
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH2879800
  IKSLSGYWKFYLAATPTSVPHNFHATVFEDSQWANLPVPSNWQMHGFDRPIYTNVVYPFPLDPPHVPEDNPTGCYRTYFHLPEEWKGRRILLHFEAVDSA
  FFAWINGSLVGYSQDSRLPAEFEITEYCHPCGSQSKNVLAVQVLKWSDGSYLEDQDQWWLSGIHRDVILLSKPQVFIGDYFFKSHVGEDFSYADIQVEVK
  IDSSLEGRKENFLNNFKLEAVLFDSGSWDNHDGNIDLLSSNMANVKLSLLSVTTLGFHGYVLGGRLQKPKLWSAEQPHLYTLIVLLKDSSDQIVDCESCL
  VGIRSITKGPKQLLVNGRPVVIRGVNRHEHHPRLGKTNIEACMVRDLVLMKQHNINAVRNSHYPQHSRWYELCDLFGMYMVDEANIETHGFDFSGHVKHP
  TLQPSWAAAMLDRVIGMVERDKNHACIIVWSLGNESGYGPNHSALAGWIRGKDSSRVLHYEGGGSRTSSTDIICPMYMRVWDIVNIANDPNETRPLILCE
  YSHSMGNSTGNLHKYWEAIDNTFGLQGGFIWDWVDQALLKEVGNGRKRWAYGGEFGDIPNDSTFCLNGVTWPDRTPHPALHEVKYLHQAIKISSKDGTLE
  VLNGHFFSTTEDLEFSWSIYGDGLELGNGILSLPVIGPRGSYNIEWQSSPWYDLWASSSALEFFLTISVKLLHSTRWAEAGHIVSLSQVQLPMKREFFPH
  SIKNGSSTLVNEILGDSVRVYQQNLWEIKLDVQTGTLESWKVKGVPLIIKGIIPSFWRAPTENDKGGGSCSYLSVWKAAHIDNLSFTAERCSILSTTEHY
  VKIAVIFLGVRSDDRQASNSDLEKSNVLIQADMTYTIFGSGDVLVNCNVQPSPNLPPLPRVGVKFHLDKSMDRVKWYGRGPFECYPDRKAAAHV
Full Sequence
Protein Sequence     Length: 1115     Download
MAALASKLLM PSDNGYRVWE DQTFIKWRKR DSHVPLRCQD SVEGCLKYWQ DRTKVDLLVS    60
NSAVWNDDAV QSALDSAAFW VKDLPFIKSL SGYWKFYLAA TPTSVPHNFH ATVFEDSQWA    120
NLPVPSNWQM HGFDRPIYTN VVYPFPLDPP HVPEDNPTGC YRTYFHLPEE WKGRRILLHF    180
EAVDSAFFAW INGSLVGYSQ DSRLPAEFEI TEYCHPCGSQ SKNVLAVQVL KWSDGSYLED    240
QDQWWLSGIH RDVILLSKPQ VFIGDYFFKS HVGEDFSYAD IQVEVKIDSS LEGRKENFLN    300
NFKLEAVLFD SGSWDNHDGN IDLLSSNMAN VKLSLLSVTT LGFHGYVLGG RLQKPKLWSA    360
EQPHLYTLIV LLKDSSDQIV DCESCLVGIR SITKGPKQLL VNGRPVVIRG VNRHEHHPRL    420
GKTNIEACMV RDLVLMKQHN INAVRNSHYP QHSRWYELCD LFGMYMVDEA NIETHGFDFS    480
GHVKHPTLQP SWAAAMLDRV IGMVERDKNH ACIIVWSLGN ESGYGPNHSA LAGWIRGKDS    540
SRVLHYEGGG SRTSSTDIIC PMYMRVWDIV NIANDPNETR PLILCEYSHS MGNSTGNLHK    600
YWEAIDNTFG LQGGFIWDWV DQALLKEVGN GRKRWAYGGE FGDIPNDSTF CLNGVTWPDR    660
TPHPALHEVK YLHQAIKISS KDGTLEVLNG HFFSTTEDLE FSWSIYGDGL ELGNGILSLP    720
VIGPRGSYNI EWQSSPWYDL WASSSALEFF LTISVKLLHS TRWAEAGHIV SLSQVQLPMK    780
REFFPHSIKN GSSTLVNEIL GDSVRVYQQN LWEIKLDVQT GTLESWKVKG VPLIIKGIIP    840
SFWRAPTEND KGGGSCSYLS VWKAAHIDNL SFTAERCSIL STTEHYVKIA VIFLGVRSDD    900
RQASNSDLEK SNVLIQADMT YTIFGSGDVL VNCNVQPSPN LPPLPRVGVK FHLDKSMDRV    960
KWYGRGPFEC YPDRKAAAHV GVYEKNVSEM HVPYIVPGES SGRTDVRWVT FENKDGVGIY    1020
ASIYGSSPPM QMRASYYSTA ELERAVHNDD LVEGDDIEVN LDHKHMGVGG DDSWSPCVHE    1080
EYLLPPVPYS FSIRFCPVTP STSGYDAYRS QLLL* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
smart01038Bgal_small_N1.0e-948121095286+
pfam02836Glyco_hydro_2_C3.0e-107396678293+
COG3250LacZ1.0e-15987978900+
PRK09525lacZ01710971123+
PRK10340ebgA08810981026+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565beta-galactosidase activity
GO:0005975carbohydrate metabolic process
GO:0009341beta-galactosidase complex
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
RefSeqXP_002266400.101111211112PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeqXP_002266434.101111211113PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
RefSeqXP_002299206.101111211108predicted protein [Populus trichocarpa]
RefSeqXP_002303929.101111311112predicted protein [Populus trichocarpa]
RefSeqXP_002513059.101111111107beta-galactosidase, putative [Ricinus communis]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB3muy_408710965110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_308710965110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_208710965110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_108710965110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB1jz7_D0871096511021A Chain A, E. Coli (Lacz) Beta-Galactosidase In Complex With Galactose
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
HO8042743901295150
ES7944864054518550
EL4443012984387350
EG3970692724016720
HO804274505015490.0000000008
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_014_00605.1Aquca_014_00605.2
Arabidopsis lyrata485844
Arabidopsis thalianaAT3G54440.1AT3G54440.2AT3G54440.3
Brachypodium distachyonBradi2g61010.1
Brassica rapaBra014816
Carica papayaevm.model.supercontig_540.3
Capsella rubellaCarubv10019396m
Chlamydomonas reinhardtiiCre08.g379450.t1.3
Citrus clementinaCiclev10030068mCiclev10027821m
Citrus sinensisorange1.1g004315morange1.1g004363morange1.1g006933morange1.1g023943morange1.1g023667m
orange1.1g022918morange1.1g007038m
Cucumis sativusCucsa.307940.1Cucsa.307940.12
Eucalyptus grandisEucgr.J01181.1
Fragaria vescamrna14286.1-v1.0-hybrid
Glycine maxGlyma13g26700.1Glyma15g37670.1
Gossypium raimondiiGorai.010G216500.1
Linum usitatissimumLus10024155Lus10039520Lus10039519
Manihot esculentacassava4.1_002723m
Medicago truncatulaMedtr8g039160.1
Mimulus guttatusmgv1a000801m
Oryza sativaLOC_Os01g72340.1
Panicum virgatumPavirv00010284mPavirv00069653m
Physcomitrella patensPp1s130_43V6.1
Phaseolus vulgarisPhvul.011G206800.1
Picea abiesMA_10429668g0010
Populus trichocarpaPotri.003G196500.1Potri.003G196500.2Potri.001G027400.1
Prunus persicappa000508mppa000532m
Ricinus communis30131.m007196
Setaria italicaSi000118mSi000119m
Selaginella moellendorffii181800
Sorghum bicolorSb03g046050.2Sb03g046050.1
Thellungiella halophilaThhalv10010080m
Vitis viniferaGSVIVT01013489001
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny