CAZyme Information

Basic Information
Species: Cucumis sativus
Cazyme ID: Cucsa.307940.12
Family: GH2
Protein Properties: Length: 1113; Molecular Weight: 125611; Isoelectric Point: 6.0871
Chromosome: Chromosome/Scaffold: 02978; Start: 221828; End: 230967
Description: glycoside hydrolase family 2 protein
Signature Domain
Family  Start  End  Evalue
GH2     87     978  0
  IKSLSGYWKFYLAATPTSVPHNFHATVFEDSQWANLPVPSNWQMHGFDRPIYTNVVYPFPLDPPHVPEDNPTGCYRTYFHLPEEWKGRRILLHFEAVDSA
  FFAWINGSLVGYSQDSRLPAEFEITEYCHPCGSQSKNVLAVQVLKWSDGSYLEDQDQWWLSGIHRDVILLSKPQVFIGDYFFKSHVGEDFSYADIQVEVK
  IDSSLEGRKENFLNNFKLEAVLFDSGSWDNHDGNIDLLSSNMANVKLSLLSVTTLGFHGYVLGGRLQKPKLWSAEQPHLYTLIVLLKDSSDQIVDCESCL
  VGIRSITKGPKQLLVNGRPVVIRGVNRHEHHPRLGKTNIEACMDLVLMKQHNINAVRNSHYPQHSRWYELCDLFGMYMVDEANIETHGFDFSGHVKHPTL
  QPSWAAAMLDRVIGMVERDKNHACIIVWSLGNESGYGPNHSALAGWIRGKDSSRVLHYEGGGSRTSSTDIICPMYMRVWDIVNIANDPNETRPLILCEYS
  HSMGNSTGNLHKYWEAIDNTFGLQGGFIWDWVDQALLKEVGNGRKRWAYGGEFGDIPNDSTFCLNGVTWPDRTPHPALHEVKYLHQAIKISSKDGTLEVL
  NGHFFSTTEDLEFSWSIYGDGLELGNGILSLPVIGPRGSYNIEWQSSPWYDLWASSSALEFFLTISVKLLHSTRWAEAGHIVSLSQVQLPMKREFFPHSI
  KNGSSTLVNEILGDSVRVYQQNLWEIKLDVQTGTLESWKVKGVPLIIKGIIPSFWRAPTENDKGGGSCSYLSVWKAAHIDNLSFTAERCSILSTTEHYVK
  IAVIFLGVRSDDRQASNSDLEKSNVLIQADMTYTIFGSGDVLVNCNVQPSPNLPPLPRVGVKFHLDKSMDRVKWYGRGPFECYPDRKAAAHV
Full Sequence
Protein Sequence     Length: 1113
MAALASKLLM PSDNGYRVWE DQTFIKWRKR DSHVPLRCQD SVEGCLKYWQ DRTKVDLLVS    60
NSAVWNDDAV QSALDSAAFW VKDLPFIKSL SGYWKFYLAA TPTSVPHNFH ATVFEDSQWA    120
NLPVPSNWQM HGFDRPIYTN VVYPFPLDPP HVPEDNPTGC YRTYFHLPEE WKGRRILLHF    180
EAVDSAFFAW INGSLVGYSQ DSRLPAEFEI TEYCHPCGSQ SKNVLAVQVL KWSDGSYLED    240
QDQWWLSGIH RDVILLSKPQ VFIGDYFFKS HVGEDFSYAD IQVEVKIDSS LEGRKENFLN    300
NFKLEAVLFD SGSWDNHDGN IDLLSSNMAN VKLSLLSVTT LGFHGYVLGG RLQKPKLWSA    360
EQPHLYTLIV LLKDSSDQIV DCESCLVGIR SITKGPKQLL VNGRPVVIRG VNRHEHHPRL    420
GKTNIEACMD LVLMKQHNIN AVRNSHYPQH SRWYELCDLF GMYMVDEANI ETHGFDFSGH    480
VKHPTLQPSW AAAMLDRVIG MVERDKNHAC IIVWSLGNES GYGPNHSALA GWIRGKDSSR    540
VLHYEGGGSR TSSTDIICPM YMRVWDIVNI ANDPNETRPL ILCEYSHSMG NSTGNLHKYW    600
EAIDNTFGLQ GGFIWDWVDQ ALLKEVGNGR KRWAYGGEFG DIPNDSTFCL NGVTWPDRTP    660
HPALHEVKYL HQAIKISSKD GTLEVLNGHF FSTTEDLEFS WSIYGDGLEL GNGILSLPVI    720
GPRGSYNIEW QSSPWYDLWA SSSALEFFLT ISVKLLHSTR WAEAGHIVSL SQVQLPMKRE    780
FFPHSIKNGS STLVNEILGD SVRVYQQNLW EIKLDVQTGT LESWKVKGVP LIIKGIIPSF    840
WRAPTENDKG GGSCSYLSVW KAAHIDNLSF TAERCSILST TEHYVKIAVI FLGVRSDDRQ    900
ASNSDLEKSN VLIQADMTYT IFGSGDVLVN CNVQPSPNLP PLPRVGVKFH LDKSMDRVKW    960
YGRGPFECYP DRKAAAHVGV YEKNVSEMHV PYIVPGESSG RTDVRWVTFE NKDGVGIYAS    1020
IYGSSPPMQM RASYYSTAEL ERAVHNDDLV EGDDIEVNLD HKHMGVGGDD SWSPCVHEEY    1080
LLPPVPYSFS IRFCPVTPST SGYDAYRSQL LL* 
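
The sequence block above is formatted in groups of ten residues with a running position counter at the end of each line, so it has to be cleaned before domain coordinates can be applied to it. The following is a minimal sketch, assuming that reported start and end positions are 1-based and inclusive; the function names `clean_sequence` and `slice_domain` are hypothetical helpers, not part of any database tooling. It uses only the first two lines of the block and checks that position 87, the reported start of the GH2 signature domain, lands on the residues that open the signature domain sequence.

```python
import re


def clean_sequence(block: str) -> str:
    """Drop spaces, line breaks, position counters, and a terminal '*'."""
    # Anything that is not a letter (digits, whitespace, '*') is formatting.
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First two lines of the Protein Sequence block, copied verbatim.
block = """
MAALASKLLM PSDNGYRVWE DQTFIKWRKR DSHVPLRCQD SVEGCLKYWQ DRTKVDLLVS    60
NSAVWNDDAV QSALDSAAFW VKDLPFIKSL SGYWKFYLAA TPTSVPHNFH ATVFEDSQWA    120
"""

seq = clean_sequence(block)
print(len(seq))                   # 120
print(slice_domain(seq, 87, 96))  # IKSLSGYWKF
```

Passing the whole sequence block and the coordinates 87 and 978 to the same two functions would give the full signature domain region under the same assumption about coordinate conventions.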
Functional Domains
Cdd ID      Domain           E-Value   Start  End   Length
smart01038  Bgal_small_N     1.0e-94   810    1093  286
pfam02836   Glyco_hydro_2_C  2.0e-104  396    676   293
COG3250     LacZ             1.0e-156  87     976   900
PRK09525    lacZ             0         17     1095  1123
PRK10340    ebgA             0         88     1096  1026
Gene Ontology
GO Term     Description
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565  beta-galactosidase activity
GO:0005975  carbohydrate metabolic process
GO:0009341  beta-galactosidase complex
Annotations - NR
Source  Hit ID          E-Value  Query Start  Query End  Hit Start  Hit End  Description
RefSeq  XP_002266400.1  0        1            1110       1          1112     PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeq  XP_002266434.1  0        1            1110       1          1113     PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
RefSeq  XP_002299206.1  0        1            1110       1          1108     predicted protein [Populus trichocarpa]
RefSeq  XP_002303929.1  0        1            1111       1          1112     predicted protein [Populus trichocarpa]
RefSeq  XP_002513059.1  0        1            1109       1          1107     beta-galactosidase, putative [Ricinus communis]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     3muy_4  0        87           1094       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_3  0        87           1094       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_2  0        87           1094       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_1  0        87           1094       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     1f4h_D  0        87           1094       49         1019     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
Hydropathy
EST
Hit       Length  Start  End  EValue
HO804274  390     129    513  0
ES794486  405     449    853  0
EL444301  298     436    733  0
EG397069  272     401    670  0
HO804274  50      499    547  0.0000000008
Orthologous Group
Species                     ID
Aquilegia coerulea          Aquca_014_00605.1, Aquca_014_00605.2
Arabidopsis lyrata          485844
Arabidopsis thaliana        AT3G54440.1, AT3G54440.2, AT3G54440.3
Brachypodium distachyon     Bradi2g61010.1
Brassica rapa               Bra014816
Carica papaya               evm.model.supercontig_540.3
Capsella rubella            Carubv10019396m
Chlamydomonas reinhardtii   Cre08.g379450.t1.3
Citrus clementina           Ciclev10030068m, Ciclev10027821m
Citrus sinensis             orange1.1g004315m, orange1.1g004363m, orange1.1g006933m, orange1.1g023943m, orange1.1g023667m, orange1.1g022918m, orange1.1g007038m
Cucumis sativus             Cucsa.307940.11, Cucsa.307940.1
Eucalyptus grandis          Eucgr.J01181.1
Fragaria vesca              mrna14286.1-v1.0-hybrid
Glycine max                 Glyma13g26700.1, Glyma15g37670.1
Gossypium raimondii         Gorai.010G216500.1
Linum usitatissimum         Lus10024155, Lus10039520, Lus10039519
Manihot esculenta           cassava4.1_002723m
Medicago truncatula         Medtr8g039160.1
Mimulus guttatus            mgv1a000801m
Oryza sativa                LOC_Os01g72340.1
Panicum virgatum            Pavirv00010284m, Pavirv00069653m
Physcomitrella patens       Pp1s130_43V6.1
Phaseolus vulgaris          Phvul.011G206800.1
Picea abies                 MA_10429668g0010
Populus trichocarpa         Potri.003G196500.1, Potri.003G196500.2, Potri.001G027400.1
Prunus persica              ppa000508m, ppa000532m
Ricinus communis            30131.m007196
Setaria italica             Si000118m, Si000119m
Selaginella moellendorffii  181800
Sorghum bicolor             Sb03g046050.2, Sb03g046050.1
Thellungiella halophila     Thhalv10010080m
Vitis vinifera              GSVIVT01013489001
Sequence Alignments
Phylogeny