CAZyme Information

Basic Information
Species: Prunus persica
CAZyme ID: ppa000508m
Family: GH2
Protein Properties: Length: 1122; Molecular Weight: 127697; Isoelectric Point: 6.4537
Chromosome: Chromosome/Scaffold: 2; Start: 2104602; End: 2115599
Description: glycoside hydrolase family 2 protein
External Links
NCBI Taxonomy
CAZyDB
Signature Domain
Family  Start  End  Evalue
GH2     88     987  0
  VKSLSGYWKFFLASSPRNVPVNFYDTAFQDSEWETLPVPSNWQMHGFDRPIYTNVVYPFPLDPPVVPVDNPTGCYRTYFHIPKEWKGRRILLHFEAVDSA
  FCAWLNGVPIGYSQDSRLPAEFEITDYCYPSDMDKKNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQVFIADYFFKSTLAEDFSYADIQVEVK
  IDNSRETSKDSVLANYVIEAALFDTACWYSIDGYGDLHLSYVASIKLNLSSSTSLGFHGYLLVGRLDMPRLWSAEQPSLYALAVTLKDASGNLLDCESSL
  VGIRQVSKAPKQLLVNGHPIIIRGVNRHEHHPRLGKTNIESCMVKDLVLMKQYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIGTHGFDLSDHVKHP
  TLEPSWATAMMDRVIGMVERDKNHACIISWSLGNEAGYGPNHSALAGTFRKCYYFVLVRELLDPSRLVHYEGGGSRTSSTDIVCPMYMRVWDMMKISRDP
  NETRPLILCEYSHAMGNSNGNLHEYWERIDSTFGLQGGFIWDWVDQALLKDNADGSKHWAYGGDFGDVPNDLNFCLNGLIWPDRTPHPALHEVKYVYQPI
  KVSFSKETLRITNTHFYKTTQGLEFSWDVHGDGCKLGSGILPFPLIEPQKSYDIKWRLALWYPLWTSSSAEEYFLTITAKLLRSTRWVEAGHVISSTQVQ
  LPSKREIVPHVIKTEDATFVSETLGDKIRVSRHSFWEIILSVQTGTVDSWTVEGVPLMTKGIFPCFWRASTDNDKGGGASSYFSLWKAAHIDNLHHITQS
  CSIQNKTDHLVKIVVAFHGVPKSEDALYKRKKIKIEVDVIYTIYGSGDVVVECNVRPSSNLRLLPRVGVEFHLDKSMDQIKWYGRGPFECYPDRKAAAHV
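
The signature domain above spans positions 88 to 987 of the protein, and these coordinates appear to be 1-based and inclusive: the domain sequence begins with the residues found at position 88 of the full sequence. A minimal Python sketch of that convention follows, using only the first 120 residues copied from the Full Sequence section. The helper name `slice_1based` is hypothetical and is not part of any database tool.

```python
# First 120 residues of ppa000508m, copied from the Full Sequence section.
PREFIX = (
    "MASSLPGLFVFLLENGHHVWEDQSLIKWRKRDAHVPLRCHDSIEGSLKYLYERNKVNFLV"
    "SNSAVWDDDAVPGALDSAALWVKDLPFVKSLSGYWKFFLASSPRNVPVNFYDTAFQDSEW"
)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]


# The GH2 signature domain is reported to start at position 88.
domain_start = slice_1based(PREFIX, 88, 98)
print(domain_start)  # VKSLSGYWKFF

# Under the same convention the domain covers End - Start + 1 residues.
domain_length = 987 - 88 + 1
print(domain_length)  # 900
```

The printed prefix matches the first residues of the signature domain listing, and the computed length of 900 matches its nine lines of 100 residues.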
Full Sequence
Protein Sequence     Length: 1122
MASSLPGLFV FLLENGHHVW EDQSLIKWRK RDAHVPLRCH DSIEGSLKYL YERNKVNFLV    60
SNSAVWDDDA VPGALDSAAL WVKDLPFVKS LSGYWKFFLA SSPRNVPVNF YDTAFQDSEW    120
ETLPVPSNWQ MHGFDRPIYT NVVYPFPLDP PVVPVDNPTG CYRTYFHIPK EWKGRRILLH    180
FEAVDSAFCA WLNGVPIGYS QDSRLPAEFE ITDYCYPSDM DKKNVLAVQV FRWSDGSYLE    240
DQDHWWLSGI HRDVLLLSKP QVFIADYFFK STLAEDFSYA DIQVEVKIDN SRETSKDSVL    300
ANYVIEAALF DTACWYSIDG YGDLHLSYVA SIKLNLSSST SLGFHGYLLV GRLDMPRLWS    360
AEQPSLYALA VTLKDASGNL LDCESSLVGI RQVSKAPKQL LVNGHPIIIR GVNRHEHHPR    420
LGKTNIESCM VKDLVLMKQY NINAVRNSHY PQHPRWYELC DLFGMYMIDE ANIGTHGFDL    480
SDHVKHPTLE PSWATAMMDR VIGMVERDKN HACIISWSLG NEAGYGPNHS ALAGTFRKCY    540
YFVLVRELLD PSRLVHYEGG GSRTSSTDIV CPMYMRVWDM MKISRDPNET RPLILCEYSH    600
AMGNSNGNLH EYWERIDSTF GLQGGFIWDW VDQALLKDNA DGSKHWAYGG DFGDVPNDLN    660
FCLNGLIWPD RTPHPALHEV KYVYQPIKVS FSKETLRITN THFYKTTQGL EFSWDVHGDG    720
CKLGSGILPF PLIEPQKSYD IKWRLALWYP LWTSSSAEEY FLTITAKLLR STRWVEAGHV    780
ISSTQVQLPS KREIVPHVIK TEDATFVSET LGDKIRVSRH SFWEIILSVQ TGTVDSWTVE    840
GVPLMTKGIF PCFWRASTDN DKGGGASSYF SLWKAAHIDN LHHITQSCSI QNKTDHLVKI    900
VVAFHGVPKS EDALYKRKKI KIEVDVIYTI YGSGDVVVEC NVRPSSNLRL LPRVGVEFHL    960
DKSMDQIKWY GRGPFECYPD RKAAAHVAVY EQKVDDMHVP YIVPMECSGR ADVRWVTFQN    1020
KDGFGIYASV YGSSTPMQIN ASYYTTAELD RATHNEDLIK GDDIEVHLDH KHMGLGGDDS    1080
WSPCVQHEYR VHADPYSFSI RLCPITPATS GQVMYKTQLQ K* 
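
The listing above groups residues in blocks of ten and ends each full line with a running position counter. To use the sequence in other tools, the spaces and counters need to be stripped. The sketch below shows one way to do that in Python. The function name `parse_numbered_block` is hypothetical, and the sample holds only the first two lines of the listing. A trailing `*` is kept as it appears on the page.

```python
import re


def parse_numbered_block(text: str) -> str:
    """Join a numbered sequence listing into one string of residue letters."""
    residues = []
    for line in text.splitlines():
        for token in line.split():
            # Keep letter-only tokens (optionally ending in '*');
            # purely numeric tokens are position counters and are skipped.
            if re.fullmatch(r"[A-Za-z]+\*?", token):
                residues.append(token)
    return "".join(residues)


# First two lines of the listing, copied verbatim.
SAMPLE = """\
MASSLPGLFV FLLENGHHVW EDQSLIKWRK RDAHVPLRCH DSIEGSLKYL YERNKVNFLV    60
SNSAVWDDDA VPGALDSAAL WVKDLPFVKS LSGYWKFFLA SSPRNVPVNF YDTAFQDSEW    120
"""

sequence = parse_numbered_block(SAMPLE)
print(len(sequence))  # 120
```

Checking that the length after each line equals the printed counter (60, then 120) is a quick way to detect copy errors.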
Functional Domains
Cdd ID      Domain           E-Value   Start  End   Length
smart01038  Bgal_small_N     4.0e-92   823    1102  281
pfam02836   Glyco_hydro_2_C  8.0e-106  397    688   302
COG3250     LacZ             1.0e-148  88     985   909
PRK09525    lacZ             0         20     1104  1118
PRK10340    ebgA             0         89     1105  1021
Gene Ontology
GO Term     Description
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565  beta-galactosidase activity
GO:0005975  carbohydrate metabolic process
GO:0009341  beta-galactosidase complex
Annotations - NR
Source  Hit ID          E-Value  Query Start  Query End  Hit Start  Hit End  Description
RefSeq  XP_002266400.1  0        3            1121       2          1114     PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeq  XP_002266434.1  0        3            1121       2          1115     PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
RefSeq  XP_002299206.1  0        3            1119       2          1108     predicted protein [Populus trichocarpa]
RefSeq  XP_002303929.1  0        3            1119       2          1111     predicted protein [Populus trichocarpa]
RefSeq  XP_002513059.1  0        3            1120       2          1109     beta-galactosidase, putative [Ricinus communis]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     3muy_4  0        88           1103       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_3  0        88           1103       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_2  0        88           1103       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_1  0        88           1103       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     1jz2_D  0        88           1103       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
Hydropathy
EST
Hit       Length  Start  End  EValue
HO804274  385     130    511  0
ES794486  364     452    815  0
EL444301  306     439    744  0
FC864634  241     649    889  0
HO804274  60      502    560  0.005
Orthologous Group
Species                     ID
Aquilegia coerulea          Aquca_014_00605.1, Aquca_014_00605.2
Arabidopsis lyrata          485844
Arabidopsis thaliana        AT3G54440.1, AT3G54440.2, AT3G54440.3
Brachypodium distachyon     Bradi2g61010.1
Brassica rapa               Bra014816
Carica papaya               evm.model.supercontig_540.3
Capsella rubella            Carubv10019396m
Chlamydomonas reinhardtii   Cre08.g379450.t1.3
Citrus clementina           Ciclev10030068m, Ciclev10027821m
Citrus sinensis             orange1.1g004315m, orange1.1g004363m, orange1.1g006933m, orange1.1g023943m, orange1.1g023667m, orange1.1g022918m, orange1.1g007038m
Cucumis sativus             Cucsa.307940.11, Cucsa.307940.1, Cucsa.307940.12
Eucalyptus grandis          Eucgr.J01181.1
Fragaria vesca              mrna14286.1-v1.0-hybrid
Glycine max                 Glyma13g26700.1, Glyma15g37670.1
Gossypium raimondii         Gorai.010G216500.1
Linum usitatissimum         Lus10024155, Lus10039520, Lus10039519
Manihot esculenta           cassava4.1_002723m
Medicago truncatula         Medtr8g039160.1
Mimulus guttatus            mgv1a000801m
Oryza sativa                LOC_Os01g72340.1
Panicum virgatum            Pavirv00010284m, Pavirv00069653m
Physcomitrella patens       Pp1s130_43V6.1
Phaseolus vulgaris          Phvul.011G206800.1
Picea abies                 MA_10429668g0010
Populus trichocarpa         Potri.003G196500.1, Potri.003G196500.2, Potri.001G027400.1
Prunus persica              ppa000532m
Ricinus communis            30131.m007196
Setaria italica             Si000118m, Si000119m
Selaginella moellendorffii  181800
Sorghum bicolor             Sb03g046050.2, Sb03g046050.1
Thellungiella halophila     Thhalv10010080m
Vitis vinifera              GSVIVT01013489001
Sequence Alignments
Phylogeny