CAZyme Information

Basic Information
SpeciesCarica papaya
Cazyme IDevm.model.supercontig_540.3
FamilyGH2
Protein PropertiesLength: 931 Molecular Weight: 104645 Isoelectric Point: 5.7424
ChromosomeChromosome/Scaffold: 540 Start: 10622 End: 27959
Descriptionglycoside hydrolase family 2 protein
View CDS
External Links
NCBI Taxonomy
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH2157960
  SQDSRLPAEFEISDYCYQCDSGKKNVLAVQVYRWSDGTYLEDQDHWWLSGIHRDVLLLAKPQVFVADYFFTSKLAEDYSSADIQVEVKIDNSKEISKDRV
  LDNFIIEAEIYDTASWYDQEGYMDLLSSKAANIRLNPSPTRLLGFHGYVLSGKLETPRLWSAEQPNLYVLVIILKDASGHIVDCESCLAGIRQVSKATKQ
  LLVNGQPVIIRGVNRHEHHPRLGKTNIESCLVKDLVIMKQNNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHPTKEPSWAAAMLD
  RVVGMVERDKNHTCIICWSLGNESSYGPTHSAMAGWIRGKDPSRLVHYEGGGSRTQSTDIVCPMYMRVWDIVKIANDPNEARPLILCEYSHAMGNSSGNI
  FEYWEAIDSTFGLQGGFIWDWVDQGLLKDNADGSKHWAYGGDFGDTPNDLNFCLNGLLWPDRTPHPALHEVKHVYQPIKVSLRGGTLKITNTNFFETTQG
  LEFSWAIHGDGYQLGSGILSLPLIKPQGSFDVELDSGPWCSVWTSSTAEECFITITSKLLHSTSWVEHGHTISSTQVQLPAKRERIPHVIKDKGTTISSE
  TYEDKIKLFSQQNLWEIQFNVQTGALESWKVQGVTVMENGILPCFWRAPTDNDKGGCARSYYSRWKDAHMDKLVFLTESCSIQDKKDELVKIAVVYLGVT
  KGEDGSFTESERSSVPFKVDMVYTIYGSGDIIMECDVNPSSGLPPLPRVGVEFHIEKSLDQITWYGKGPFECYPDRKAAAHV
Full Sequence
Protein Sequence     Length: 931     Download
MVRPGAFGWI SRAVSQDSRL PAEFEISDYC YQCDSGKKNV LAVQVYRWSD GTYLEDQDHW    60
WLSGIHRDVL LLAKPQVFVA DYFFTSKLAE DYSSADIQVE VKIDNSKEIS KDRVLDNFII    120
EAEIYDTASW YDQEGYMDLL SSKAANIRLN PSPTRLLGFH GYVLSGKLET PRLWSAEQPN    180
LYVLVIILKD ASGHIVDCES CLAGIRQVSK ATKQLLVNGQ PVIIRGVNRH EHHPRLGKTN    240
IESCLVKDLV IMKQNNINAV RNSHYPQHPR WYELCDLFGM YMIDEANIET HGFDLSGHLK    300
HPTKEPSWAA AMLDRVVGMV ERDKNHTCII CWSLGNESSY GPTHSAMAGW IRGKDPSRLV    360
HYEGGGSRTQ STDIVCPMYM RVWDIVKIAN DPNEARPLIL CEYSHAMGNS SGNIFEYWEA    420
IDSTFGLQGG FIWDWVDQGL LKDNADGSKH WAYGGDFGDT PNDLNFCLNG LLWPDRTPHP    480
ALHEVKHVYQ PIKVSLRGGT LKITNTNFFE TTQGLEFSWA IHGDGYQLGS GILSLPLIKP    540
QGSFDVELDS GPWCSVWTSS TAEECFITIT SKLLHSTSWV EHGHTISSTQ VQLPAKRERI    600
PHVIKDKGTT ISSETYEDKI KLFSQQNLWE IQFNVQTGAL ESWKVQGVTV MENGILPCFW    660
RAPTDNDKGG CARSYYSRWK DAHMDKLVFL TESCSIQDKK DELVKIAVVY LGVTKGEDGS    720
FTESERSSVP FKVDMVYTIY GSGDIIMECD VNPSSGLPPL PRVGVEFHIE KSLDQITWYG    780
KGPFECYPDR KAAAHVGIYE QTVGDMHVPY TVPGECGGRA DVRWVTLRNN DGIGVYASIY    840
RSAPPMQLSA SYYTTAELDR ATHNEELIKG DNIEVHLDHK HMGLGGDDSW TPSVHDKYLI    900
PPVPYSFSMR FSPITGSNSG YDIYKSQVQN * 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
smart01038Bgal_small_N4.0e-104629911284+
pfam02836Glyco_hydro_2_C2.0e-113213493291+
COG3250LacZ2.0e-1249794795+
PRK09525lacZ015913925+
PRK10340ebgA015918908+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565beta-galactosidase activity
GO:0005975carbohydrate metabolic process
GO:0009341beta-galactosidase complex
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
RefSeqXP_002266400.1029301831114PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeqXP_002266434.1029301831115PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
RefSeqXP_002299206.10159301991110predicted protein [Populus trichocarpa]
RefSeqXP_002303929.10109301941113predicted protein [Populus trichocarpa]
RefSeqXP_002513059.10159301991110beta-galactosidase, putative [Ricinus communis]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB1f4h_D0169091611016A Chain A, Barley Limit Dextrinase In Complex With Beta-Cyclodextrin
PDB1f4h_C0169091611016A Chain A, Barley Limit Dextrinase In Complex With Beta-Cyclodextrin
PDB1f4h_B0169091611016A Chain A, Barley Limit Dextrinase In Complex With Beta-Cyclodextrin
PDB1f4h_A0169091611016A Chain A, Barley Limit Dextrinase In Complex With Beta-Cyclodextrin
PDB1f4a_D0169091611016A Chain A, E. Coli (Lacz) Beta-Galactosidase (Ncs Constrained Monomer- Orthorhombic)
Metabolic Pathways
Pathway NameReactionECProtein Name
lactose degradation IIIBETAGALACTOSID-RXNEC-3.2.1.23β-galactosidase
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
EX2760783305688940
EX2727303625368950
ES7944863712676370
EL4443013002515500
EY7234412872465320
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_014_00605.1Aquca_014_00605.2
Arabidopsis lyrata485844
Arabidopsis thalianaAT3G54440.1AT3G54440.2AT3G54440.3
Brachypodium distachyonBradi2g61010.1
Brassica rapaBra014816
Capsella rubellaCarubv10019396m
Chlamydomonas reinhardtiiCre08.g379450.t1.3
Citrus clementinaCiclev10030068mCiclev10027821m
Citrus sinensisorange1.1g004315morange1.1g004363morange1.1g006933morange1.1g023943morange1.1g023667m
orange1.1g022918morange1.1g007038m
Cucumis sativusCucsa.307940.11Cucsa.307940.1Cucsa.307940.12
Eucalyptus grandisEucgr.J01181.1
Fragaria vescamrna14286.1-v1.0-hybrid
Glycine maxGlyma13g26700.1Glyma15g37670.1
Gossypium raimondiiGorai.010G216500.1
Linum usitatissimumLus10024155Lus10039520Lus10039519
Manihot esculentacassava4.1_002723m
Medicago truncatulaMedtr8g039160.1
Mimulus guttatusmgv1a000801m
Oryza sativaLOC_Os01g72340.1
Panicum virgatumPavirv00010284mPavirv00069653m
Physcomitrella patensPp1s130_43V6.1
Phaseolus vulgarisPhvul.011G206800.1
Picea abiesMA_10429668g0010
Populus trichocarpaPotri.003G196500.1Potri.003G196500.2Potri.001G027400.1
Prunus persicappa000508mppa000532m
Ricinus communis30131.m007196
Setaria italicaSi000118mSi000119m
Selaginella moellendorffii181800
Sorghum bicolorSb03g046050.2Sb03g046050.1
Thellungiella halophilaThhalv10010080m
Vitis viniferaGSVIVT01013489001
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny