CAZyme Information

Basic Information
SpeciesMimulus guttatus
Cazyme IDmgv1a000801m
FamilyGH2
Protein PropertiesLength: 983 Molecular Weight: 110228 Isoelectric Point: 5.7509
ChromosomeChromosome/Scaffold: 173 Start: 350852 End: 357636
Descriptionglycoside hydrolase family 2 protein
View CDS
External Links
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH2228460
  KVPEDNPTGCYRTYFHLPKEWEGRRIFLHFEAVDSAFFAWVNGHPTGYSQDSRLPAEFEITEFCHPFGSDKSNCLAVQVMRWSDGSYLEDQDHWWLSGIH
  RDVLLLSKPKVFIADYFFTSNLSEDFSSADIQVEVKIDHSALNIDNNSVITGSWFKAAEDKFIANFTIQAQIFDTDGKTSLALLELTNSVDYILGFIGYQ
  LKGKLLMPKLWSAEQPNLYTLVLTLKDSSGNIVDVESCQVGIRQITKATKQLLVNGQPVMIRGVNRHEHHPRIGKTNLESCMVQDLVLMKQNNINAVRNS
  HYPQHQRWYELCDLFGMYMIDEANIETHGFHLSSNVRHPTSETMWAPSMLDRVIGMVERDKNHASIISWSLGNESSYGPNHWALAGWVRGKDSTRFLHYE
  GGGARTSSTDIVCPMYMRVWDIVKIAEDPSELRPLILCEYSHSMGNSNGNIHEYWEAIDSTFGLQGGFIWDWVDQGLLKESADGTKHWAYGGDFGDFPND
  LNFCLNGLIWPDRTPHPALHEVKYVYQPIKVSLKEGIIKITNTHFFDTTEALSFDWIIHGDGIDLGSGLLSLPAIVPQKSYDVKWDAGPWYDLWCTSDAA
  EIFLTITAKLLGSTRWAEKGHIVSSTQVSLPIKNEAVPHVIKGGDAALLTEILDDSIHVKNTNMWEIKFSKKTGGIESWKVDGVLVMNKGILPCFWRAPT
  DNDKGGEAESYLSKWKAANLNNLNFTTSSCTVQNVSDNLVKISVAYLGTPGGAETKSPLFNVDLTYSIYNSGDVIVECHVKPNSELPPLPRVGIEFHLDK
  SMDQITWYGRGPFECYPDRKAAAHV
Full Sequence
Protein Sequence     Length: 983     Download
MHGFDKPIYT NIVYPFPLNP PKVPEDNPTG CYRTYFHLPK EWEGRRIFLH FEAVDSAFFA    60
WVNGHPTGYS QDSRLPAEFE ITEFCHPFGS DKSNCLAVQV MRWSDGSYLE DQDHWWLSGI    120
HRDVLLLSKP KVFIADYFFT SNLSEDFSSA DIQVEVKIDH SALNIDNNSV ITGSWFKAAE    180
DKFIANFTIQ AQIFDTDGKT SLALLELTNS VDYILGFIGY QLKGKLLMPK LWSAEQPNLY    240
TLVLTLKDSS GNIVDVESCQ VGIRQITKAT KQLLVNGQPV MIRGVNRHEH HPRIGKTNLE    300
SCMVQDLVLM KQNNINAVRN SHYPQHQRWY ELCDLFGMYM IDEANIETHG FHLSSNVRHP    360
TSETMWAPSM LDRVIGMVER DKNHASIISW SLGNESSYGP NHWALAGWVR GKDSTRFLHY    420
EGGGARTSST DIVCPMYMRV WDIVKIAEDP SELRPLILCE YSHSMGNSNG NIHEYWEAID    480
STFGLQGGFI WDWVDQGLLK ESADGTKHWA YGGDFGDFPN DLNFCLNGLI WPDRTPHPAL    540
HEVKYVYQPI KVSLKEGIIK ITNTHFFDTT EALSFDWIIH GDGIDLGSGL LSLPAIVPQK    600
SYDVKWDAGP WYDLWCTSDA AEIFLTITAK LLGSTRWAEK GHIVSSTQVS LPIKNEAVPH    660
VIKGGDAALL TEILDDSIHV KNTNMWEIKF SKKTGGIESW KVDGVLVMNK GILPCFWRAP    720
TDNDKGGEAE SYLSKWKAAN LNNLNFTTSS CTVQNVSDNL VKISVAYLGT PGGAETKSPL    780
FNVDLTYSIY NSGDVIVECH VKPNSELPPL PRVGIEFHLD KSMDQITWYG RGPFECYPDR    840
KAAAHVGVYE QDAGSMHVPY IVPGECSGRA DVRWATFRDK GGFGIYASAY GGSPPMQMSA    900
SYHSTAELER ATHNEELVKG DNIEVHFDHK HMGVGGDDSW SPCVHDKYLV PAVPYTFTVR    960
LSPLTASTLS GHSIYKSQLD EN* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
smart01038Bgal_small_N6.0e-100686961277+
pfam02836Glyco_hydro_2_C8.0e-108271551291+
COG3250LacZ4.0e-1471844848+
PRK09525lacZ01963999+
PRK10340ebgA01973985+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565beta-galactosidase activity
GO:0005975carbohydrate metabolic process
GO:0009341beta-galactosidase complex
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
RefSeqXP_002266400.1019791301112PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeqXP_002266434.1019791301113PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
RefSeqXP_002299206.1019791301108predicted protein [Populus trichocarpa]
RefSeqXP_002303929.1019791301111predicted protein [Populus trichocarpa]
RefSeqXP_002513059.1019781301107beta-galactosidase, putative [Ricinus communis]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB3muy_4019599210181 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_3019599210181 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_2019599210181 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_1019599210181 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB1jz2_D019599210181 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
HO80427439813840
GR1410752407059440
EL4443012973126080
EY7234412743045770
HO804274503754230.00000003
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_014_00605.1Aquca_014_00605.2
Arabidopsis lyrata485844
Arabidopsis thalianaAT3G54440.1AT3G54440.2AT3G54440.3
Brachypodium distachyonBradi2g61010.1
Brassica rapaBra014816
Carica papayaevm.model.supercontig_540.3
Capsella rubellaCarubv10019396m
Chlamydomonas reinhardtiiCre08.g379450.t1.3
Citrus clementinaCiclev10030068mCiclev10027821m
Citrus sinensisorange1.1g004315morange1.1g004363morange1.1g006933morange1.1g023943morange1.1g023667m
orange1.1g022918morange1.1g007038m
Cucumis sativusCucsa.307940.11Cucsa.307940.1Cucsa.307940.12
Eucalyptus grandisEucgr.J01181.1
Fragaria vescamrna14286.1-v1.0-hybrid
Glycine maxGlyma13g26700.1Glyma15g37670.1
Gossypium raimondiiGorai.010G216500.1
Linum usitatissimumLus10024155Lus10039520Lus10039519
Manihot esculentacassava4.1_002723m
Medicago truncatulaMedtr8g039160.1
Oryza sativaLOC_Os01g72340.1
Panicum virgatumPavirv00010284mPavirv00069653m
Physcomitrella patensPp1s130_43V6.1
Phaseolus vulgarisPhvul.011G206800.1
Picea abiesMA_10429668g0010
Populus trichocarpaPotri.003G196500.1Potri.003G196500.2Potri.001G027400.1
Prunus persicappa000508mppa000532m
Ricinus communis30131.m007196
Setaria italicaSi000118mSi000119m
Selaginella moellendorffii181800
Sorghum bicolorSb03g046050.2Sb03g046050.1
Thellungiella halophilaThhalv10010080m
Vitis viniferaGSVIVT01013489001
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny