CAZyme Information

Basic Information
Species: Populus trichocarpa
CAZyme ID: Potri.001G027400.1
Family: GH2
Protein Properties: Length: 1111; Molecular Weight: 125076; Isoelectric Point: 5.5139
Chromosome: Chromosome/Scaffold: 01; Start: 2040981; End: 2050196
Description: glycoside hydrolase family 2 protein
External Links
NCBI Taxonomy
Plaza
CAZyDB
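The Protein Properties entry lists a molecular weight but the record does not say how it was computed. As a rough illustration only, the sketch below sums standard average residue masses and adds one water molecule for the free termini. The function name `average_mass` and the mass table are assumptions introduced here, not part of the database, and the result may not reproduce the listed value exactly.

```python
# Average residue masses in daltons for the 20 standard amino acids.
# These are textbook values; the database's own method is unknown (assumption).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one H2O accounts for the free N- and C-termini


def average_mass(seq: str) -> float:
    """Return the average molecular mass of an unmodified peptide chain.

    A trailing '*' (stop marker) is ignored; any non-standard residue
    letter raises KeyError.
    """
    seq = seq.strip().rstrip("*").upper()
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER
```

Calling `average_mass` on the cleaned full-length sequence gives an estimate to compare against the listed value; a small difference would be expected if the database used a different mass table or counted the stop symbol.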
Signature Domain
Family  Start  End  Evalue
GH2     86     976  0
  FVQSLSGLWKFFLAPDPTSVPNKFYGTAFEDSEWETLPVPSNWEMHGYDRPIYTNVIYPFPVDPPHVPDDNPTGCYRTYFDIPEEWQGRRILLHFEAVDS
  AFCAWINGVPVGYSQDSRLPAEFEITDYCHPCGSGKKNVLAVQVFRWSDGSYLEDQDHWWLSGVHRDVLLLSKPQVFIADYFFKSNLAENFTCADIQVEV
  KIESSLAIPKEKILANFTIEAALYDTGSWYDSEESANLLSSNVANLKLTHSPMGLLGFLGNVLEGKLEMPKLWSAEQPNLYILVLSLKDATGQVVDCESC
  LVGIRQVSKAPKQLLVNGHPVILRGVNRHEHHPRVGKTNIESCMIKDLVLMKQNNMNAVRNSHYPQHHRWYELCDLFGMYMIDEANIETHGFYLCEHLKH
  PTQEQSWAAAMMDRVISMVERDKNHACIISWSLGNEASYGPNHSAAAGWIREKDTSRLVHYEGGGSRTTSTDIVCPMYMRVWDIVKIAKDPAESRPLILC
  EYSHAMGNSNGNIHEYWEAINSTFGLQGGFIWDWVDQGLLKDSGDGTKHWAYGGDFGDTPNDLNFCLNGLTWPDRTPHPALHEVKYVYQPIKVSLEESRI
  KITSTHFFQTTQGLEFSWATQGDGYEIGSGILSLPLIEPQSSYELEWESGPWYPLLASSFAEEIFLTITTTLLHSTRWVEAGHVVSSSQVQLPTTRKILP
  HVIKTTDAKVLIETLGDIVRVSLPSFWEITWNIQTGSVESWKVGGVPVMNKGIFPCFWRAPTDNDKGGEKKSYYSRWKEARIDSIVYHTKSCSVKSTAND
  IVKIEVVYVGAPSCEEGSSSHSNAVFTVNMIYTIYSSGDLIIECNVIPSSELPPLPRVGVELHLEKSVDQIKWYGRGPFECYPDRKAAAHV
Full Sequence
Protein Sequence     Length: 1111
MTSLVAQVVS PVETGHKVWQ DQSFIKWRKR DPHVTLHFHE SVEGSLRYWY QRNKVDHLVS    60
NSAVWNDDAV QGALDCAAFW VKDLPFVQSL SGLWKFFLAP DPTSVPNKFY GTAFEDSEWE    120
TLPVPSNWEM HGYDRPIYTN VIYPFPVDPP HVPDDNPTGC YRTYFDIPEE WQGRRILLHF    180
EAVDSAFCAW INGVPVGYSQ DSRLPAEFEI TDYCHPCGSG KKNVLAVQVF RWSDGSYLED    240
QDHWWLSGVH RDVLLLSKPQ VFIADYFFKS NLAENFTCAD IQVEVKIESS LAIPKEKILA    300
NFTIEAALYD TGSWYDSEES ANLLSSNVAN LKLTHSPMGL LGFLGNVLEG KLEMPKLWSA    360
EQPNLYILVL SLKDATGQVV DCESCLVGIR QVSKAPKQLL VNGHPVILRG VNRHEHHPRV    420
GKTNIESCMI KDLVLMKQNN MNAVRNSHYP QHHRWYELCD LFGMYMIDEA NIETHGFYLC    480
EHLKHPTQEQ SWAAAMMDRV ISMVERDKNH ACIISWSLGN EASYGPNHSA AAGWIREKDT    540
SRLVHYEGGG SRTTSTDIVC PMYMRVWDIV KIAKDPAESR PLILCEYSHA MGNSNGNIHE    600
YWEAINSTFG LQGGFIWDWV DQGLLKDSGD GTKHWAYGGD FGDTPNDLNF CLNGLTWPDR    660
TPHPALHEVK YVYQPIKVSL EESRIKITST HFFQTTQGLE FSWATQGDGY EIGSGILSLP    720
LIEPQSSYEL EWESGPWYPL LASSFAEEIF LTITTTLLHS TRWVEAGHVV SSSQVQLPTT    780
RKILPHVIKT TDAKVLIETL GDIVRVSLPS FWEITWNIQT GSVESWKVGG VPVMNKGIFP    840
CFWRAPTDND KGGEKKSYYS RWKEARIDSI VYHTKSCSVK STANDIVKIE VVYVGAPSCE    900
EGSSSHSNAV FTVNMIYTIY SSGDLIIECN VIPSSELPPL PRVGVELHLE KSVDQIKWYG    960
RGPFECYPDR KAAAHVGVYE QNASDMHVPY IVPGECSGRA DVRWVTFQNK DGVGIFASTY    1020
GSSPPMQMSA SYYSTAELDR ATHNEELAQG NDIEVHLDHK HMGVGGDDSW SPCVHDNYLV    1080
PAVPYSYSIR LCPITAATSG LEIYKSQLPN * 
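The sequence block above is grouped in tens with a running position counter at the end of each line and a closing `*`. A minimal sketch, assuming that layout, of how one might clean it and cut out a region by 1-based inclusive coordinates (for example the GH2 signature domain at 86 to 976). The helper names `clean_sequence` and `slice_domain` are hypothetical.

```python
import re


def clean_sequence(block: str) -> str:
    """Drop spaces, newlines, position counters, and a trailing '*' stop marker."""
    letters = re.sub(r"[^A-Za-z*]", "", block)
    return letters.rstrip("*").upper()


def slice_domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# Toy input in the same layout as the record (first 30 residues only).
sample = "MTSLVAQVVS PVETGHKVWQ    20\nDQSFIKWRKR *"
seq = clean_sequence(sample)
second_group = slice_domain(seq, 11, 20)
```

Applied to the full block, `slice_domain(seq, 86, 976)` should return the same residues shown under Signature Domain, which is a quick consistency check on the coordinates.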
Functional Domains
Cdd ID      Domain           E-Value   Start  End   Length
smart01038  Bgal_small_N     9.0e-96   812    1091  281
pfam02836   Glyco_hydro_2_C  3.0e-105  396    677   292
COG3250     LacZ             4.0e-148  87     974   901
PRK09525    lacZ             0         85     1093  1035
PRK10340    ebgA             0         88     1097  1018
Gene Ontology
GO Term     Description
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565  beta-galactosidase activity
GO:0005975  carbohydrate metabolic process
GO:0009341  beta-galactosidase complex
Annotations - NR
Source  Hit ID          E-Value  Query Start  Query End  Hit Start  Hit End  Description
RefSeq  XP_002266400.1  0        1            1110       1          1114     PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeq  XP_002266434.1  0        1            1110       1          1115     PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
RefSeq  XP_002299206.1  0        1            1110       1          1110     predicted protein [Populus trichocarpa]
RefSeq  XP_002303929.1  0        1            1110       1          1113     predicted protein [Populus trichocarpa]
RefSeq  XP_002513059.1  0        1            1110       1          1110     beta-galactosidase, putative [Ricinus communis]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     3muy_4  0        87           1092       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_3  0        87           1092       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_2  0        87           1092       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_1  0        87           1092       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     1f4h_D  0        87           1092       49         1019     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
Metabolic Pathways
Pathway Name             Reaction            EC           Protein Name
lactose degradation III  BETAGALACTOSID-RXN  EC-3.2.1.23  β-galactosidase
Hydropathy
EST
Hit       Length  Start  End  EValue
HO804274  385     129    510  0
ES794486  300     451    750  0
EL444301  297     438    734  0
HO804274  50      501    549  0.0000006
ES794486  44      759    802  0.004
Orthologous Group
Species                     ID
Aquilegia coerulea          Aquca_014_00605.1, Aquca_014_00605.2
Arabidopsis lyrata          485844
Arabidopsis thaliana        AT3G54440.1, AT3G54440.2, AT3G54440.3
Brachypodium distachyon     Bradi2g61010.1
Brassica rapa               Bra014816
Carica papaya               evm.model.supercontig_540.3
Capsella rubella            Carubv10019396m
Chlamydomonas reinhardtii   Cre08.g379450.t1.3
Citrus clementina           Ciclev10030068m, Ciclev10027821m
Citrus sinensis             orange1.1g004315m, orange1.1g004363m, orange1.1g006933m, orange1.1g023943m, orange1.1g023667m, orange1.1g022918m, orange1.1g007038m
Cucumis sativus             Cucsa.307940.11, Cucsa.307940.1, Cucsa.307940.12
Eucalyptus grandis          Eucgr.J01181.1
Fragaria vesca              mrna14286.1-v1.0-hybrid
Glycine max                 Glyma13g26700.1, Glyma15g37670.1
Gossypium raimondii         Gorai.010G216500.1
Linum usitatissimum         Lus10024155, Lus10039520, Lus10039519
Manihot esculenta           cassava4.1_002723m
Medicago truncatula         Medtr8g039160.1
Mimulus guttatus            mgv1a000801m
Oryza sativa                LOC_Os01g72340.1
Panicum virgatum            Pavirv00010284m, Pavirv00069653m
Physcomitrella patens       Pp1s130_43V6.1
Phaseolus vulgaris          Phvul.011G206800.1
Picea abies                 MA_10429668g0010
Populus trichocarpa         Potri.003G196500.1, Potri.003G196500.2
Prunus persica              ppa000508m, ppa000532m
Ricinus communis            30131.m007196
Setaria italica             Si000118m, Si000119m
Selaginella moellendorffii  181800
Sorghum bicolor             Sb03g046050.2, Sb03g046050.1
Thellungiella halophila     Thhalv10010080m
Vitis vinifera              GSVIVT01013489001
Sequence Alignments
Phylogeny