CAZyme Information

Basic Information
SpeciesPhyscomitrella patens
Cazyme IDPp1s130_43V6.1
FamilyGH2
Protein PropertiesLength: 1234 Molecular Weight: 138676 Isoelectric Point: 5.6216
ChromosomeChromosome/Scaffold: 130 Start: 251487 End: 258762
Descriptionglycoside hydrolase family 2 protein
View CDS
External Links
NCBI Taxonomy
Plaza
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH217210960
  EYVCSLAGLWKFHLACCPEEVPEQFSSVGFDDSSWGSLPVPSNWQVHGHDRPIYTNIVYPFPINPPFVPSENPTGCYRTSFRVPSDWTGRRLFLNFEAVD
  SAFYVWVNGAKIGYSQDSRLPSDWDITDCCHFGEENVLAVQVMRWSDGSYLEDQDHWWLSGIHRNVLIYSKPQVMLADYFVKTDVENDFLSATVKVEVTV
  EGPREMIANSKLCHYTTEAVLFEEFDFQGDFKMPSEAAHLQPQGLDSAMIGCHAHTILTAKLQGPKLWSAEHPNLYTLVVLLKDPSGAVIDCEACRVGVR
  KISTRPKELLVNGEPVVIRGVNRHEHHPRLGKTNIEACMIKDITLMKQHNINAVRNSHYPMHSRWYELCDLFGLYMVDEANLETHGFDPEPWAWPERQLT
  FDPKWANAFLQRMINMVERDKNHASIIFWSLGNEAGYGPNHQAMAGWTRGRDSTRLLHYEGGGSRTTSTDVVCPMYTRVWDIIKIAEDPSESRPVILCEY
  SHAMGNSNGNIQAYWDAIDGIHGLQGGFIWDWADQGLLKEGKDGVKYWAYGGDFGDVPHDLNFCLNGLIWPNRRPHPALEEVKHAYQPIGIFLKDGTIEI
  WNKHFFTPLDYVKFSWSLSADGSVLESGTLDLPAIEPTKKHYLKLNSGPWASRWKEAEANEIFLDITAYLSAPTRWADAGHVLASEQMELPVSKHAQRQV
  LSASSKPALSVEEAEWVLKVKPAGGEDWEIQFDKKKGLLSSWKVNGTCVLSNGPLPCFWRAPTDNDKGGSVLSYVSQWKANGLDTLTCTGCERFRVEKLS
  DSTLLLKAVIFMEPKSEEPPPPQVSESQTGDVDKDTEKSIKAQFAEMNEERARRDSSLGFKIKVQYIVFGDGNIVTSYDVEPPSRIPTLPRVGVQFNIDK
  ECSEVEWYGRGPFECYPDRKSAARV
Full Sequence
Protein Sequence     Length: 1234     Download
MSLPTILITG SAIDSVALAR KTSSSVLRGD FSYPCGQFLF RNRAGFGLDR CISAHVINCS    60
QAPGITNKEN DCVRAAAQNS TQPSEGGLNG PMAKDRKVPE VRRRDWEDPM TVEWNKRNAH    120
VPLHCHTTIV GALKFWQQRS HTDFRAAEEA VWEEEAVEAA LQSADSWIQG LEYVCSLAGL    180
WKFHLACCPE EVPEQFSSVG FDDSSWGSLP VPSNWQVHGH DRPIYTNIVY PFPINPPFVP    240
SENPTGCYRT SFRVPSDWTG RRLFLNFEAV DSAFYVWVNG AKIGYSQDSR LPSDWDITDC    300
CHFGEENVLA VQVMRWSDGS YLEDQDHWWL SGIHRNVLIY SKPQVMLADY FVKTDVENDF    360
LSATVKVEVT VEGPREMIAN SKLCHYTTEA VLFEEFDFQG DFKMPSEAAH LQPQGLDSAM    420
IGCHAHTILT AKLQGPKLWS AEHPNLYTLV VLLKDPSGAV IDCEACRVGV RKISTRPKEL    480
LVNGEPVVIR GVNRHEHHPR LGKTNIEACM IKDITLMKQH NINAVRNSHY PMHSRWYELC    540
DLFGLYMVDE ANLETHGFDP EPWAWPERQL TFDPKWANAF LQRMINMVER DKNHASIIFW    600
SLGNEAGYGP NHQAMAGWTR GRDSTRLLHY EGGGSRTTST DVVCPMYTRV WDIIKIAEDP    660
SESRPVILCE YSHAMGNSNG NIQAYWDAID GIHGLQGGFI WDWADQGLLK EGKDGVKYWA    720
YGGDFGDVPH DLNFCLNGLI WPNRRPHPAL EEVKHAYQPI GIFLKDGTIE IWNKHFFTPL    780
DYVKFSWSLS ADGSVLESGT LDLPAIEPTK KHYLKLNSGP WASRWKEAEA NEIFLDITAY    840
LSAPTRWADA GHVLASEQME LPVSKHAQRQ VLSASSKPAL SVEEAEWVLK VKPAGGEDWE    900
IQFDKKKGLL SSWKVNGTCV LSNGPLPCFW RAPTDNDKGG SVLSYVSQWK ANGLDTLTCT    960
GCERFRVEKL SDSTLLLKAV IFMEPKSEEP PPPQVSESQT GDVDKDTEKS IKAQFAEMNE    1020
ERARRDSSLG FKIKVQYIVF GDGNIVTSYD VEPPSRIPTL PRVGVQFNID KECSEVEWYG    1080
RGPFECYPDR KSAARVGTYS KEVKDLHVPY IVPGENGGRA DVRWVAFTSK TKGVGLLAIS    1140
GEDSPPMQMS ASFYTSQELD RATHEEELQQ GDKIEVHLDH KHMGIGGDDS WTPCVHPQYL    1200
LPPELYHFSI RFCPLIGPTS PLEISRNQLE NVS* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
smart01038Bgal_small_N4.0e-938961212318+
pfam02836Glyco_hydro_2_C2.0e-109476762300+
COG3250LacZ9.0e-1591761098932+
PRK09525lacZ010312141139+
PRK10340ebgA017711911020+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565beta-galactosidase activity
GO:0005975carbohydrate metabolic process
GO:0009341beta-galactosidase complex
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
RefSeqNP_001045421.101041229181115Os01g0952600 [Oryza sativa (japonica cultivar-group)]
RefSeqNP_680128.101041229171105glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeqXP_001770793.1092123311105predicted protein [Physcomitrella patens subsp. patens]
RefSeqXP_002266400.101001231131114PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeqXP_002266434.101001231131115PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB3muy_4010112131110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_3010112131110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_2010112131110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB3muy_1010112131110211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB1jz2_D0921213210211 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
Metabolic Pathways
Pathway NameReactionECProtein Name
lactose degradation IIIBETAGALACTOSID-RXNEC-3.2.1.23β-galactosidase
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
BJ8705242734046760
HO8042743892165940
EG3970692744827550
EL4443013005198180
HO804274475886330.0000002
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_014_00605.1Aquca_014_00605.2
Arabidopsis lyrata485844
Arabidopsis thalianaAT3G54440.1AT3G54440.2AT3G54440.3
Brachypodium distachyonBradi2g61010.1
Brassica rapaBra014816
Carica papayaevm.model.supercontig_540.3
Capsella rubellaCarubv10019396m
Chlamydomonas reinhardtiiCre08.g379450.t1.3
Citrus clementinaCiclev10030068mCiclev10027821m
Citrus sinensisorange1.1g004315morange1.1g004363morange1.1g006933morange1.1g023943morange1.1g023667m
orange1.1g022918morange1.1g007038m
Cucumis sativusCucsa.307940.11Cucsa.307940.1Cucsa.307940.12
Eucalyptus grandisEucgr.J01181.1
Fragaria vescamrna14286.1-v1.0-hybrid
Glycine maxGlyma13g26700.1Glyma15g37670.1
Gossypium raimondiiGorai.010G216500.1
Linum usitatissimumLus10024155Lus10039520Lus10039519
Manihot esculentacassava4.1_002723m
Medicago truncatulaMedtr8g039160.1
Mimulus guttatusmgv1a000801m
Oryza sativaLOC_Os01g72340.1
Panicum virgatumPavirv00010284mPavirv00069653m
Phaseolus vulgarisPhvul.011G206800.1
Picea abiesMA_10429668g0010
Populus trichocarpaPotri.003G196500.1Potri.003G196500.2Potri.001G027400.1
Prunus persicappa000508mppa000532m
Ricinus communis30131.m007196
Setaria italicaSi000118mSi000119m
Selaginella moellendorffii181800
Sorghum bicolorSb03g046050.2Sb03g046050.1
Thellungiella halophilaThhalv10010080m
Vitis viniferaGSVIVT01013489001
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny