PlantCAZyme

CAZyme Information

Basic Information
Species: Oryza sativa
CAZyme ID: LOC_Os01g72340.1
Family: GH2
Protein Properties: Length: 1118; Molecular Weight: 126241; Isoelectric Point: 6.1189
Chromosome: Chromosome/Scaffold: 1; Start: 41954633; End: 41962787
Description: glycoside hydrolase family 2 protein
External Links
NCBI Taxonomy
CAZyDB
Signature Domain
Family  Start  End  Evalue
GH2     87     983  0
  YVQTLSGYWKFLLASSPESVPEKFYDAYFNDSDWEALPVPSNWQMHGFDRPIYTNVTYPFTMNPPFVPNDNPTGCYRTVFRIPKEWKGRRILLHFEAVDS
  AFFAWVNGVPVGYSQDSRLPAEFEITDFCHPCDSEKENVLAVQVMRWSDGSYLEDQDHWWLSGIHRDVLLVSKPQIFITDYFFKATLDEGFRVADIEVEV
  EIDSQKQDREHVSTLSIEATLYDNYGPADVLTSDMSAASVANLKLKPASRPKHCYGFHGYVLGGKVENPKLWSSEHPNLYTLVVVLKDSNGKLIECESCQ
  VGIRNVVLAHKQMLVNGCPVVIRGVNRHEHHPRVGKTNLEACMIKDLVLMRQNNINAVRNSHYPQHPRWYELCDIFGLYVIDEANIETHGFDESSHFKHP
  TLEPFWASAMLDRVVGMVERDKNHACIIVWSLGNESSYGPNHSAMSGWIRGKDPTRPIHYEGGGSRTSSTDIVCPMYMRVWDILKIAQDPSENRPLILCE
  YSHAMGNSNGNIDAYWMAIDNTVGLQGGFIWDWVDQGLLKEDADGSKNWAYGGDFGDTPNDLNFCLNGIVWPDRTIHPAVHEVKYLYQPIKITMMDNMLK
  IENVHFFETTEALDFSWLLQGDGCALGSGSLNVPSIAPQSTHLINMKSSPWFTIWSTCVVKEIFLSINVKLRYQTQWAKDGHILASAQICLPPKKGFVPH
  AIALPRSSLVSERVGDHVLISKSNAWQIKVNSISGTIDSWKVNNIELMSKGIHPCFWRTPTDNDKGGFYTKPYVSRWREASLDNISFYSSQFSLKELPDQ
  TVEISTIYYGLPGNQPKPDETSLSDESESVLFRVQMRGRIYDSGDVILDYEVSPKNDLPPLPRVGVVFNADKSLSRAKWYGRGPFECYPDRKAAAHV
Full Sequence
Protein Sequence     Length: 1118
MAVASASALF SAKNLPHKPW EDPSFFRWRK REAHVPLRSH DTPEGALKYW HERRNVNYLN    60
SDSAVWNDDA VRGALESAAF WSKGLPYVQT LSGYWKFLLA SSPESVPEKF YDAYFNDSDW    120
EALPVPSNWQ MHGFDRPIYT NVTYPFTMNP PFVPNDNPTG CYRTVFRIPK EWKGRRILLH    180
FEAVDSAFFA WVNGVPVGYS QDSRLPAEFE ITDFCHPCDS EKENVLAVQV MRWSDGSYLE    240
DQDHWWLSGI HRDVLLVSKP QIFITDYFFK ATLDEGFRVA DIEVEVEIDS QKQDREHVST    300
LSIEATLYDN YGPADVLTSD MSAASVANLK LKPASRPKHC YGFHGYVLGG KVENPKLWSS    360
EHPNLYTLVV VLKDSNGKLI ECESCQVGIR NVVLAHKQML VNGCPVVIRG VNRHEHHPRV    420
GKTNLEACMI KDLVLMRQNN INAVRNSHYP QHPRWYELCD IFGLYVIDEA NIETHGFDES    480
SHFKHPTLEP FWASAMLDRV VGMVERDKNH ACIIVWSLGN ESSYGPNHSA MSGWIRGKDP    540
TRPIHYEGGG SRTSSTDIVC PMYMRVWDIL KIAQDPSENR PLILCEYSHA MGNSNGNIDA    600
YWMAIDNTVG LQGGFIWDWV DQGLLKEDAD GSKNWAYGGD FGDTPNDLNF CLNGIVWPDR    660
TIHPAVHEVK YLYQPIKITM MDNMLKIENV HFFETTEALD FSWLLQGDGC ALGSGSLNVP    720
SIAPQSTHLI NMKSSPWFTI WSTCVVKEIF LSINVKLRYQ TQWAKDGHIL ASAQICLPPK    780
KGFVPHAIAL PRSSLVSERV GDHVLISKSN AWQIKVNSIS GTIDSWKVNN IELMSKGIHP    840
CFWRTPTDND KGGFYTKPYV SRWREASLDN ISFYSSQFSL KELPDQTVEI STIYYGLPGN    900
QPKPDETSLS DESESVLFRV QMRGRIYDSG DVILDYEVSP KNDLPPLPRV GVVFNADKSL    960
SRAKWYGRGP FECYPDRKAA AHVGVYESGV DELHVPYIVP GECGGRADVR WVALQDADGF    1020
GLFASAYGES PPMQVSASYY GAAELDRATH NHKLVKGDDI EVHLDHKHMG LGGDDSWSPC    1080
VHEQYLLPPA RYAFSVRLCP LLPSSSCHDI YHSQLPC*                            1140
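The sequence block above is printed in groups of ten residues with a running position number at the end of each line, so it has to be joined before coordinates can be applied to it. The following is a minimal sketch of one way to do that in Python. The helper names `parse_sequence_block` and `subsequence` are hypothetical and not part of PlantCAZyme, and the sketch assumes that the coordinates on this page are 1-based and inclusive, with the signature domain read as starting at residue 87.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a position-numbered sequence block into one contiguous string.

    Keeps only letters and the stop marker '*', which drops the spaces
    between 10-residue groups and the trailing position number on each line.
    """
    return "".join("".join(re.findall(r"[A-Za-z*]", line))
                   for line in text.splitlines())


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


# First two lines of the full sequence shown above.
block = """\
MAVASASALF SAKNLPHKPW EDPSFFRWRK REAHVPLRSH DTPEGALKYW HERRNVNYLN    60
SDSAVWNDDA VRGALESAAF WSKGLPYVQT LSGYWKFLLA SSPESVPEKF YDAYFNDSDW    120
"""

seq = parse_sequence_block(block)
print(len(seq))                  # 120
print(subsequence(seq, 87, 96))  # YVQTLSGYWK
```

Under these assumptions, the ten residues from position 87 match the opening of the signature domain sequence listed earlier (YVQTLSGYWK...), which is consistent with reading the domain start as 87.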
Functional Domains
Cdd ID      Domain           E-Value   Start  End   Length
smart01038  Bgal_small_N     3.0e-89   808    1098  292
pfam02836   Glyco_hydro_2_C  5.0e-113  392    678   297
COG3250     LacZ             1.0e-159  88     981   904
PRK09525    lacZ             0         20     1100  1108
PRK10340    ebgA             0         74     1085  1017
Gene Ontology
GO Term     Description
GO:0003824  catalytic activity
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565  beta-galactosidase activity
GO:0005488  binding
GO:0005975  carbohydrate metabolic process
Annotations - NR
Source   Hit ID          E-Value  Query Start  Query End  Hit Start  Hit End  Description
GenBank  EEC72176.1      0        1            1117       1          1032     hypothetical protein OsI_05227 [Oryza sativa Indica Group]
RefSeq   NP_001045421.1  0        1            1117       1          1117     Os01g0952600 [Oryza sativa (japonica cultivar-group)]
RefSeq   XP_002266400.1  0        18           1115       17         1112     PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeq   XP_002266434.1  0        18           1115       17         1113     PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
RefSeq   XP_002513059.1  0        6            1114       5          1107     beta-galactosidase, putative [Ricinus communis]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     1f4h_D  0        88           1099       49         1019     B Chain B, X-Ray Crystal Structure Of Ngt-Bound Hexa
PDB     1f4h_C  0        88           1099       49         1019     B Chain B, X-Ray Crystal Structure Of Ngt-Bound Hexa
PDB     1f4h_B  0        88           1099       49         1019     B Chain B, X-Ray Crystal Structure Of Ngt-Bound Hexa
PDB     1f4h_A  0        88           1099       49         1019     B Chain B, X-Ray Crystal Structure Of Ngt-Bound Hexa
PDB     1f4a_D  0        88           1099       49         1019     A Chain A, E. Coli (Lacz) Beta-Galactosidase (Ncs Constrained Monomer- Orthorhombic)
Metabolic Pathways
Pathway Name             Reaction            EC           Protein Name
lactose degradation III  BETAGALACTOSID-RXN  EC-3.2.1.23  β-galactosidase
Hydropathy
EST
Hit       Length  Start  End  EValue
CB626663  279     39     317  0
CB673522  264     633    896  0
HO804274  394     130    515  0
EG397069  272     401    672  0
HO804274  47      504    549  0.0000002
Orthologous Group
Species                     ID
Aquilegia coerulea          Aquca_014_00605.1, Aquca_014_00605.2
Arabidopsis lyrata          485844
Arabidopsis thaliana        AT3G54440.1, AT3G54440.2, AT3G54440.3
Brachypodium distachyon     Bradi2g61010.1
Brassica rapa               Bra014816
Carica papaya               evm.model.supercontig_540.3
Capsella rubella            Carubv10019396m
Chlamydomonas reinhardtii   Cre08.g379450.t1.3
Citrus clementina           Ciclev10030068m, Ciclev10027821m
Citrus sinensis             orange1.1g004315m, orange1.1g004363m, orange1.1g006933m, orange1.1g023943m, orange1.1g023667m, orange1.1g022918m, orange1.1g007038m
Cucumis sativus             Cucsa.307940.11, Cucsa.307940.1, Cucsa.307940.12
Eucalyptus grandis          Eucgr.J01181.1
Fragaria vesca              mrna14286.1-v1.0-hybrid
Glycine max                 Glyma13g26700.1, Glyma15g37670.1
Gossypium raimondii         Gorai.010G216500.1
Linum usitatissimum         Lus10024155, Lus10039520, Lus10039519
Manihot esculenta           cassava4.1_002723m
Medicago truncatula         Medtr8g039160.1
Mimulus guttatus            mgv1a000801m
Panicum virgatum            Pavirv00010284m, Pavirv00069653m
Physcomitrella patens       Pp1s130_43V6.1
Phaseolus vulgaris          Phvul.011G206800.1
Picea abies                 MA_10429668g0010
Populus trichocarpa         Potri.003G196500.1, Potri.003G196500.2, Potri.001G027400.1
Prunus persica              ppa000508m, ppa000532m
Ricinus communis            30131.m007196
Setaria italica             Si000118m, Si000119m
Selaginella moellendorffii  181800
Sorghum bicolor             Sb03g046050.2, Sb03g046050.1
Thellungiella halophila     Thhalv10010080m
Vitis vinifera              GSVIVT01013489001
Sequence Alignments
Phylogeny