CAZyme Information

Basic Information
SpeciesSetaria italica
Cazyme IDSi000119m
FamilyGH2
Protein PropertiesLength: 1117 Molecular Weight: 125981 Isoelectric Point: 5.9392
ChromosomeChromosome/Scaffold: 5 Start: 46448261 End: 46457402
Descriptionglycoside hydrolase family 2 protein
View CDS
External Links
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH2879830
  YTKSLSGYWKFLLAPSAESVPEKFFDAHFDDSNWEALPVPSNWQMHGFDRPIYTNTTYPFPINPPFVSTDNPTGCYRTVFHIPKEWKGRRILLHFEAVDS
  AFFAWVNGVPIGYSQDSRLPAEFEVTDCCHPCDSDKENVLAVQVMRWSDGSYLEDQDHWWLSGIHRDVLLLSKPQIFITDYFFKATMDENFSLADIEVEV
  EIDSHKQDREHVSTLSIEATLYDNSGPSISLDGDLSFANVVNLKPKPKTSRGPCLGFHGYVLGGKIENPKLWSSEHPNLYTLVVLLKDANGKLIECESCQ
  VGIRNVVRAHKQMLVNGCPVVLRGVNRHEHHPRLGKTNIEACMIKDLILMRQNNINAVRNSHYPQHSRWYELCDIFGLYVIDEANIETHGFDENSHFKHP
  TLEPIWANAMLDRVVGMVERDKNHACIIVWSLGNESSYGPNHASMSGWIRERDPTRLLHYEGGGSRTSSTDIVCPMYMRVWDIIKIAKDPSETRPLILCE
  YSHAMGNSNGNIDAYWMAIDNTFGLQGGFIWDWVDQGLLKEDSDGSKFWAYGGDFGDTPNDLNFCLNGIVWPDRTIHPAVHEVKYLYQPIKISSADNMLK
  IENGHFFDTTEALDFSWVLQGDGCILGSGSLNVPTLAPQTSHLINMESSPWFALWSTCAVKEVFLSVNVKQRYHTRWAKDGHLLASAQLCLPQKNGFVPH
  AVAFSSSPLVCERTGDSVIISKNDAWKIKVNSQLGTIDSWKVSNVELMSKGIFPCFWRAPTDNDKGGFYTKPYVSQWREASLDNVSFYSSQFSVKELPDN
  TVELSTVYYGLPGNLPKPDDAALSQAPESTLFQVNMLCRIYESGDVVLEYEVNPKADLPPLPRVGVVFNAEKSLSHVMWYGRGPFECYPDRKAAAHV
Full Sequence
Protein Sequence     Length: 1117     Download
MALAYASAVV PPSNRSYKAW EDPSFFKWRK RDAHVPLRSQ DTLEGALRYW HERRNVNYLN    60
ADTAVWNDDA VRGALESAAL WSKGLPYTKS LSGYWKFLLA PSAESVPEKF FDAHFDDSNW    120
EALPVPSNWQ MHGFDRPIYT NTTYPFPINP PFVSTDNPTG CYRTVFHIPK EWKGRRILLH    180
FEAVDSAFFA WVNGVPIGYS QDSRLPAEFE VTDCCHPCDS DKENVLAVQV MRWSDGSYLE    240
DQDHWWLSGI HRDVLLLSKP QIFITDYFFK ATMDENFSLA DIEVEVEIDS HKQDREHVST    300
LSIEATLYDN SGPSISLDGD LSFANVVNLK PKPKTSRGPC LGFHGYVLGG KIENPKLWSS    360
EHPNLYTLVV LLKDANGKLI ECESCQVGIR NVVRAHKQML VNGCPVVLRG VNRHEHHPRL    420
GKTNIEACMI KDLILMRQNN INAVRNSHYP QHSRWYELCD IFGLYVIDEA NIETHGFDEN    480
SHFKHPTLEP IWANAMLDRV VGMVERDKNH ACIIVWSLGN ESSYGPNHAS MSGWIRERDP    540
TRLLHYEGGG SRTSSTDIVC PMYMRVWDII KIAKDPSETR PLILCEYSHA MGNSNGNIDA    600
YWMAIDNTFG LQGGFIWDWV DQGLLKEDSD GSKFWAYGGD FGDTPNDLNF CLNGIVWPDR    660
TIHPAVHEVK YLYQPIKISS ADNMLKIENG HFFDTTEALD FSWVLQGDGC ILGSGSLNVP    720
TLAPQTSHLI NMESSPWFAL WSTCAVKEVF LSVNVKQRYH TRWAKDGHLL ASAQLCLPQK    780
NGFVPHAVAF SSSPLVCERT GDSVIISKND AWKIKVNSQL GTIDSWKVSN VELMSKGIFP    840
CFWRAPTDND KGGFYTKPYV SQWREASLDN VSFYSSQFSV KELPDNTVEL STVYYGLPGN    900
LPKPDDAALS QAPESTLFQV NMLCRIYESG DVVLEYEVNP KADLPPLPRV GVVFNAEKSL    960
SHVMWYGRGP FECYPDRKAA AHVGVYESSV EDLHVPYIVP GECGGRADVR WVALRNADGL    1020
GLQASVHGES PPMQMSASYY GTEELDRATH VHKLVKGDDI EVHLDHRHMG LGGDDSWTPC    1080
VHEQYLLPPT RYAFSMRLCP LLPSSSCHDI YKSQLP* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
smart01038Bgal_small_N5.0e-858081098292+
pfam02836Glyco_hydro_2_C1.0e-111397678292+
COG3250LacZ6.0e-15589981902+
PRK09525lacZ02011001104+
PRK10340ebgA07410851025+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565beta-galactosidase activity
GO:0005975carbohydrate metabolic process
GO:0009341beta-galactosidase complex
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
GenBankEEC72176.101111611031hypothetical protein OsI_05227 [Oryza sativa Indica Group]
RefSeqNP_001045421.101111611116Os01g0952600 [Oryza sativa (japonica cultivar-group)]
RefSeqNP_680128.103111621106glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeqXP_002266400.10121115111112PREDICTED: hypothetical protein isoform 1 [Vitis vinifera]
RefSeqXP_002266434.10121115111113PREDICTED: hypothetical protein isoform 2 [Vitis vinifera]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB1f4h_D0891099501019A Chain A, E. Coli (Lacz) Beta-Galactosidase (Ncs Constrained Monomer- Orthorhombic)
PDB1f4h_C0891099501019A Chain A, E. Coli (Lacz) Beta-Galactosidase (Ncs Constrained Monomer- Orthorhombic)
PDB1f4h_B0891099501019A Chain A, E. Coli (Lacz) Beta-Galactosidase (Ncs Constrained Monomer- Orthorhombic)
PDB1f4h_A0891099501019A Chain A, E. Coli (Lacz) Beta-Galactosidase (Ncs Constrained Monomer- Orthorhombic)
PDB1f4a_D0891099501019A Chain A, E. Coli (Lacz) Beta-Galactosidase (Ncs Constrained Monomer- Orthorhombic)
Metabolic Pathways
Pathway NameReactionECProtein Name
lactose degradation IIIBETAGALACTOSID-RXNEC-3.2.1.23β-galactosidase
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
FL7270742724707410
FL7691292614707300
FL8237572614637230
HO8042743911305150
HO804274475045490.0000002
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_014_00605.1Aquca_014_00605.2
Arabidopsis lyrata485844
Arabidopsis thalianaAT3G54440.1AT3G54440.2AT3G54440.3
Brachypodium distachyonBradi2g61010.1
Brassica rapaBra014816
Carica papayaevm.model.supercontig_540.3
Capsella rubellaCarubv10019396m
Chlamydomonas reinhardtiiCre08.g379450.t1.3
Citrus clementinaCiclev10030068mCiclev10027821m
Citrus sinensisorange1.1g004315morange1.1g004363morange1.1g006933morange1.1g023943morange1.1g023667m
orange1.1g022918morange1.1g007038m
Cucumis sativusCucsa.307940.11Cucsa.307940.1Cucsa.307940.12
Eucalyptus grandisEucgr.J01181.1
Fragaria vescamrna14286.1-v1.0-hybrid
Glycine maxGlyma13g26700.1Glyma15g37670.1
Gossypium raimondiiGorai.010G216500.1
Linum usitatissimumLus10024155Lus10039520Lus10039519
Manihot esculentacassava4.1_002723m
Medicago truncatulaMedtr8g039160.1
Mimulus guttatusmgv1a000801m
Oryza sativaLOC_Os01g72340.1
Panicum virgatumPavirv00010284mPavirv00069653m
Physcomitrella patensPp1s130_43V6.1
Phaseolus vulgarisPhvul.011G206800.1
Picea abiesMA_10429668g0010
Populus trichocarpaPotri.003G196500.1Potri.003G196500.2Potri.001G027400.1
Prunus persicappa000508mppa000532m
Ricinus communis30131.m007196
Setaria italicaSi000118m
Selaginella moellendorffii181800
Sorghum bicolorSb03g046050.2Sb03g046050.1
Thellungiella halophilaThhalv10010080m
Vitis viniferaGSVIVT01013489001
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny