CAZyme Information

Basic Information
Species: Arabidopsis thaliana
Cazyme ID: AT3G54440.3
Family: GH2
Protein Properties: Length: 1121; Molecular Weight: 126827; Isoelectric Point: 5.5638
Chromosome: Chromosome/Scaffold: 3; Start: 20148292; End: 20157067
Description: beta-galactosidase, putative, expressed
Signature Domain
Family  Start  End  Evalue
GH2     100    986  0
  VKSLSGYWKFFLAPKPANVPDKFYDAAFSDSDWNALQVPSNWQCHGFDRPIYTNVVYPFPNDPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSA
  FFAWINGNPVGYSQDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKPKVFIADYFFKSKLADDFSYADIQVEVK
  IDNMQESSKDLVLSNFIIEAAIFDTKNWYNSEGFSCELSPKVANLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKVLDSESSI
  VGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHP
  AKEPSWAAAMLDRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIVCPMYMRVWDIIKIALDQNESRPLILCE
  YQHAMGNSNGNIDEYWEAIDNTFGLQGGFIWDWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGMIK
  VANTYFFNTTEELEFSWTIHGDGLELGSGTLSIPVIKPQNSFEMEWKSGPWFSFWNDSNAGELFLTINAKLLNLTRSLEAGHLLSSTQIPLPAKGQIIPQ
  AIKKTDTSITCETVGDFIKISQKDSWELMVNVRKGTIEGWKIQGVLLMNEAILPCFWRAPTDNDKGGGDSSYFSRWKAAQLDNVEFLVESCSVKSITDKS
  VEIEFIYLGSSASGSSKSDALFKVNVTYLIYGSGDIITNWFVEPNSDLPPLPRVGIEFHIEKTLDRVEWYGKGPFECYPDRKAAAHV
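
The GH2 signature domain is listed at positions 100 to 986, and the 887 residues shown are consistent with reading those as 1-based, inclusive coordinates on the full protein sequence (an assumption; the page does not state the convention). A minimal Python sketch of that slicing follows. The function name `slice_domain` is hypothetical, and only the first 120 residues of this record's full sequence are included to keep the example short.

```python
def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of a sequence, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, so shift only the start.
    return sequence[start - 1:end]


# First 120 residues of the full protein sequence in this record.
first_120 = (
    "MVSLATQMILPSENGYRVWEDQTLFKWRKRDPHVTLRCHESVQVSQGRVKILCDCIGALR"
    "YWYQRNNVDLTVSKSAVWNDDAVQAALDSAAFWVDGLPFVKSLSGYWKFFLAPKPANVPD"
)

# Residues 100-110 should match the first 11 residues of the signature domain.
domain_start = slice_domain(first_120, 100, 110)

# Length implied by the reported Start/End under the inclusive convention.
domain_length = 986 - 100 + 1
```

Under this convention `domain_start` equals the opening residues of the domain listing, and `domain_length` equals the number of residues listed.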
Full Sequence
Protein Sequence     Length: 1121
MVSLATQMIL PSENGYRVWE DQTLFKWRKR DPHVTLRCHE SVQVSQGRVK ILCDCIGALR    60
YWYQRNNVDL TVSKSAVWND DAVQAALDSA AFWVDGLPFV KSLSGYWKFF LAPKPANVPD    120
KFYDAAFSDS DWNALQVPSN WQCHGFDRPI YTNVVYPFPN DPPYVPEDNP TGCYRTYFQI    180
PKEWKDRRIL LHFEAVDSAF FAWINGNPVG YSQDSRLPAE FEISDYCYPW DSGKQNVLAV    240
QVFRWSDGSY LEDQDHWWLS GIHRDVLLLA KPKVFIADYF FKSKLADDFS YADIQVEVKI    300
DNMQESSKDL VLSNFIIEAA IFDTKNWYNS EGFSCELSPK VANLKLNPSP SPTLGFHGYL    360
LEGKLDSPNL WSAEQPNVYI LVLTLKDTSG KVLDSESSIV GIRQVSKAFK QLLVNGHPVV    420
IKGVNRHEHH PRVGKTNIEA CMVKDLIMMK EYNINAVRNS HYPQHPRWYE LCDLFGMYMI    480
DEANIETHGF DLSGHLKHPA KEPSWAAAML DRVVGMVERD KNHTCIISWS LGNEAGYGPN    540
HSAMAGWIRE KDPSRLVHYE GGGSRTSSTD IVCPMYMRVW DIIKIALDQN ESRPLILCEY    600
QHAMGNSNGN IDEYWEAIDN TFGLQGGFIW DWVDQGLLKL GSDGIKRWAY GGDFGDQPND    660
LNFCLNGLIW PDRTPHPALH EVKHCYQPIK VSLTDGMIKV ANTYFFNTTE ELEFSWTIHG    720
DGLELGSGTL SIPVIKPQNS FEMEWKSGPW FSFWNDSNAG ELFLTINAKL LNLTRSLEAG    780
HLLSSTQIPL PAKGQIIPQA IKKTDTSITC ETVGDFIKIS QKDSWELMVN VRKGTIEGWK    840
IQGVLLMNEA ILPCFWRAPT DNDKGGGDSS YFSRWKAAQL DNVEFLVESC SVKSITDKSV    900
EIEFIYLGSS ASGSSKSDAL FKVNVTYLIY GSGDIITNWF VEPNSDLPPL PRVGIEFHIE    960
KTLDRVEWYG KGPFECYPDR KAAAHVAIYE HNVGDMHVPY IVPGENGGRT DVRWVTFRNK    1020
DGVGIYASTY GSSSLMQMNA SYYTTGELHR ATHEEDLIKG QNIEVHLDHK HMGLGGDDSW    1080
TPCVHDKFLI PPAQYSFSLR LCPITASTSG LNIYKDQLPC * 
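
The listing above groups residues in blocks of ten, ends each line with a running position count, and closes with `*`. A minimal Python sketch for turning such a block into a plain residue string is shown below. The function name `clean_sequence` is hypothetical, and the approach (dropping every non-letter character) assumes the block contains nothing but residues, spacing, counters, and the terminal `*`.

```python
import re


def clean_sequence(block: str) -> str:
    """Strip spaces, line breaks, position counters, and '*' from a formatted sequence block."""
    # Everything that is not a letter (digits, whitespace, the stop marker) is removed.
    return re.sub(r"[^A-Za-z]", "", block).upper()


# First two lines of the formatted listing in this record.
example = """MVSLATQMIL PSENGYRVWE DQTLFKWRKR DPHVTLRCHE SVQVSQGRVK ILCDCIGALR    60
YWYQRNNVDL TVSKSAVWND DAVQAALDSA AFWVDGLPFV KSLSGYWKFF LAPKPANVPD    120"""

plain = clean_sequence(example)
print(len(plain))  # 120
```

The same function applied to the final line of the listing returns the residues without the trailing `*`.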
Functional Domains
Cdd ID      Domain           E-Value   Start  End   Length
smart01038  Bgal_small_N     3.0e-96   823    1101  280
pfam02836   Glyco_hydro_2_C  2.0e-112  410    690   291
COG3250     LacZ             1.0e-150  100    984   889
PRK09525    lacZ             0         98     1103  1054
PRK10340    ebgA             0         101    1106  1010
Gene Ontology
GO Term     Description
GO:0003824  catalytic activity
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565  beta-galactosidase activity
GO:0005975  carbohydrate metabolic process
GO:0009341  beta-galactosidase complex
Annotations - NR
Source  Hit ID          E-Value  Query Start  Query End  Hit Start  Hit End  Description
EMBL    CAB77564.1      0        1            1120       1          1075     beta Galactosidase-like protein [Arabidopsis thaliana]
RefSeq  NP_001030858.1  0        1            1120       1          1108     glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeq  NP_680128.1     0        1            1120       1          1107     glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeq  XP_002299206.1  0        1            1119       1          1109     predicted protein [Populus trichocarpa]
RefSeq  XP_002513059.1  0        1            1117       1          1107     beta-galactosidase, putative [Ricinus communis]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     3muy_4  0        100          1102       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_3  0        100          1102       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_2  0        100          1102       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_1  0        100          1102       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     1jz7_D  0        100          1102       51         1021     A Chain A, E. Coli (Lacz) Beta-Galactosidase In Complex With Galactose
Metabolic Pathways
Pathway Name             Reaction            EC           Protein Name
lactose degradation III  BETAGALACTOSID-RXN  EC-3.2.1.23  β-galactosidase
Hydropathy
EST
Hit       Length  Start  End  EValue
HO804274  385     142    523  0
DK499562  275     257    531  0
EL444301  297     451    747  0
DK497367  273     1      273  0
HO804274  47      517    562  0.00000001
Orthologous Group
Species                     ID
Aquilegia coerulea          Aquca_014_00605.1, Aquca_014_00605.2
Arabidopsis lyrata          485844
Arabidopsis thaliana        AT3G54440.1, AT3G54440.2
Brachypodium distachyon     Bradi2g61010.1
Brassica rapa               Bra014816
Carica papaya               evm.model.supercontig_540.3
Capsella rubella            Carubv10019396m
Chlamydomonas reinhardtii   Cre08.g379450.t1.3
Citrus clementina           Ciclev10030068m, Ciclev10027821m
Citrus sinensis             orange1.1g004315m, orange1.1g004363m, orange1.1g006933m, orange1.1g023943m, orange1.1g023667m, orange1.1g022918m, orange1.1g007038m
Cucumis sativus             Cucsa.307940.11, Cucsa.307940.1, Cucsa.307940.12
Eucalyptus grandis          Eucgr.J01181.1
Fragaria vesca              mrna14286.1-v1.0-hybrid
Glycine max                 Glyma13g26700.1, Glyma15g37670.1
Gossypium raimondii         Gorai.010G216500.1
Linum usitatissimum         Lus10024155, Lus10039520, Lus10039519
Manihot esculenta           cassava4.1_002723m
Medicago truncatula         Medtr8g039160.1
Mimulus guttatus            mgv1a000801m
Oryza sativa                LOC_Os01g72340.1
Panicum virgatum            Pavirv00010284m, Pavirv00069653m
Physcomitrella patens       Pp1s130_43V6.1
Phaseolus vulgaris          Phvul.011G206800.1
Picea abies                 MA_10429668g0010
Populus trichocarpa         Potri.003G196500.1, Potri.003G196500.2, Potri.001G027400.1
Prunus persica              ppa000508m, ppa000532m
Ricinus communis            30131.m007196
Setaria italica             Si000118m, Si000119m
Selaginella moellendorffii  181800
Sorghum bicolor             Sb03g046050.2, Sb03g046050.1
Thellungiella halophila     Thhalv10010080m
Vitis vinifera              GSVIVT01013489001
Sequence Alignments
Phylogeny