CAZyme Information

Basic Information
Species: Arabidopsis thaliana
CAZyme ID: AT3G54440.2
Family: GH2
Protein Properties: Length: 1109; Molecular Weight: 125511; Isoelectric Point: 5.4995
Chromosome: Chromosome/Scaffold: 3; Start: 20148336; End: 20157138
Description: beta-galactosidase, putative, expressed
External Links
TAIR
Geo Profiles
ATTED-II
NCBI Taxonomy
Plaza
SIGnAL
CAZyDB
Entrez Gene
Signature Domain
Family  Start  End  Evalue
GH2     87     973  0
  VKSLSGYWKFFLAPKPANVPDKFYDAAFSDSDWNALQVPSNWQCHGFDRPIYTNVVYPFPNDPPYVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSA
  FFAWINGNPVGYSQDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKPKVFIADYFFKSKLADDFSYADIQVEVK
  IDNMQESSKDLVLSNFIIEAAIFDTKNWYNSEGFSCELSPKVANLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKVLDSESSI
  VGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHP
  AKEPSWAAAMLDRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIVCPMYMRVWDIIKIALDQNESRPLILCE
  YQHAMGNSNGNIDEYWEAIDNTFGLQGGFIWDWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGMIK
  VANTYFFNTTEELEFSWTIHGDGLELGSGTLSIPVIKPQNSFEMEWKSGPWFSFWNDSNAGELFLTINAKLLNLTRSLEAGHLLSSTQIPLPAKGQIIPQ
  AIKKTDTSITCETVGDFIKISQKDSWELMVNVRKGTIEGWKIQGVLLMNEAILPCFWRAPTDNDKGGGDSSYFSRWKAAQLDNVEFLVESCSVKSITDKS
  VEIEFIYLGSSASGSSKSDALFKVNVTYLIYGSGDIITNWFVEPNSDLPPLPRVGIEFHIEKTLDRVEWYGKGPFECYPDRKAAAHV
Full Sequence
Protein Sequence     Length: 1109
MVSLATQMIL PSENGYRVWE DQTLFKWRKR DPHVTLRCHE SVQGALRYWY QRNNVDLTVS    60
KSAVWNDDAV QAALDSAAFW VDGLPFVKSL SGYWKFFLAP KPANVPDKFY DAAFSDSDWN    120
ALQVPSNWQC HGFDRPIYTN VVYPFPNDPP YVPEDNPTGC YRTYFQIPKE WKDRRILLHF    180
EAVDSAFFAW INGNPVGYSQ DSRLPAEFEI SDYCYPWDSG KQNVLAVQVF RWSDGSYLED    240
QDHWWLSGIH RDVLLLAKPK VFIADYFFKS KLADDFSYAD IQVEVKIDNM QESSKDLVLS    300
NFIIEAAIFD TKNWYNSEGF SCELSPKVAN LKLNPSPSPT LGFHGYLLEG KLDSPNLWSA    360
EQPNVYILVL TLKDTSGKVL DSESSIVGIR QVSKAFKQLL VNGHPVVIKG VNRHEHHPRV    420
GKTNIEACMV KDLIMMKEYN INAVRNSHYP QHPRWYELCD LFGMYMIDEA NIETHGFDLS    480
GHLKHPAKEP SWAAAMLDRV VGMVERDKNH TCIISWSLGN EAGYGPNHSA MAGWIREKDP    540
SRLVHYEGGG SRTSSTDIVC PMYMRVWDII KIALDQNESR PLILCEYQHA MGNSNGNIDE    600
YWEAIDNTFG LQGGFIWDWV DQGLLKLGSD GIKRWAYGGD FGDQPNDLNF CLNGLIWPDR    660
TPHPALHEVK HCYQPIKVSL TDGMIKVANT YFFNTTEELE FSWTIHGDGL ELGSGTLSIP    720
VIKPQNSFEM EWKSGPWFSF WNDSNAGELF LTINAKLLNL TRSLEAGHLL SSTQIPLPAK    780
GQIIPQAIKK TDTSITCETV GDFIKISQKD SWELMVNVRK GTIEGWKIQG VLLMNEAILP    840
CFWRAPTDND KGGGDSSYFS RWKAAQLDNV EFLVESCSVK SITDKSVEIE FIYLGSSASG    900
SSKSDALFKV NVTYLIYGSG DIITNWFVEP NSDLPPLPRV GIEFHIEKTL DRVEWYGKGP    960
FECYPDRKAA AHVAIYEHNV GDMHVPYIVP GENGGRTDVR WVTFRNKDGV GIYASTYGSS    1020
SLMQMNASYY TTGELHRATH EEDLIKGQNI EVVHLDHKHM GLGGDDSWTP CVHDKFLIPP    1080
AQYSFSLRLC PITASTSGLN IYKDQLPC* 
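The coordinates in the Signature Domain table index into the full protein sequence. As a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive, the Python below joins a numbered sequence block into a single string and slices out a range. The helper names `parse_numbered_sequence` and `subsequence` are hypothetical, and only the first two lines of the sequence (residues 1-120) are used to keep the example short.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one string.

    Keeps letters only, which drops the group spaces, the trailing
    position counters, and a terminal '*' stop marker.
    """
    return "".join(re.sub(r"[^A-Za-z]", "", line) for line in block.splitlines())


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"range {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


# First two lines of the full sequence shown above (residues 1-120).
block = """MVSLATQMIL PSENGYRVWE DQTLFKWRKR DPHVTLRCHE SVQGALRYWY QRNNVDLTVS    60
KSAVWNDDAV QAALDSAAFW VDGLPFVKSL SGYWKFFLAP KPANVPDKFY DAAFSDSDWN    120"""

seq = parse_numbered_sequence(block)
print(len(seq))                  # 120
print(subsequence(seq, 87, 96))  # VKSLSGYWKF
```

Residues 87-96 come out as VKSLSGYWKF, which matches the first ten residues of the signature domain listed above. Applying the same slice with 87 and 973 to the complete sequence should give the whole domain.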
Functional Domains
Cdd ID      Domain           E-Value   Start  End   Length
smart01038  Bgal_small_N     4.0e-95   810    1089  281
pfam02836   Glyco_hydro_2_C  3.0e-112  397    677   291
COG3250     LacZ             3.0e-150  87     971   889
PRK09525    lacZ             0         17     1091  1123
PRK10340    ebgA             0         88     1094  1011
Gene Ontology
GO Term     Description
GO:0003824  catalytic activity
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565  beta-galactosidase activity
GO:0005975  carbohydrate metabolic process
GO:0009341  beta-galactosidase complex
Annotations - NR
Source  Hit ID          E-Value  Query Start  Query End  Hit Start  Hit End  Description
EMBL    CAB77564.1      0        1            1108       1          1075     beta Galactosidase-like protein [Arabidopsis thaliana]
RefSeq  NP_001030858.1  0        1            1108       1          1108     glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeq  NP_680128.1     0        1            1108       1          1107     glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeq  XP_002299206.1  0        1            1107       1          1109     predicted protein [Populus trichocarpa]
RefSeq  XP_002513059.1  0        1            1105       1          1107     beta-galactosidase, putative [Ricinus communis]
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     3muy_4  0        87           1090       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_3  0        87           1090       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_2  0        87           1090       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     3muy_1  0        87           1090       51         1021     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
PDB     1f4h_D  0        87           1090       49         1019     1 Chain 1, E. Coli (Lacz) Beta-Galactosidase (R599a)
Metabolic Pathways
Pathway Name             Reaction            EC           Protein Name
lactose degradation III  BETAGALACTOSID-RXN  EC-3.2.1.23  β-galactosidase
Hydropathy
EST
Hit       Length  Start  End  EValue
HO804274  385     129    510  0
DK499562  275     244    518  0
DK497367  260     1      260  0
EL444301  297     438    734  0
HO804274  47      504    549  0.000000009
Orthologous Group
Species                     ID
Aquilegia coerulea          Aquca_014_00605.1, Aquca_014_00605.2
Arabidopsis lyrata          485844
Arabidopsis thaliana        AT3G54440.1, AT3G54440.3
Brachypodium distachyon     Bradi2g61010.1
Brassica rapa               Bra014816
Carica papaya               evm.model.supercontig_540.3
Capsella rubella            Carubv10019396m
Chlamydomonas reinhardtii   Cre08.g379450.t1.3
Citrus clementina           Ciclev10030068m, Ciclev10027821m
Citrus sinensis             orange1.1g004315m, orange1.1g004363m, orange1.1g006933m, orange1.1g023943m, orange1.1g023667m, orange1.1g022918m, orange1.1g007038m
Cucumis sativus             Cucsa.307940.11, Cucsa.307940.1, Cucsa.307940.12
Eucalyptus grandis          Eucgr.J01181.1
Fragaria vesca              mrna14286.1-v1.0-hybrid
Glycine max                 Glyma13g26700.1, Glyma15g37670.1
Gossypium raimondii         Gorai.010G216500.1
Linum usitatissimum         Lus10024155, Lus10039520, Lus10039519
Manihot esculenta           cassava4.1_002723m
Medicago truncatula         Medtr8g039160.1
Mimulus guttatus            mgv1a000801m
Oryza sativa                LOC_Os01g72340.1
Panicum virgatum            Pavirv00010284m, Pavirv00069653m
Physcomitrella patens       Pp1s130_43V6.1
Phaseolus vulgaris          Phvul.011G206800.1
Picea abies                 MA_10429668g0010
Populus trichocarpa         Potri.003G196500.1, Potri.003G196500.2, Potri.001G027400.1
Prunus persica              ppa000508m, ppa000532m
Ricinus communis            30131.m007196
Setaria italica             Si000118m, Si000119m
Selaginella moellendorffii  181800
Sorghum bicolor             Sb03g046050.2, Sb03g046050.1
Thellungiella halophila     Thhalv10010080m
Vitis vinifera              GSVIVT01013489001
Sequence Alignments
Phylogeny