CAZyme Information

Basic Information
Species: Arabidopsis thaliana
CAZyme ID: AT4G38590.2
Family: GH35
Protein Properties: Length: 1053; Molecular Weight: 119544; Isoelectric Point: 7.4711
Chromosome: Chromosome/Scaffold: 4; Start: 18036116; End: 18040928
Description: beta-galactosidase precursor, putative, expressed
External Links
TAIR
Geo Profiles
ATTED-II
NCBI Taxonomy
Plaza
SIGnAL
CAZyDB
Entrez Gene
Signature Domain
Family  Start  End  Evalue
GH35    65     350  0
  SRKHMWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTE
  RYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLW
  TENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAHFVTTRYYDDAPLDEFGLEKAPKYGHLKHVHR
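The domain coordinates can be checked against the full sequence. The sketch below assumes the Start and End values are 1-based and inclusive, and reads the GH35 row as Start 65 and End 350. It uses only the first 120 residues of the Full Sequence record, so it confirms where the domain begins rather than reproducing the whole domain. The helper name `subsequence` is illustrative and not part of any database tool.

```python
# First 120 residues of the Full Sequence record
# (the two lines that end at positions 60 and 120).
PREFIX = (
    "MKSRTRYLIAILLVISLCSKASSHDDEKKKKGVTYDGSERNFIDHKWKKRASFLWFCSLP"
    "SKHTSRKHMWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKG"
)


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]


# The first ten residues of the domain, taken from position 65 onward.
domain_head = subsequence(PREFIX, 65, 74)
print(domain_head)
```

Under that assumption, the slice should match the first ten residues of the signature domain sequence listed above (SRKHMWPSII).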
Full Sequence
Protein Sequence     Length: 1053
MKSRTRYLIA ILLVISLCSK ASSHDDEKKK KGVTYDGSER NFIDHKWKKR ASFLWFCSLP    60
SKHTSRKHMW PSIIDKARIG GLNTIQTYVF WNVHEPEQGK YDFKGRFDLV KFIKLIHEKG    120
LYVTLRLGPF IQAEWNHGGL PYWLREVPDV YFRTNNEPFK EHTERYVRKI LGMMKEEKLF    180
ASQGGPIILG QIENEYNAVQ LAYKENGEKY IKWAANLVES MNLGIPWVMC KQNDAPGNLI    240
NACNGRHCGD TFPGPNRHDK PSLWTENWTT QFRVFGDPPT QRTVEDIAFS VARYFSKNGS    300
HVNYYMYHGG TNFGRTSAHF VTTRYYDDAP LDEFGLEKAP KYGHLKHVHR ALRLCKKALF    360
WGQLRAQTLG PDTEVRYYEQ PGTKVCAAFL SNNNTRDTNT IKFKGQDYVL PSRSISILPD    420
CKTVVYNTAQ IVAQHSWRDF VKSEKTSKGL KFEMFSENIP SLLDGDSLIP GELYYLTKDK    480
TDYACVKIDE DDFPDQKGLK TILRVASLGH ALIVYVNGEY AGKAHGRHEM KSFEFAKPVN    540
FKTGDNRISI LGVLTGLPDS GSYMEHRFAG PRAISIIGLK SGTRDLTENN EWGHLAGLEG    600
EKKEVYTEEG SKKVKWEKDG KRKPLTWYKT YFETPEGVNA VAIRMKAMGK GLIWVNGIGV    660
GRYWMSFLSP LGEPTQTEYH IPRSFMKGEK KKNMLVILEE EPGVKLESID FVLVNRDTIC    720
SNVGEDYPVS VKSWKREGPK IVSRSKDMRL KAVMRCPPEK QMVEVQFASF GDPTGTCGNF    780
TMGKCSASKS KEVVEKECLG RNYCSIVVAR ETFGDKGCPE IVKTLAVQVK CEKKEGKQDE    840
KKKKEDKDEE EEDDEDDDEE EEEEDKENKD TKDMENKNQD ILDSDSALVS DLGFGPFSTV    900
VVNVPLIGGA APPQPRFNLM PPSNYVAGLG RGAAGFTTRS DIGPARANGD GNADVNHKFD    960
DFEGHDAGLF ANAESDDQDK EADAIWDAID RRMDSRRKDR REAKLKQEIE NYRASNPKVS    1020
GQFVDLTRKL HTLSEDEWDS IPEIGNYSHR LY* 
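The record above is laid out in blocks of ten residues with a running position counter at the end of each full line and a terminal `*`. A minimal sketch for flattening that layout into a plain string follows. The function name `parse_numbered_blocks` is hypothetical, and the sample input contains only the first and last lines of the record, which are not contiguous, so the result is for illustration only.

```python
import re


def parse_numbered_blocks(text: str) -> str:
    """Join a block-formatted protein record into one uppercase string.

    Removes whitespace, the running position counter at the end of a line,
    and a terminal '*' (assumed here to mark the stop codon).
    """
    parts = []
    for line in text.splitlines():
        line = re.sub(r"\s+\d+\s*$", "", line)    # drop trailing position counter
        parts.append(re.sub(r"\s+", "", line))    # drop spaces between blocks
    return "".join(parts).rstrip("*").upper()


# Sample: first and last lines of the record only (not contiguous).
sample = (
    "MKSRTRYLIA ILLVISLCSK ASSHDDEKKK KGVTYDGSER NFIDHKWKKR ASFLWFCSLP    60\n"
    "GQFVDLTRKL HTLSEDEWDS IPEIGNYSHR LY*\n"
)

flat = parse_numbered_blocks(sample)
print(len(flat), flat[:10], flat[-4:])
```

Applied to the whole record, the same function would give a string whose length can be compared with the stated length, keeping in mind that the terminal `*` is stripped before counting.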
Functional Domains
Cdd ID     Domain          E-Value   Start  End   Length
COG1874    LacA            2.0e-14   69     196   136
pfam02140  Gal_Lectin      6.0e-20   754    831   81
pfam06424  PRP1_N          3.0e-44   921    1050  131
pfam01301  Glyco_hydro_35  2.0e-135  67     352   300
PLN03059   PLN03059        0         65     831   799
Gene Ontology
GO Term     Description
GO:0000398  mRNA splicing, via spliceosome
GO:0003824  catalytic activity
GO:0004553  hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565  beta-galactosidase activity
GO:0005529  Interacting selectively and non-covalently with any carbohydrate, which includes monosaccharides, oligosaccharides and polysaccharides as well as substances derived from monosaccharides by reduction of the carbonyl group (alditols), by oxidation of one or more hydroxy groups to afford the corresponding aldehydes, ketones, or carboxylic acids, or by replacement of one or more hydroxy group(s) by a hydrogen atom. Cyclitols are generally not regarded as carbohydrates. [CHEBI:16646, GOC:mah]
Annotations - NR
Source      Hit ID          E-Value  Query Start  Query End  Hit Start  Hit End  Description
EMBL        CAB37515.1      0        100          1052       80         1036     galactosidase like protein [Arabidopsis thaliana]
EMBL        CAB64750.1      0        1            881        9          887      putative beta-galactosidase [Arabidopsis thaliana]
RefSeq      NP_001154292.1  0        1            1052       1          1052     glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
RefSeq      NP_195571.2     0        69           1052       1          988      glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
Swiss-Prot  Q9SCU8          0        1            881        9          887      BGL14_ARATH RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags: Precursor
Annotations - PDB
Source  Hit ID  E-Value  Query Start  Query End  Hit Start  Hit End  Description
PDB     3d3a_A  2e-36    67           348        36         325      A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides Thetaiotaomicron
PDB     3thd_D  4e-30    61           348        31         330      A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides Thetaiotaomicron
PDB     3thd_C  4e-30    61           348        31         330      A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides Thetaiotaomicron
Metabolic Pathways
Pathway Name             Reaction            EC           Protein Name
lactose degradation III  BETAGALACTOSID-RXN  EC-3.2.1.23  β-galactosidase
Signal Peptide
Cleavage Site
23
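A reported cleavage site can be used to separate the predicted signal peptide from the mature protein. The sketch below assumes that a site value of 23 means cleavage occurs after residue 23, which is one common convention but is not stated in this record. The function name `split_at_cleavage` is illustrative, and only the first 60 residues of the protein sequence are used.

```python
# First 60 residues of the protein sequence from this record.
N_TERMINUS = "MKSRTRYLIAILLVISLCSKASSHDDEKKKKGVTYDGSERNFIDHKWKKRASFLWFCSLP"


def split_at_cleavage(seq: str, site: int) -> tuple[str, str]:
    """Split seq into (signal_peptide, remainder).

    Assumes `site` is the 1-based position of the last signal-peptide residue,
    i.e. cleavage happens between residues `site` and `site + 1`.
    """
    if not (0 < site < len(seq)):
        raise ValueError("cleavage site must fall inside the sequence")
    return seq[:site], seq[site:]


signal_peptide, mature_start = split_at_cleavage(N_TERMINUS, 23)
print(signal_peptide)
print(mature_start[:10])
```

If the prediction tool instead reports the first residue of the mature protein, the split point shifts by one, so the convention should be confirmed before the coordinates are reused.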
Hydropathy
EST
Hit       Length  Start  End   EValue
HO778160  670     65     703   0
GR453801  261     1131   1391  0
HO780602  593     89     669   0
CO128575  296     69     364   0
CO121155  290     94     383   0
Orthologous Group
Species: ID
Aquilegia coerulea: Aquca_014_00260.2, Aquca_014_00260.1, Aquca_006_00263.3, Aquca_006_00263.2, Aquca_006_00263.1
Arabidopsis lyrata: 496568, 476906, 491156, 480556, 912308
Arabidopsis thaliana: AT5G63800.1, AT1G77410.1, AT4G38590.1, AT2G16730.1, AT4G35010.1
Brachypodium distachyon: Bradi2g56597.1, Bradi3g42330.1, Bradi4g36517.1, Bradi2g24670.1
Brassica rapa: Bra029200, Bra011577, Bra017687, Bra013052, Bra001991, Bra033571, Bra024237
Carica papaya: evm.model.supercontig_37.29, evm.model.supercontig_64.121, evm.model.supercontig_64.120, evm.model.supercontig_37.144
Capsella rubella: Carubv10028665m, Carubv10022317m, Carubv10007813m, Carubv10006573m, Carubv10016129m
Citrus clementina: Ciclev10000420m, Ciclev10004034m, Ciclev10003478m, Ciclev10023501m, Ciclev10026929m, Ciclev10030124m
Citrus sinensis: orange1.1g003612m, orange1.1g041957m, orange1.1g045037m, orange1.1g035496m, orange1.1g006326m, orange1.1g006301m
Cucumis sativus: Cucsa.060550.1, Cucsa.370570.1
Eucalyptus grandis: Eucgr.H05090.1, Eucgr.G01775.1, Eucgr.I02332.1, Eucgr.I01412.1
Fragaria vesca: mrna24180.1-v1.0-hybrid, mrna11229.1-v1.0-hybrid, mrna06055.1-v1.0-hybrid, mrna07651.1-v1.0-hybrid
Glycine max: Glyma06g16430.1, Glyma04g38580.3, Glyma08g00470.1, Glyma04g38580.1, Glyma04g38580.2, Glyma08g00470.2, Glyma04g42620.3, Glyma06g12150.1, Glyma04g42620.2, Glyma06g12150.2, Glyma12g03650.1, Glyma12g03650.2, Glyma04g00520.1, Glyma11g11500.1
Gossypium raimondii: Gorai.012G037700.1, Gorai.010G052200.1, Gorai.010G052300.1, Gorai.009G265200.1, Gorai.002G197200.3, Gorai.002G197200.2, Gorai.002G197200.1, Gorai.002G197100.1, Gorai.008G036000.1
Linum usitatissimum: Lus10020875.306.587, Lus10020875.306.587, Lus10028538, Lus10033502, Lus10033427, Lus10018138, Lus10008259, Lus10033500, Lus10020875.38.306, Lus10020875.38.306, Lus10014126, Lus10019784, Lus10005071, Lus10027844, Lus10027843
Malus domestica: MDP0000202466, MDP0000316304, MDP0000243780, MDP0000206284, MDP0000322037, MDP0000320504, MDP0000225967, MDP0000222132, MDP0000863563
Manihot esculenta: cassava4.1_033913m, cassava4.1_025630m, cassava4.1_023088m, cassava4.1_031197m, cassava4.1_023181m, cassava4.1_023219m
Medicago truncatula: Medtr3g096910.1, Medtr8g095690.1, Medtr3g088520.1, Medtr3g117840.1, Medtr4g073290.1
Mimulus guttatus: mgv1a018569m, mgv1a001345m
Oryza sativa: LOC_Os05g35360.1, LOC_Os08g43570.1, LOC_Os09g36810.1
Panicum virgatum: Pavirv00062685m, Pavirv00037142m, Pavirv00063105m, Pavirv00038064m, Pavirv00037143m, Pavirv00021342m, Pavirv00010263m, Pavirv00038026m, Pavirv00010484m
Phaseolus vulgaris: Phvul.009G153000.1, Phvul.009G127500.1, Phvul.009G119400.1, Phvul.L007600.1, Phvul.009G119600.1, Phvul.011G035300.1
Picea abies: MA_65469g0020
Populus trichocarpa: Potri.005G069200.2, Potri.005G069200.1, Potri.005G180600.1, Potri.002G080700.1, Potri.013G105100.1, Potri.009G134400.1, Potri.004G174800.1
Prunus persica: ppa001308m, ppa019277m, ppa026900m, ppa020492m
Ricinus communis: 29648.m002008, 30074.m001370
Setaria italica: Si004751m, Si021244m, Si024535m, Si013260m, Si028962m
Sorghum bicolor: Sb09g021140.1, Sb07g024870.1, Sb02g031260.1
Thellungiella halophila: Thhalv10018129m, Thhalv10005464m, Thhalv10024392m, Thhalv10022550m
Vitis vinifera: GSVIVT01022353001, GSVIVT01008835001, GSVIVT01008834001, GSVIVT01024031001
Zea mays: AC234152.1_FGT005
Sequence Alignments
Phylogeny