CAZyme Information

Basic Information
SpeciesArabidopsis lyrata
Cazyme ID485844
FamilyGH2
Protein PropertiesLength: 1108 Molecular Weight: 125680 Isoelectric Point: 5.787
ChromosomeChromosome/Scaffold: 5 Start: 17013865 End: 17022452
Descriptionglycoside hydrolase family 2 protein
View CDS
External Links
NCBI Taxonomy
Plaza
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH2879730
  VKSLSGYWKFFLAPKPANVPDKFYDPAFPDSDWNALPVPSNWQCHGFDRPIYTNVVYPFPNDPPHVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSA
  FFAWINGNPVGYSQDSRLPAEFEISDYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKPKVFIADYFFKSKLADDFSYADIQVEVK
  IDNMQESSKHLVLSNFIIEAAVFDTKNWYNSEGFNCELSPKVAHLKLNPSPSPTLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKVLDSESSI
  VGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIEACMVKDLIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHP
  AKEPSWAAAMLDRVVGMVERDKNHTCIISWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIVCPMYMRVWDIIKIALDQNESRPLILCE
  YQHAMGNSNGNIDEYWDAIDNTFGLQGGFIWDWVDQGLLKLGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKHCYQPIKVSLTDGLIK
  VANTYFFHTTEELEFSWKIHGDGLELGSGTLSIPVIKPQNSFEIEWKSGPWFSFWNDSNAGELFLTINAKLLNPTRSLEAGHLLSSTQIPLPAKRQIIPQ
  AIKKTDTIITCETVGDFIKISQQDSWELMINVRKGAIEGWKIQGVLLMKEDILPCFWRAPTDNDKGGGDSSYFLRWKAAQLDNVEFLVESCSVKSITDKA
  VEIEFIYLGSSASVSSKTDALFKVNVTYLIYGSGDIITNWSVEPNSDLPPLPRVGIEFHIEKTLDRVEWYGKGPFECYPDRKAAAHV
Full Sequence
Protein Sequence     Length: 1108     Download
MVSLATQMII PSENGYRVWE DQTLFKWRKR DPHVTLRCHE SVQGALRYWY QRNNVDLTVS    60
RSAVWNDDAV QAALDSAAFW VDGLPFVKSL SGYWKFFLAP KPANVPDKFY DPAFPDSDWN    120
ALPVPSNWQC HGFDRPIYTN VVYPFPNDPP HVPEDNPTGC YRTYFQIPKE WKDRRILLHF    180
EAVDSAFFAW INGNPVGYSQ DSRLPAEFEI SDYCYPWDSG KQNVLAVQVF RWSDGSYLED    240
QDHWWLSGIH RDVLLLAKPK VFIADYFFKS KLADDFSYAD IQVEVKIDNM QESSKHLVLS    300
NFIIEAAVFD TKNWYNSEGF NCELSPKVAH LKLNPSPSPT LGFHGYLLEG KLDSPNLWSA    360
EQPNVYILVL TLKDTSGKVL DSESSIVGIR QVSKAFKQLL VNGHPVVIKG VNRHEHHPRV    420
GKTNIEACMV KDLIMMKEYN INAVRNSHYP QHPRWYELCD LFGMYMIDEA NIETHGFDLS    480
GHLKHPAKEP SWAAAMLDRV VGMVERDKNH TCIISWSLGN EAGYGPNHSA MAGWIREKDP    540
SRLVHYEGGG SRTSSTDIVC PMYMRVWDII KIALDQNESR PLILCEYQHA MGNSNGNIDE    600
YWDAIDNTFG LQGGFIWDWV DQGLLKLGSD GIKRWAYGGD FGDQPNDLNF CLNGLIWPDR    660
TPHPALHEVK HCYQPIKVSL TDGLIKVANT YFFHTTEELE FSWKIHGDGL ELGSGTLSIP    720
VIKPQNSFEI EWKSGPWFSF WNDSNAGELF LTINAKLLNP TRSLEAGHLL SSTQIPLPAK    780
RQIIPQAIKK TDTIITCETV GDFIKISQQD SWELMINVRK GAIEGWKIQG VLLMKEDILP    840
CFWRAPTDND KGGGDSSYFL RWKAAQLDNV EFLVESCSVK SITDKAVEIE FIYLGSSASV    900
SSKTDALFKV NVTYLIYGSG DIITNWSVEP NSDLPPLPRV GIEFHIEKTL DRVEWYGKGP    960
FECYPDRKAA AHVAIYEHNV GDMHVPYIVP GESGGRTDVR WVTFRNKDGV GIYASTYGNS    1020
SPMQMNASYY TTGELNRATH EEDLIKGQNI EVHLDHKHMG LGGDDSWTPC VHDKYLIPPK    1080
PYSFSLRLCP ITASTSVLDI YKDQLPC* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
smart01038Bgal_small_N1.0e-978101088280+
pfam02836Glyco_hydro_2_C3.0e-112397677291+
COG3250LacZ2.0e-15087971890+
PRK09525lacZ01710901122+
PRK10340ebgA05010931055+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565beta-galactosidase activity
GO:0005975carbohydrate metabolic process
GO:0009341beta-galactosidase complex
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
EMBLCAB77564.101110711075beta Galactosidase-like protein [Arabidopsis thaliana]
RefSeqNP_001030858.101110711108glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeqNP_680128.101110711107glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeqXP_002299206.101110611109predicted protein [Populus trichocarpa]
RefSeqXP_002513059.101110411107beta-galactosidase, putative [Ricinus communis]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB1jz2_D0871089511021A Chain A, Crystal Structure Of Bermuda Grass Isoallergen Bg60 Provides Insight Into The Various Cross-Allergenicity Of The Pollen Group 4 Allergens
PDB1jz2_C0871089511021A Chain A, Crystal Structure Of Bermuda Grass Isoallergen Bg60 Provides Insight Into The Various Cross-Allergenicity Of The Pollen Group 4 Allergens
PDB1jz2_B0871089511021A Chain A, Crystal Structure Of Bermuda Grass Isoallergen Bg60 Provides Insight Into The Various Cross-Allergenicity Of The Pollen Group 4 Allergens
PDB1jz2_A0871089511021A Chain A, Crystal Structure Of Bermuda Grass Isoallergen Bg60 Provides Insight Into The Various Cross-Allergenicity Of The Pollen Group 4 Allergens
PDB1jz1_P0871089511021A Chain A, Crystal Structure Of Bermuda Grass Isoallergen Bg60 Provides Insight Into The Various Cross-Allergenicity Of The Pollen Group 4 Allergens
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
HO8042743851295100
DK4995622752445180
DK49736726012600
DK50207125612560
HO804274475045490.000000007
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_014_00605.1Aquca_014_00605.2
Arabidopsis thalianaAT3G54440.1AT3G54440.2AT3G54440.3
Brachypodium distachyonBradi2g61010.1
Brassica rapaBra014816
Carica papayaevm.model.supercontig_540.3
Capsella rubellaCarubv10019396m
Chlamydomonas reinhardtiiCre08.g379450.t1.3
Citrus clementinaCiclev10030068mCiclev10027821m
Citrus sinensisorange1.1g004315morange1.1g004363morange1.1g006933morange1.1g023943morange1.1g023667m
orange1.1g022918morange1.1g007038m
Cucumis sativusCucsa.307940.11Cucsa.307940.1Cucsa.307940.12
Eucalyptus grandisEucgr.J01181.1
Fragaria vescamrna14286.1-v1.0-hybrid
Glycine maxGlyma13g26700.1Glyma15g37670.1
Gossypium raimondiiGorai.010G216500.1
Linum usitatissimumLus10024155Lus10039520Lus10039519
Manihot esculentacassava4.1_002723m
Medicago truncatulaMedtr8g039160.1
Mimulus guttatusmgv1a000801m
Oryza sativaLOC_Os01g72340.1
Panicum virgatumPavirv00010284mPavirv00069653m
Physcomitrella patensPp1s130_43V6.1
Phaseolus vulgarisPhvul.011G206800.1
Picea abiesMA_10429668g0010
Populus trichocarpaPotri.003G196500.1Potri.003G196500.2Potri.001G027400.1
Prunus persicappa000508mppa000532m
Ricinus communis30131.m007196
Setaria italicaSi000118mSi000119m
Selaginella moellendorffii181800
Sorghum bicolorSb03g046050.2Sb03g046050.1
Thellungiella halophilaThhalv10010080m
Vitis viniferaGSVIVT01013489001
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny