CAZyme Information

Basic Information
SpeciesCapsella rubella
Cazyme IDCarubv10019396m
FamilyGH2
Protein PropertiesLength: 1108 Molecular Weight: 125350 Isoelectric Point: 5.3812
ChromosomeChromosome/Scaffold: 5 Start: 10335713 End: 10344011
Descriptionglycoside hydrolase family 2 protein
View CDS
External Links
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH2879730
  VKSLSGYWKFFLAPKPANVPENFYDAAFPDSDWDALPVPSNWQCHGFDRPIYTNVVYPFPNDPPHVPEDNPTGCYRTYFQIPKEWKDRRILLHFEAVDSA
  FFAWINGNPIGYSQDSRLPAEFEISEYCYPWDSGKQNVLAVQVFRWSDGSYLEDQDHWWLSGIHRDVLLLAKPKVFIADYFFKSKLADDFSYADIQVEVK
  IDNMQESSKDLVLSNFIIEAAVFSTKNWYNSEGFSSELSPKVANLTLNPSPSPVLGFHGYLLEGKLDSPNLWSAEQPNVYILVLTLKDTSGKILDSESSI
  VGIRQVSKAFKQLLVNGHPVVIKGVNRHEHHPRVGKTNIESCMVKDLIMMKEYNINAVRNSHYPQHPRWYELCDLFGMYMIDEANIETHGFDLSGHLKHP
  AKEPSWAAAMLDRVVGMVERDKNHTCIVSWSLGNEAGYGPNHSAMAGWIREKDPSRLVHYEGGGSRTSSTDIICPMYMRVWDIVKIALDQNESRPLILCE
  YQHAMGNSNGNIDEYWEAIDNTFGLQGGFIWDWVDQGLLKPGSDGIKRWAYGGDFGDQPNDLNFCLNGLIWPDRTPHPALHEVKYCYQPINVSLTDGTMK
  VANTYFFHTTEELEFSWTVHGDGLELGSGALSIPVIKPQNSFDMEWKSGPWFSFWNDSNAGELFLTITAKLLSPTRSLETGHLVSSTQIPLPAKRQIIPQ
  ALKKTDTIIACETVGDFIKISQQDSWELMINVRKGAIEGWKIQGVLLMNEAILPCFWRAPTDNDKGGGDSSYFSRWKAAQLDDVEFLVESCSVKSITDKS
  VEIEFIYLGSSASGSSKSEALFKVNVTYLIYGSGDIITNWIVEPNSDLPPLPRVGIEFHIEKTLDRVKWYGKGPYECYPDRKSAAHV
Full Sequence
Protein Sequence     Length: 1108     Download
MVSLATRMIL PSENGYRAWE DQTLFKWRKR DPHVTLRCHE SVEGSLRYWY QRNNVDLAVS    60
KTAVWNDDAV QAALDSAAFW VDGLPFVKSL SGYWKFFLAP KPANVPENFY DAAFPDSDWD    120
ALPVPSNWQC HGFDRPIYTN VVYPFPNDPP HVPEDNPTGC YRTYFQIPKE WKDRRILLHF    180
EAVDSAFFAW INGNPIGYSQ DSRLPAEFEI SEYCYPWDSG KQNVLAVQVF RWSDGSYLED    240
QDHWWLSGIH RDVLLLAKPK VFIADYFFKS KLADDFSYAD IQVEVKIDNM QESSKDLVLS    300
NFIIEAAVFS TKNWYNSEGF SSELSPKVAN LTLNPSPSPV LGFHGYLLEG KLDSPNLWSA    360
EQPNVYILVL TLKDTSGKIL DSESSIVGIR QVSKAFKQLL VNGHPVVIKG VNRHEHHPRV    420
GKTNIESCMV KDLIMMKEYN INAVRNSHYP QHPRWYELCD LFGMYMIDEA NIETHGFDLS    480
GHLKHPAKEP SWAAAMLDRV VGMVERDKNH TCIVSWSLGN EAGYGPNHSA MAGWIREKDP    540
SRLVHYEGGG SRTSSTDIIC PMYMRVWDIV KIALDQNESR PLILCEYQHA MGNSNGNIDE    600
YWEAIDNTFG LQGGFIWDWV DQGLLKPGSD GIKRWAYGGD FGDQPNDLNF CLNGLIWPDR    660
TPHPALHEVK YCYQPINVSL TDGTMKVANT YFFHTTEELE FSWTVHGDGL ELGSGALSIP    720
VIKPQNSFDM EWKSGPWFSF WNDSNAGELF LTITAKLLSP TRSLETGHLV SSTQIPLPAK    780
RQIIPQALKK TDTIIACETV GDFIKISQQD SWELMINVRK GAIEGWKIQG VLLMNEAILP    840
CFWRAPTDND KGGGDSSYFS RWKAAQLDDV EFLVESCSVK SITDKSVEIE FIYLGSSASG    900
SSKSEALFKV NVTYLIYGSG DIITNWIVEP NSDLPPLPRV GIEFHIEKTL DRVKWYGKGP    960
YECYPDRKSA AHVAIYEHNV GDMHVPYIVP GESGGRTDVR WVTFQNKDGL GIYVSTYGSS    1020
SPMQMNASYY TTGELHRATH EEDLIKGKNI EVHLDHKHMG LGGDDSWTPC VHDKYLIPPQ    1080
PYSFSLRLCP ITAGTSVLDI YKDQLPC* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
smart01038Bgal_small_N5.0e-968101088280+
pfam02836Glyco_hydro_2_C7.0e-111397677291+
COG3250LacZ4.0e-14687971894+
PRK09525lacZ01710901109+
PRK10340ebgA08810941011+
Gene Ontology
GO TermDescription
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004565beta-galactosidase activity
GO:0005975carbohydrate metabolic process
GO:0009341beta-galactosidase complex
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
EMBLCAB77564.101110711075beta Galactosidase-like protein [Arabidopsis thaliana]
RefSeqNP_001030858.101110711108glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeqNP_680128.101110711107glycoside hydrolase family 2 protein [Arabidopsis thaliana]
RefSeqXP_002303929.101110511111predicted protein [Populus trichocarpa]
RefSeqXP_002513059.101110411107beta-galactosidase, putative [Ricinus communis]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB1f4h_D0871089491019A Chain A, Peanut Peroxidase
PDB1f4h_C0871089491019A Chain A, Peanut Peroxidase
PDB1f4h_B0871089491019A Chain A, Peanut Peroxidase
PDB1f4h_A0871089491019A Chain A, Peanut Peroxidase
PDB1f4a_D0871089491019A Chain A, E. Coli (Lacz) Beta-Galactosidase (Ncs Constrained Monomer- Orthorhombic)
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
HO8042743851295100
DK4995622752445180
DK49736726012600
DK50207125612560
HO804274475045490.00000001
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_014_00605.1Aquca_014_00605.2
Arabidopsis lyrata485844
Arabidopsis thalianaAT3G54440.1AT3G54440.2AT3G54440.3
Brachypodium distachyonBradi2g61010.1
Brassica rapaBra014816
Carica papayaevm.model.supercontig_540.3
Chlamydomonas reinhardtiiCre08.g379450.t1.3
Citrus clementinaCiclev10030068mCiclev10027821m
Citrus sinensisorange1.1g004315morange1.1g004363morange1.1g006933morange1.1g023943morange1.1g023667m
orange1.1g022918morange1.1g007038m
Cucumis sativusCucsa.307940.11Cucsa.307940.1Cucsa.307940.12
Eucalyptus grandisEucgr.J01181.1
Fragaria vescamrna14286.1-v1.0-hybrid
Glycine maxGlyma13g26700.1Glyma15g37670.1
Gossypium raimondiiGorai.010G216500.1
Linum usitatissimumLus10024155Lus10039520Lus10039519
Manihot esculentacassava4.1_002723m
Medicago truncatulaMedtr8g039160.1
Mimulus guttatusmgv1a000801m
Oryza sativaLOC_Os01g72340.1
Panicum virgatumPavirv00010284mPavirv00069653m
Physcomitrella patensPp1s130_43V6.1
Phaseolus vulgarisPhvul.011G206800.1
Picea abiesMA_10429668g0010
Populus trichocarpaPotri.003G196500.1Potri.003G196500.2Potri.001G027400.1
Prunus persicappa000508mppa000532m
Ricinus communis30131.m007196
Setaria italicaSi000118mSi000119m
Selaginella moellendorffii181800
Sorghum bicolorSb03g046050.2Sb03g046050.1
Thellungiella halophilaThhalv10010080m
Vitis viniferaGSVIVT01013489001
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny