CAZyme Information

Basic Information
SpeciesCapsella rubella
Cazyme IDCarubv10000251m
FamilyGH89
Protein PropertiesLength: 807 Molecular Weight: 93550.3 Isoelectric Point: 8.9498
ChromosomeChromosome/Scaffold: 6 Start: 4406731 End: 4411443
Descriptionalpha-N-acetylglucosaminidase family / NAGLU family
View CDS
External Links
CAZyDB
Signature Domain  Download full data set without filtering
FamilyStartEndEvalue
GH89938010
  PEIVIKGTTGVEIASGLHWYLKYKCNAHVSWDKTGGIQIASVPQPGHLPRLDSKRILIRRPVPWNYYQNVVTSSYSYVWWGWERWQREIDWMALQGINLP
  LAFTGQEAIWQKVFKRFNISREDLDDYFGGPAFLAWARMGNLHAWGGPLSKNWLRDQLLLQKQILSQMLKLGMTPVLPSFSGNVPSALRKIYPGANITRL
  DNWNTVDGDSRWCCTYLLNPSDPLFIDIGEAFIKQQIEEYGEVTNIYNCDTFNENTPPTSEPGYISSLGAAVYKAMSKGNKNAVWLMQGWLFSSDSTFWK
  PPQMKALLHSVPFGKMIVLDLYAEVKPIWNTSIQFYRTPYIWCMLHNFGGNIEMYGVLDSISSGPVDARVSKNSTMVGVGMCMEGIEQNPVVYELTSEMA
  FRDEKVNVQKWLKSYARRRYMKENHQIEAAWEILYHTIYNCTDGIADHNTDFIVKLPDWDPSCSVQDESKETESYMTSTAPYETKRRFLFQDNISGFPKA
  HLWYSTKEVIKALKLFLEAGDDLSRSLTYRYDMVDLTRQVLSKLANQVYIEAVTAFVKKDIGRLRRLSEKFLEIMKDMDVLLASDNNFLLGTWLESAKKL
  ARNDDERKQYEWNARTQVTMWYDSKDVNQSKLHDYGNKFWSGLLEDYYLPRATLYFSEMIKSLRDKKKFKIEKWRREWIMKSHKWQQSSSEIYGVKAKGD
  ALAISRHLL
Full Sequence
Protein Sequence     Length: 807     Download
MRSIKLVLLV LWIFSLHSQS FSKQHPTIEN LLDRLDSLRP TRSVQESAAK GLLQRLLPAH    60
FHSFDFRIIS KNVCDGSSCF LIENYDGTRR FGPEIVIKGT TGVEIASGLH WYLKYKCNAH    120
VSWDKTGGIQ IASVPQPGHL PRLDSKRILI RRPVPWNYYQ NVVTSSYSYV WWGWERWQRE    180
IDWMALQGIN LPLAFTGQEA IWQKVFKRFN ISREDLDDYF GGPAFLAWAR MGNLHAWGGP    240
LSKNWLRDQL LLQKQILSQM LKLGMTPVLP SFSGNVPSAL RKIYPGANIT RLDNWNTVDG    300
DSRWCCTYLL NPSDPLFIDI GEAFIKQQIE EYGEVTNIYN CDTFNENTPP TSEPGYISSL    360
GAAVYKAMSK GNKNAVWLMQ GWLFSSDSTF WKPPQMKALL HSVPFGKMIV LDLYAEVKPI    420
WNTSIQFYRT PYIWCMLHNF GGNIEMYGVL DSISSGPVDA RVSKNSTMVG VGMCMEGIEQ    480
NPVVYELTSE MAFRDEKVNV QKWLKSYARR RYMKENHQIE AAWEILYHTI YNCTDGIADH    540
NTDFIVKLPD WDPSCSVQDE SKETESYMTS TAPYETKRRF LFQDNISGFP KAHLWYSTKE    600
VIKALKLFLE AGDDLSRSLT YRYDMVDLTR QVLSKLANQV YIEAVTAFVK KDIGRLRRLS    660
EKFLEIMKDM DVLLASDNNF LLGTWLESAK KLARNDDERK QYEWNARTQV TMWYDSKDVN    720
QSKLHDYGNK FWSGLLEDYY LPRATLYFSE MIKSLRDKKK FKIEKWRREW IMKSHKWQQS    780
SSEIYGVKAK GDALAISRHL LCKYFP* 
Functional Domains Download unfiltered results here
Cdd IDDomainE-ValueStartEndLengthDomain Description
pfam12971NAGLU_N2.0e-244514399+
pfam12972NAGLU_C6.0e-106502804303+
pfam05089NAGLU0158497340+
Annotations - NR Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
EMBLCBI24942.10580568867unnamed protein product [Vitis vinifera]
GenBankEEE59081.104580556811hypothetical protein OsJ_10898 [Oryza sativa Japonica Group]
RefSeqNP_196873.1018061806alpha-N-acetylglucosaminidase family / NAGLU family [Arabidopsis thaliana]
RefSeqXP_002273084.1058053802PREDICTED: hypothetical protein [Vitis vinifera]
RefSeqXP_002314048.102780528805predicted protein [Populus trichocarpa]
Annotations - PDB Download unfiltered results here
SourceHit IDE-ValueQuery StartQuery EndHit StartHit EndDescription
PDB2vcc_A050743175816A Chain A, Solution Structure Of The C-Terminal Domain Ole E 9
PDB2vcb_A050743175816A Chain A, Solution Structure Of The C-Terminal Domain Ole E 9
PDB2vca_A050743175816A Chain A, Solution Structure Of The C-Terminal Domain Ole E 9
PDB2vc9_A050743175816A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens In Complex With 2-Acetamido-1,2-Dideoxynojirmycin
PDB4a4a_A050743198839A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In Complex With Its Substrate Glcnac-Alpha-1,4-Galactose
Signal Peptide
Cleavage Site
22
Hydropathy
EST Download unfiltered results here
HitLengthStartEndEValue
HO7834553243036260
HO7834551456357790
EG45830027212720
GT6276572712054750
GT6223562732785500
Orthologous Group
SpeciesID
Aquilegia coeruleaAquca_057_00157.1Aquca_039_00072.1Aquca_057_00157.2Aquca_057_00157.3Aquca_057_00157.4
Aquca_002_00330.4.155.497Aquca_002_00330.4.155.497Aquca_002_00330.2.100.442Aquca_002_00330.2.100.442Aquca_002_00330.1.155.497
Aquca_002_00330.1.155.497Aquca_002_00330.3.100.434Aquca_002_00330.3.100.434Aquca_002_00330.1.514.887Aquca_002_00330.1.514.887
Aquca_002_00330.3.451.826Aquca_002_00330.3.451.826Aquca_002_00330.2.459.834Aquca_002_00330.2.459.834Aquca_002_00330.4.514.785
Aquca_002_00330.4.514.785
Arabidopsis lyrata488189
Arabidopsis thalianaAT5G13690.1
Brachypodium distachyonBradi1g62007.1Bradi5g24207.1
Brassica rapaBra023429
Carica papayaevm.model.supercontig_125.32evm.model.supercontig_35.39evm.model.supercontig_35.44
Citrus clementinaCiclev10030724mCiclev10018883mCiclev10019020mCiclev10019066mCiclev10019065m
Citrus sinensisorange1.1g003545morange1.1g006843morange1.1g006829morange1.1g008173morange1.1g009062m
orange1.1g009057morange1.1g009049morange1.1g012032morange1.1g012026morange1.1g009153m.237.531
orange1.1g009153m.237.531
Cucumis sativusCucsa.128090.1Cucsa.197210.1
Eucalyptus grandisEucgr.B00338.1Eucgr.G03358.1Eucgr.G03358.2Eucgr.B00338.2
Fragaria vescamrna09491.1-v1.0-hybrid.106.845mrna29475.1-v1.0-hybrid
Glycine maxGlyma10g11720.3Glyma06g19791.1.94.437Glyma06g19791.1.94.437Glyma06g19791.1.438.763Glyma06g19791.1.438.763
Gossypium raimondiiGorai.004G170700.1Gorai.004G170700.3Gorai.004G170700.4Gorai.004G170700.2.346.780Gorai.004G170700.2.346.780
Gorai.001G078400.1.96.430Gorai.001G078400.1.96.430Gorai.004G170700.2.108.346Gorai.004G170700.2.108.346Gorai.001G078400.1.466.832
Gorai.001G078400.1.466.832
Linum usitatissimumLus10039598Lus10029494Lus10043468Lus10034116.106.408Lus10034116.106.408
Lus10034116.462.875Lus10034116.462.875
Malus domesticaMDP0000220242MDP0000138607MDP0000208532MDP0000203950MDP0000134637
Manihot esculentacassava4.1_001859mcassava4.1_003710mcassava4.1_022588mcassava4.1_012214m
Medicago truncatulaMedtr3g032980.2.357.804Medtr3g032980.2.357.804Medtr3g032980.1.357.829Medtr3g032980.1.357.829Medtr3g032980.3.95.333
Medtr3g032980.3.95.333Medtr3g032980.2.95.333Medtr3g032980.2.95.333Medtr3g032980.1.95.333Medtr3g032980.1.95.333
Medtr3g032980.3.357.529Medtr3g032980.3.357.529
Mimulus guttatusmgv1a001508mmgv1a018437m
Oryza sativaLOC_Os04g55730.1.99.440LOC_Os04g55730.1.99.440LOC_Os04g55730.1.441.772LOC_Os04g55730.1.441.772
Panicum virgatumPavirv00063217mPavirv00008468mPavirv00041670mPavirv00026596m.238.567Pavirv00026596m.238.567
Physcomitrella patensPp1s329_23V6.1
Phaseolus vulgarisPhvul.005G023300.1Phvul.005G023300.2Phvul.009G182100.1.92.435Phvul.009G182100.1.92.435Phvul.009G182100.2.253.577
Phvul.009G182100.2.253.577Phvul.009G182100.1.436.760Phvul.009G182100.1.436.760
Picea abiesMA_10437144g0010
Populus trichocarpaPotri.009G058100.1Potri.012G075900.1.100.435Potri.012G075900.1.100.435Potri.012G075900.1.436.751Potri.012G075900.1.436.751
Prunus persicappa001555mppa001642m
Ricinus communis30147.m01401129864.m001461
Setaria italicaSi034295mSi009392m.103.444Si009392m.103.444Si009392m.445.774Si009392m.445.774
Selaginella moellendorffii102402
Sorghum bicolorSb01g034960.1Sb06g030930.1
Thellungiella halophilaThhalv10012727m
Vitis viniferaGSVIVT01032165001GSVIVT01007826001.97.439GSVIVT01007826001.97.439GSVIVT01007826001.468.836GSVIVT01007826001.468.836
Sequence Alignments  (This image is cropped. Click for full image.)
Phylogeny  (This image is cropped. Click for full image.)