Basic Information | |
---|---|
Species | Gossypium raimondii |
Cazyme ID | Gorai.001G078400.1 |
Family | GH89 |
Protein Properties | Length: 839 Molecular Weight: 95842.6 Isoelectric Point: 5.9208 |
Chromosome | Chromosome/Scaffold: 01 Start: 8106500 End: 8117642 |
Description | alpha-N-acetylglucosaminidase family / NAGLU family |
View CDS |
External Links |
---|
NCBI Taxonomy |
CAZyDB |
Signature Domain Download full data set without filtering | |||
---|---|---|---|
Family | Start | End | Evalue |
GH89 | 96 | 430 | 0 |
PEILISGVTGVEVLAGLHWYLKYWCGSHISWQKTGGAQLFSIPPLGSLPPVQDGGILVQRPIPWNYYQNAVTSSYTFAWWDWERWEQEIDWMALQGINLP LAFTGQETIWRKVFQKFNITNSDLDDFFGGPAFLAWSRMGNLHGWGGPLPESWFDGQLTLQKKILARMYELGMTPVLPAFSGNVPAAFKDMFPSAKITRL GNWFSVKRNPKWCCTYLLDATDPLFIEIGRTFIEEQLKGVDTFDENTPPVDDPGYISSLGAAIFSGMQSGDDNAMWLMQGWLFSYDPFWRPPQMKALLHS VPLGKLVVLDLYAEVKPIWISSEQFYGVPYIWKAL | |||
GH89 | 466 | 832 | 0 |
EVWNLCMLHNFAGNIEMYGILDAIASGPIEALTSENSTMVGIGMSMEGIEQNPIVYDLMSEMAFQHKKVDVKAWIELYIARRYGRSSPLIRDAWNILYHT IYNCTDGAYDKNRDVIVAFPDVNPSLISSPLEMYPHNGKPTSRRAVQREKTNAYEQPHLWYSTSEVIKALELFIASGNELSASNTYSYDLVDLTRQSLAK YANELFLKIIDAYKFKDVDKVTSLSQKFLDLVEDMDTLLACHDGFLLGPWLESAKQLAQNEEEEKQFEWNARTQITMWFDNTEEEASLLRDYGNKYWSGL LRDYYGPRAAIYFKILIESVENGEDFKLHKWRREWIKLTNNWQSSRKLFPVKSSGNAVSISRSLYNK |
Full Sequence |
---|
Protein Sequence Length: 839 Download |
MDSSLAAISL LLFSLFSMVY SSPIGVQYIS KLLQIQERER ALPSFQVAAA RGVLHRLLPS 60 HSSAFEFRII SKEKCGGEGA CFIINNHPSS YKSGAPEILI SGVTGVEVLA GLHWYLKYWC 120 GSHISWQKTG GAQLFSIPPL GSLPPVQDGG ILVQRPIPWN YYQNAVTSSY TFAWWDWERW 180 EQEIDWMALQ GINLPLAFTG QETIWRKVFQ KFNITNSDLD DFFGGPAFLA WSRMGNLHGW 240 GGPLPESWFD GQLTLQKKIL ARMYELGMTP VLPAFSGNVP AAFKDMFPSA KITRLGNWFS 300 VKRNPKWCCT YLLDATDPLF IEIGRTFIEE QLKGVDTFDE NTPPVDDPGY ISSLGAAIFS 360 GMQSGDDNAM WLMQGWLFSY DPFWRPPQMK ALLHSVPLGK LVVLDLYAEV KPIWISSEQF 420 YGVPYIWKAL LERDLDLFRV FDRNFAYFIH PLKYVSVAYL KASKMEVWNL CMLHNFAGNI 480 EMYGILDAIA SGPIEALTSE NSTMVGIGMS MEGIEQNPIV YDLMSEMAFQ HKKVDVKAWI 540 ELYIARRYGR SSPLIRDAWN ILYHTIYNCT DGAYDKNRDV IVAFPDVNPS LISSPLEMYP 600 HNGKPTSRRA VQREKTNAYE QPHLWYSTSE VIKALELFIA SGNELSASNT YSYDLVDLTR 660 QSLAKYANEL FLKIIDAYKF KDVDKVTSLS QKFLDLVEDM DTLLACHDGF LLGPWLESAK 720 QLAQNEEEEK QFEWNARTQI TMWFDNTEEE ASLLRDYGNK YWSGLLRDYY GPRAAIYFKI 780 LIESVENGED FKLHKWRREW IKLTNNWQSS RKLFPVKSSG NAVSISRSLY NKYLRSES* 840 |
Functional Domains Download unfiltered results here | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam12971 | NAGLU_N | 9.0e-20 | 46 | 146 | 101 | + Alpha-N-acetylglucosaminidase (NAGLU) N-terminal domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This N-terminal domain has an alpha-beta fold. | ||
pfam12972 | NAGLU_C | 8.0e-100 | 538 | 833 | 296 | + Alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This C-terminal domain has an all alpha helical fold. | ||
pfam05089 | NAGLU | 8.0e-169 | 161 | 531 | 379 | + Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This central domain has a tim barrel fold. |
Annotations - NR Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
EMBL | CBI15090.1 | 0 | 21 | 835 | 23 | 839 | unnamed protein product [Vitis vinifera] |
GenBank | EEC78143.1 | 0 | 28 | 835 | 32 | 808 | hypothetical protein OsI_17702 [Oryza sativa Indica Group] |
RefSeq | XP_002280399.1 | 0 | 21 | 835 | 23 | 806 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002318632.1 | 0 | 21 | 834 | 26 | 804 | predicted protein [Populus trichocarpa] |
RefSeq | XP_002511461.1 | 0 | 1 | 836 | 1 | 802 | alpha-n-acetylglucosaminidase, putative [Ricinus communis] |
Annotations - PDB Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 2vcc_A | 0 | 93 | 819 | 209 | 862 | A Chain A, Structure Of Ugt78g1 Complexed With Myricetin And Udp |
PDB | 2vcb_A | 0 | 93 | 819 | 209 | 862 | A Chain A, Structure Of Ugt78g1 Complexed With Myricetin And Udp |
PDB | 2vca_A | 0 | 93 | 819 | 209 | 862 | A Chain A, Structure Of Ugt78g1 Complexed With Myricetin And Udp |
PDB | 2vc9_A | 0 | 93 | 819 | 209 | 862 | A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens In Complex With 2-Acetamido-1,2-Dideoxynojirmycin |
PDB | 4a4a_A | 0 | 93 | 819 | 232 | 885 | A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In Complex With Its Substrate Glcnac-Alpha-1,4-Galactose |