Basic Information | |
---|---|
Species | Carica papaya |
Cazyme ID | evm.model.supercontig_125.32 |
Family | GH89 |
Protein Properties | Length: 808 Molecular Weight: 92420.5 Isoelectric Point: 7.0799 |
Chromosome | Chromosome/Scaffold: 125 Start: 548748 End: 568741 |
Description | alpha-N-acetylglucosaminidase family / NAGLU family |
View CDS |
External Links |
---|
NCBI Taxonomy |
CAZyDB |
Signature Domain Download full data set without filtering | |||
---|---|---|---|
Family | Start | End | Evalue |
GH89 | 96 | 804 | 0 |
PQIMIKGTTAVEIASGLHWYLKYWCGAHVSWDKTGGAQIDSVPKPGSIPFVKDEGFIIQRPVPWNYYQNVVTSSYSYVWWDWQRWEKEIDWMALQGINLP LAFTGQEAIWQKVFKGFNISKEDLHDFFGGPAFLAWARMGNLHRWGGPLSQNWLDQQLLLQKKILSRMLELGMTPVLPSFSGNVPAALRNTFPSAKITRL GEWNTVDGDPRWCCTYLLDPSDPLFVEIGEAFIRQQIEEYGDVTDIYNCDTFNENKPPTNDPAYISSLGAAVYKAMSKGDKDAVWLMQGWLFYSDSEFWK PSQMKALLHSVPYGKLVVLDLFADVKPLWKISSQFYGTPYIWCLLHNFGGNIEMYGLLDSVSSGPVDSRISKNSTMVGVGMCMEGIEQNPVVYELMSEMA FRSEQVQVVEWLKAYAQRRYGKAVDQLQAAWEVLYHTVYNCTDGIADHNKDFIVQFPDWDPSLGSWSDNSKQSQMHKLPTLSGTRRFMFQQTDPNLPQAH LWYSTEKVIKALKLFLDAGNDLAGSLTYRYDLVDLTRQVLSKLSNQVYMDAVTAFRMKHVKAFNLHSQKFLQLIKDIEVLLASDDNFLLGTWLESAKKLT VNSWERKQYEWNARTQLTMWFDNTKTNQSRLHDYANKFWSGLLEDYYLPRASTYFTYLSESLTKNESFKLEEWRREWISFSNKWQAGNKLYPVKAKGDAL VISRALYKK |
Full Sequence |
---|
Protein Sequence Length: 808 Download |
MSTLASIFFI LILTQSLLLL SPLASSSPET IKGLLTRLDS QKSSASVQES AARGVLKRLL 60 PTHLNSFEIN IVSKDVCGGH SCFLIENYHS SSQSGPQIMI KGTTAVEIAS GLHWYLKYWC 120 GAHVSWDKTG GAQIDSVPKP GSIPFVKDEG FIIQRPVPWN YYQNVVTSSY SYVWWDWQRW 180 EKEIDWMALQ GINLPLAFTG QEAIWQKVFK GFNISKEDLH DFFGGPAFLA WARMGNLHRW 240 GGPLSQNWLD QQLLLQKKIL SRMLELGMTP VLPSFSGNVP AALRNTFPSA KITRLGEWNT 300 VDGDPRWCCT YLLDPSDPLF VEIGEAFIRQ QIEEYGDVTD IYNCDTFNEN KPPTNDPAYI 360 SSLGAAVYKA MSKGDKDAVW LMQGWLFYSD SEFWKPSQMK ALLHSVPYGK LVVLDLFADV 420 KPLWKISSQF YGTPYIWCLL HNFGGNIEMY GLLDSVSSGP VDSRISKNST MVGVGMCMEG 480 IEQNPVVYEL MSEMAFRSEQ VQVVEWLKAY AQRRYGKAVD QLQAAWEVLY HTVYNCTDGI 540 ADHNKDFIVQ FPDWDPSLGS WSDNSKQSQM HKLPTLSGTR RFMFQQTDPN LPQAHLWYST 600 EKVIKALKLF LDAGNDLAGS LTYRYDLVDL TRQVLSKLSN QVYMDAVTAF RMKHVKAFNL 660 HSQKFLQLIK DIEVLLASDD NFLLGTWLES AKKLTVNSWE RKQYEWNART QLTMWFDNTK 720 TNQSRLHDYA NKFWSGLLED YYLPRASTYF TYLSESLTKN ESFKLEEWRR EWISFSNKWQ 780 AGNKLYPVKA KGDALVISRA LYKKYFD* |
Functional Domains Download unfiltered results here | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam12971 | NAGLU_N | 4.0e-23 | 48 | 146 | 99 | + Alpha-N-acetylglucosaminidase (NAGLU) N-terminal domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This N-terminal domain has an alpha-beta fold. | ||
pfam12972 | NAGLU_C | 9.0e-104 | 505 | 805 | 301 | + Alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This C-terminal domain has an all alpha helical fold. | ||
pfam05089 | NAGLU | 4.0e-176 | 161 | 500 | 340 | + Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This central domain has a tim barrel fold. |
Annotations - NR Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
EMBL | CBI24942.1 | 0 | 6 | 807 | 67 | 868 | unnamed protein product [Vitis vinifera] |
GenBank | EEC75285.1 | 0 | 43 | 806 | 51 | 811 | hypothetical protein OsI_11626 [Oryza sativa Indica Group] |
RefSeq | NP_196873.1 | 0 | 21 | 806 | 18 | 805 | alpha-N-acetylglucosaminidase family / NAGLU family [Arabidopsis thaliana] |
RefSeq | XP_002273084.1 | 0 | 6 | 807 | 2 | 803 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002314048.1 | 0 | 21 | 806 | 19 | 805 | predicted protein [Populus trichocarpa] |
Annotations - PDB Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 2vcc_A | 0 | 67 | 795 | 192 | 866 | A Chain A, Structural Insights Into The Processivity Of Endopolygalacturonase I From Aspergillus Niger |
PDB | 2vcb_A | 0 | 67 | 795 | 192 | 866 | A Chain A, Structural Insights Into The Processivity Of Endopolygalacturonase I From Aspergillus Niger |
PDB | 2vca_A | 0 | 67 | 795 | 192 | 866 | A Chain A, Structural Insights Into The Processivity Of Endopolygalacturonase I From Aspergillus Niger |
PDB | 2vc9_A | 0 | 67 | 795 | 192 | 866 | A Chain A, Family 89 Glycoside Hydrolase From Clostridium Perfringens In Complex With 2-Acetamido-1,2-Dideoxynojirmycin |
PDB | 4a4a_A | 0 | 67 | 795 | 215 | 889 | A Chain A, Cpgh89 (E483q, E601q), From Clostridium Perfringens, In Complex With Its Substrate Glcnac-Alpha-1,4-Galactose |
EST Download unfiltered results here | ||||
---|---|---|---|---|
Hit | Length | Start | End | EValue |
HO783455 | 323 | 306 | 628 | 0 |
HO783455 | 148 | 637 | 784 | 0 |
GT622356 | 273 | 281 | 553 | 0 |
HO783128 | 426 | 347 | 772 | 0 |
HO783455 | 32 | 628 | 659 | 1.1 |
Sequence Alignments (This image is cropped. Click for full image.) |
---|