Basic Information | |
---|---|
Species | Linum usitatissimum |
Cazyme ID | Lus10034116 |
Family | GH89 |
Protein Properties | Length: 892 Molecular Weight: 101651 Isoelectric Point: 5.5301 |
Chromosome | Chromosome/Scaffold: 292 Start: 1191219 End: 1197052 |
Description | alpha-N-acetylglucosaminidase family / NAGLU family |
View CDS |
External Links |
---|
CAZyDB |
Signature Domain Download full data set without filtering | |||
---|---|---|---|
Family | Start | End | Evalue |
GH89 | 106 | 408 | 0 |
DIMIYGVTGVEIVAGLHWYLKYWCGAHISWEKTGGAQLNSVPRSGSLPRVHDDGVLVQRPVPWNYYQNAVSSSYTFAWWDWQRWEKEIDWMALHGINLPL AFTGQEAIWQKVFQKFNITKAGLDDFFGGPAFLAWSRMANLHGWGGPLPQSWLDKQLVMQKKILARMYELGMTPVLPAFSGNVPAALIELFPSAKITRLG NWFSVESNPRWCCTYLLDATDPLFIEIGKAFIEEQLKEYGRTSHIYNCDTFDENTPPVDDPEYVSSLGAATFKGMQAGDKDAIWLMQGWLFAYDDFWKPP QMK | |||
GH89 | 462 | 875 | 0 |
HALLHSVPLGRLVVLDLYAEVKPIWSASEQFYGVPYIWCMLHNFAGNVEMYGVLDSVASGPVEARLISLVFQLTAFYYLPTLKVGVGMSMEGIEQNPIVY DLMSEMAFQHNKVDVKAWIDLYATRRYGQPVPLIQDAWNVLYHTVYNCTDGAYDKNRDVIVAFPDVDPSLISTPLEKYLDDAKPALRISILQQGSGLYEQ PHLWYSTLEVVHALKLFISSGGDLSGSNTFRYDLVDLTRQALAKYANALFLKITKAYESKDVNGVAEQSRKFVELVEDMDSLLSCHEGFLLGPWLESAKQ LAEDEEQEKQFEWNARTQITMWYDNTEEEASLLRDYGNKYWSGLVRDYYGQRAAIYFKYLLESLENDHGFRLKEWRREWIKLTNQCNQWQTSRKKFPVAS NGDALLLSTRLYEK |
Full Sequence |
---|
Protein Sequence Length: 892 Download |
MASPTPPPPL LFLLTVFFLL LSCILPLSRL ADAIGVDSIS RLLEIQDRER ASPSLQVAAA 60 RGVLHRLLPS HTSSFEFRIV SEEKCGGKSC FIISNHPYSA RHGAPDIMIY GVTGVEIVAG 120 LHWYLKYWCG AHISWEKTGG AQLNSVPRSG SLPRVHDDGV LVQRPVPWNY YQNAVSSSYT 180 FAWWDWQRWE KEIDWMALHG INLPLAFTGQ EAIWQKVFQK FNITKAGLDD FFGGPAFLAW 240 SRMANLHGWG GPLPQSWLDK QLVMQKKILA RMYELGMTPV LPAFSGNVPA ALIELFPSAK 300 ITRLGNWFSV ESNPRWCCTY LLDATDPLFI EIGKAFIEEQ LKEYGRTSHI YNCDTFDENT 360 PPVDDPEYVS SLGAATFKGM QAGDKDAIWL MQGWLFAYDD FWKPPQMKVV PIMSGFCVNV 420 LKRLWIDLRN SSRNFCSLDC EPMNKEIVGD RVAHFCIYID FHALLHSVPL GRLVVLDLYA 480 EVKPIWSASE QFYGVPYIWC MLHNFAGNVE MYGVLDSVAS GPVEARLISL VFQLTAFYYL 540 PTLKVGVGMS MEGIEQNPIV YDLMSEMAFQ HNKVDVKAWI DLYATRRYGQ PVPLIQDAWN 600 VLYHTVYNCT DGAYDKNRDV IVAFPDVDPS LISTPLEKYL DDAKPALRIS ILQQGSGLYE 660 QPHLWYSTLE VVHALKLFIS SGGDLSGSNT FRYDLVDLTR QALAKYANAL FLKITKAYES 720 KDVNGVAEQS RKFVELVEDM DSLLSCHEGF LLGPWLESAK QLAEDEEQEK QFEWNARTQI 780 TMWYDNTEEE ASLLRDYGNK YWSGLVRDYY GQRAAIYFKY LLESLENDHG FRLKEWRREW 840 IKLTNQCNQW QTSRKKFPVA SNGDALLLST RLYEKYLKDS SAGNAHYYDD E* 900 |
Functional Domains Download unfiltered results here | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam12971 | NAGLU_N | 5.0e-20 | 56 | 155 | 100 | + Alpha-N-acetylglucosaminidase (NAGLU) N-terminal domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This N-terminal domain has an alpha-beta fold. | ||
pfam12972 | NAGLU_C | 7.0e-93 | 578 | 876 | 299 | + Alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This C-terminal domain has an all alpha helical fold. | ||
pfam05089 | NAGLU | 1.0e-161 | 170 | 573 | 404 | + Alpha-N-acetylglucosaminidase (NAGLU) tim-barrel domain. Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. The structure shows that the enzyme is composed of three domains. This central domain has a tim barrel fold. |
Annotations - NR Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
EMBL | CBI15090.1 | 0 | 31 | 879 | 23 | 840 | unnamed protein product [Vitis vinifera] |
GenBank | EEC78143.1 | 0 | 39 | 880 | 33 | 810 | hypothetical protein OsI_17702 [Oryza sativa Indica Group] |
RefSeq | XP_002280399.1 | 0 | 31 | 879 | 23 | 807 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002318632.1 | 0 | 31 | 889 | 26 | 811 | predicted protein [Populus trichocarpa] |
RefSeq | XP_002511461.1 | 0 | 10 | 886 | 4 | 809 | alpha-n-acetylglucosaminidase, putative [Ricinus communis] |
Annotations - PDB Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 2vcc_A | 4.00001e-41 | 75 | 394 | 192 | 492 | A Chain A, The Structure Of A Protein In Glycosyl Transferase Family 8 From Anaerococcus Prevotii. |
PDB | 2vcc_A | 1e-38 | 472 | 813 | 509 | 816 | A Chain A, The Structure Of A Protein In Glycosyl Transferase Family 8 From Anaerococcus Prevotii. |
PDB | 2vcb_A | 4.00001e-41 | 75 | 394 | 192 | 492 | A Chain A, The Structure Of A Protein In Glycosyl Transferase Family 8 From Anaerococcus Prevotii. |
PDB | 2vcb_A | 1e-38 | 472 | 813 | 509 | 816 | A Chain A, The Structure Of A Protein In Glycosyl Transferase Family 8 From Anaerococcus Prevotii. |
PDB | 2vca_A | 4.00001e-41 | 75 | 394 | 192 | 492 | A Chain A, The Structure Of A Protein In Glycosyl Transferase Family 8 From Anaerococcus Prevotii. |