Basic Information | |
---|---|
Species | Vitis vinifera |
Cazyme ID | GSVIVT01030073001 |
Family | CBM43 |
Protein Properties | Length: 874; Molecular Weight: 91323.7; Isoelectric Point: 8.3211 |
Chromosome | Chromosome/Scaffold: 12; Start: 9587814; End: 9604039 |
Description | O-Glycosyl hydrolases family 17 protein |
Signature Domain | |||
---|---|---|---|
Family | Start | End | Evalue |
CBM43 | 368 | 447 | 1.5e-23 |
WCVVNNNRDLSNATASASEACSVADCTALSPGSSCFNISWPASISYSFNSYYQQHNQQAASCDFGGLGLITTVDPSMEKC | |||
GH17 | 23 | 347 | 0 |
IGLNWGTAASHPLPPPRVVELLKNNNIARVKLFDADPLVLQALSGSKIAVTVGIPNSMLRSLNSSKKAAESWVHDNVTRYVSSSGRGSGVRIEYVAVGDEPFLQSYGDQFHPFVIGAATNIQTALIRANLASEVKVVVPFSSDTIQSESNLPSKGHFRSDLNKTMSHLLTFLNKHHSPFFVNISPFLSLHQNKNISLDFSIFKETAHPHSDSHRTYKNSFDLIYDTVVTALSTVGYPEMDIVVGQIGWPTDGAANATSSVAETFMKGLIRHLQSKSGTPLRPRVPPTETYIFSLLDEDQRSIAAGNFERHWGLFTFDGQAKYHVD | |||
Full Sequence |
---|
Protein Sequence Length: 874 |
MPPNLCLILL LCISSSTTRA TAIGLNWGTA ASHPLPPPRV VELLKNNNIA RVKLFDADPL 60 VLQALSGSKI AVTVGIPNSM LRSLNSSKKA AESWVHDNVT RYVSSSGRGS GVRIEYVAVG 120 DEPFLQSYGD QFHPFVIGAA TNIQTALIRA NLASEVKVVV PFSSDTIQSE SNLPSKGHFR 180 SDLNKTMSHL LTFLNKHHSP FFVNISPFLS LHQNKNISLD FSIFKETAHP HSDSHRTYKN 240 SFDLIYDTVV TALSTVGYPE MDIVVGQIGW PTDGAANATS SVAETFMKGL IRHLQSKSGT 300 PLRPRVPPTE TYIFSLLDED QRSIAAGNFE RHWGLFTFDG QAKYHVDLGQ GSRNLVNAQN 360 VNYLPSRWCV VNNNRDLSNA TASASEACSV ADCTALSPGS SCFNISWPAS ISYSFNSYYQ 420 QHNQQAASCD FGGLGLITTV DPSMEKCRFS IQLHTSHSSS LHPLHLLHQM LLAAATILSG 480 LAEAFISQTL GPFSLYFSLE KQALRKTLEN MSGVYGHDGD ESGQPPSSGG YGGGGGYGGG 540 GYGGGGGGGG YGGNSGGGGG GRGGGYGGGG GRGGGGGGYG GNSQNRGGGG GGGYQGGDRG 600 GRGGGGGGRG GGRGGSGRDG DWLCPNPSCG NLNFARRVEC NKCGAPSPAG AGSDRGGGGG 660 YNRGGSGGGF GGNRGGRGGN YEGGSGGNRG GNYEGEAVPP PTSYTGGPTS TTEAPVKVKQ 720 CDENCGDSCD NSRIYISNLP PDVTVDELRE LFGGIGQVGR IKQKRGYKDQ WPWNIKIYTD 780 EGGNNKGDAC LSYEDPSAAH SAGGFYNNYE MRGYKINVAM AEKSAPRAPP AFGHGGGGRG 840 GYGGGDRRRD NYRDNAGSGP DRHHHGGNRS RPY* 900 |
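The domain coordinates in this record (for example, GH17 at 23 to 347) appear to be 1-based and inclusive, while the sequence above is printed in blocks of ten residues with running position counters. The following is a minimal sketch of recovering a plain sequence and slicing it by such coordinates. The helper names `clean_sequence` and `slice_domain` are hypothetical, and the example uses only the first 60 residues shown above.

```python
import re


def clean_sequence(block: str) -> str:
    """Drop spaces, position counters, and the trailing '*' stop marker."""
    # Assumption: every letter in the block is a residue code.
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


# First row of the formatted sequence from this record.
first_row = (
    "MPPNLCLILL LCISSSTTRA TAIGLNWGTA ASHPLPPPRV "
    "VELLKNNNIA RVKLFDADPL 60"
)
seq = clean_sequence(first_row)

print(len(seq))                   # 60
print(slice_domain(seq, 23, 32))  # IGLNWGTAAS
```

The ten residues returned for positions 23 to 32 match the start of the GH17 signature domain sequence listed earlier, which is consistent with the 1-based inclusive reading of the coordinates.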
Functional Domains | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam07983 | X8 | 8.0e-14 | 367 | 435 | 79 | X8 domain. The X8 domain contains at least 6 conserved cysteine residues that presumably form three disulphide bridges. The domain is found in an olive pollen allergen as well as at the C-terminus of several families of glycosyl hydrolases. This domain may be involved in carbohydrate binding. This domain is characteristic of GPI-anchored domains. | ||
cd12534 | RRM_SARFH | 3.0e-15 | 734 | 821 | 89 | RNA recognition motif in Drosophila melanogaster RNA-binding protein cabeza and similar proteins. This subgroup corresponds to the RRM in cabeza, also termed P19, or sarcoma-associated RNA-binding fly homolog (SARFH). It is a putative homolog of human RNA-binding proteins FUS (also termed TLS or Pigpen or hnRNP P2), EWS (also termed EWSR1), TAF15 (also termed hTAFII68 or TAF2N or RPB56), and belongs to the FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA- and DNA-binding proteins whose expression is altered in cancer. It is a nuclear RNA binding protein that may play an important role in the regulation of RNA metabolism during fly development. Cabeza contains one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). | ||
smart00768 | X8 | 2.0e-27 | 368 | 449 | 84 | Possibly involved in carbohydrate binding. The X8 domain, which may be involved in carbohydrate binding, is found in an olive pollen antigen as well as at the C terminus of family 17 glycosyl hydrolases. It contains 6 conserved cysteine residues which presumably form three disulfide bridges. | ||
cd12280 | RRM_FET | 5.0e-37 | 734 | 820 | 87 | RNA recognition motif in the FET family of RNA-binding proteins. This subfamily corresponds to the RRM of FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA-binding proteins. This ubiquitously expressed family of similarly structured proteins, predominantly localizing to the nucleus, includes FUS (also known as TLS or Pigpen or hnRNP P2), EWS (also known as EWSR1), TAF15 (also known as hTAFII68 or TAF2N or RPB56), and Drosophila Cabeza (also known as SARFH). The corresponding coding genes of these proteins are involved in deleterious genomic rearrangements with transcription factor genes in a variety of human sarcomas and acute leukemias. All FET proteins interact with each other and are therefore likely to be part of the very same protein complexes, which suggests a general bridging role for FET proteins coupling RNA transcription, processing, transport, and DNA repair. The FET proteins contain multiple copies of a degenerate hexapeptide repeat motif at the N-terminus. The C-terminal region consists of a conserved nuclear import and retention signal (C-NLS), a putative zinc-finger domain, and a conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), which is flanked by 3 arginine-glycine-glycine (RGG) boxes. FUS and EWS might have similar sequence specificity; both bind preferentially to GGUG-containing RNAs. FUS has also been shown to bind strongly to human telomeric RNA and to small low-copy-number RNAs tethered to the promoter of cyclin D1. To date, nothing is known about the RNA binding specificity of TAF15. | ||
pfam00332 | Glyco_hydro_17 | 1.0e-62 | 23 | 347 | 326 | Glycosyl hydrolases family 17. | ||
Gene Ontology | |
---|---|
GO Term | Description |
GO:0003676 | nucleic acid binding |
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds |
GO:0005622 | intracellular |
GO:0005975 | carbohydrate metabolic process |
GO:0008270 | zinc ion binding |
Annotations - NR | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
GenBank | ABF95444.1 | 0 | 23 | 472 | 35 | 479 | Glycosyl hydrolases family 17 protein, expressed [Oryza sativa (japonica cultivar-group)] |
EMBL | CBI28434.1 | 0 | 1 | 873 | 1 | 873 | unnamed protein product [Vitis vinifera] |
RefSeq | NP_200656.2 | 0 | 11 | 463 | 16 | 458 | glycosyl hydrolase family 17 protein [Arabidopsis thaliana] |
RefSeq | XP_002269108.1 | 0 | 21 | 478 | 24 | 481 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002313970.1 | 0 | 14 | 478 | 13 | 476 | predicted protein [Populus trichocarpa] |
Annotations - PDB | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 3f55_D | 4.2039e-45 | 23 | 349 | 2 | 316 | A Chain A, Crystal Structure Of Human Microsomal P450 2a6 L240cN297Q |
PDB | 3f55_C | 4.2039e-45 | 23 | 349 | 2 | 316 | A Chain A, Crystal Structure Of Human Microsomal P450 2a6 L240cN297Q |
PDB | 3f55_B | 4.2039e-45 | 23 | 349 | 2 | 316 | A Chain A, Crystal Structure Of Human Microsomal P450 2a6 L240cN297Q |
PDB | 3f55_A | 4.2039e-45 | 23 | 349 | 2 | 316 | A Chain A, Crystal Structure Of Human Microsomal P450 2a6 L240cN297Q |
PDB | 3em5_D | 4.2039e-45 | 23 | 349 | 2 | 316 | A Chain A, Crystal Structure Of A Native Endo Beta-1,3-Glucanase (Hev B 2), A Major Allergen From Hevea Brasiliensis |