Basic Information | |
---|---|
Species | Fragaria vesca |
Cazyme ID | mrna28204.1-v1.0-hybrid |
Family | CBM43 |
Protein Properties | Length: 881 Molecular Weight: 92786.8 Isoelectric Point: 6.368 |
Chromosome | Chromosome/Scaffold: 3 Start: 20692192 End: 20697633 |
Description | O-Glycosyl hydrolases family 17 protein |
View CDS |
Signature Domain Download full data set without filtering | |||
---|---|---|---|
Family | Start | End | Evalue |
CBM43 | 351 | 430 | 3.1e-24 |
WCVVNNNRDLSNATANALQACSLADCSALSPGGSCSNISWPGNISYAFNSYYQQHNQSADSCDFGGLGLITTVNPSIDNC | |||
GH17 | 8 | 331 | 0 |
VGVNWGTTASHPLPPAKVVQLMKSNNITRVKLFDADPLVLEALSGSNLGVTMGIPNGLLRSLNSSKKAAQTWVHDNVTRYVSSGGSRVKIEYVAVGDEPF LQSYGQQFYPFVLGAAINIQTALTQANLDNKVKVVVPCSFDSFLSETSLPSKGHFRADVNRTMIQLLKFLSKHNSPFFATISPFLAFQQNKNISLDFTLF RVYTKARNDSRRMYKNSFDLNYDILVNALSTVGFPRIGIVVSQIGWPTDGGPNATSYAAETFMKGLMNRLRSKLGTPLRPRNPPIETYIFSLLDEDQRSI STGNFERHWGIFTFDGQAKYRFDF |
Full Sequence |
---|
Protein Sequence Length: 881 Download |
MSYGSVAVGV NWGTTASHPL PPAKVVQLMK SNNITRVKLF DADPLVLEAL SGSNLGVTMG 60 IPNGLLRSLN SSKKAAQTWV HDNVTRYVSS GGSRVKIEYV AVGDEPFLQS YGQQFYPFVL 120 GAAINIQTAL TQANLDNKVK VVVPCSFDSF LSETSLPSKG HFRADVNRTM IQLLKFLSKH 180 NSPFFATISP FLAFQQNKNI SLDFTLFRVY TKARNDSRRM YKNSFDLNYD ILVNALSTVG 240 FPRIGIVVSQ IGWPTDGGPN ATSYAAETFM KGLMNRLRSK LGTPLRPRNP PIETYIFSLL 300 DEDQRSISTG NFERHWGIFT FDGQAKYRFD FTQGSNSLVN AQDVEYLPSR WCVVNNNRDL 360 SNATANALQA CSLADCSALS PGGSCSNISW PGNISYAFNS YYQQHNQSAD SCDFGGLGLI 420 TTVNPSIDNC RTAAPPRPMD LAVAVATVAL EDTEAVPEVM EEAVDMEVKV VMAVVTVVEA 480 VVVEVEEDMV AVAVEVEDIR AIVVVAAEEV AAAAEVAAVA VEGMVIGLAL IQGYCGNMNF 540 ARRTECNKCG TPSPAGGGGG GGDRGGGGYR GGSGGGYGDS RGGRGGNYDG GRSGNYEGGK 600 GGSYDGGRGG GFDSRGGGGS RGGSYGGSQG REDGGYGQAP AAPPSYGAGG SYQPSYNASY 660 GTDAVPPPTS YTGGPASYPP SYGGPAGGYG GDGSGDARSG GRGGPPAKYD GGYGSGGGRG 720 GYGSAPAEAP AKVKQCDENC DDTCDNARIY ISNLPPDVTV EELQALFGGI GQVGRIKQKR 780 GYKDQWPYNI KIYTDDSGKN KGDACLAYED PNAAHSAGSF YNGHDVRGYK ISVAMAERSA 840 PRTYDNGGGR GGYGGGDRRR DNRDGGPDRH QHGGNRSRPY * |
Functional Domains Download unfiltered results here | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam07983 | X8 | 1.0e-15 | 350 | 418 | 79 | + X8 domain. The X8 domain domain contains at least 6 conserved cysteine residues that presumably form three disulphide bridges. The domain is found in an Olive pollen allergen as well as at the C-terminus of several families of glycosyl hydrolases. This domain may be involved in carbohydrate binding. This domain is characteristic of GPI-anchored domains. | ||
cd12534 | RRM_SARFH | 4.0e-17 | 749 | 836 | 89 | + RNA recognition motif in Drosophila melanogaster RNA-binding protein cabeza and similar proteins. This subgroup corresponds to the RRM in cabeza, also termed P19, or sarcoma-associated RNA-binding fly homolog (SARFH). It is a putative homolog of human RNA-binding proteins FUS (also termed TLS or Pigpen or hnRNP P2), EWS (also termed EWSR1), TAF15 (also termed hTAFII68 or TAF2N or RPB56), and belongs to the of the FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA- and DNA-binding proteins whose expression is altered in cancer. It is a nuclear RNA binding protein that may play an important role in the regulation of RNA metabolism during fly development. Cabeza contains one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). | ||
smart00768 | X8 | 1.0e-27 | 351 | 431 | 83 | + Possibly involved in carbohydrate binding. The X8 domain, which may be involved in carbohydrate binding, is found in an Olive pollen antigen as well as at the C terminus of family 17 glycosyl hydrolases. It contains 6 conserved cysteine residues which presumably form three disulfide bridges. | ||
cd12280 | RRM_FET | 2.0e-34 | 749 | 835 | 87 | + RNA recognition motif in the FET family of RNA-binding proteins. This subfamily corresponds to the RRM of FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA-binding proteins. This ubiquitously expressed family of similarly structured proteins predominantly localizing to the nuclear, includes FUS (also known as TLS or Pigpen or hnRNP P2), EWS (also known as EWSR1), TAF15 (also known as hTAFII68 or TAF2N or RPB56), and Drosophila Cabeza (also known as SARFH). The corresponding coding genes of these proteins are involved in deleterious genomic rearrangements with transcription factor genes in a variety of human sarcomas and acute leukemias. All FET proteins interact with each other and are therefore likely to be part of the very same protein complexes, which suggests a general bridging role for FET proteins coupling RNA transcription, processing, transport, and DNA repair. The FET proteins contain multiple copies of a degenerate hexapeptide repeat motif at the N-terminus. The C-terminal region consists of a conserved nuclear import and retention signal (C-NLS), a putative zinc-finger domain, and a conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), which is flanked by 3 arginine-glycine-glycine (RGG) boxes. FUS and EWS might have similar sequence specificity; both bind preferentially to GGUG-containing RNAs. FUS has also been shown to bind strongly to human telomeric RNA and to small low-copy-number RNAs tethered to the promoter of cyclin D1. To date, nothing is known about the RNA binding specificity of TAF15. | ||
pfam00332 | Glyco_hydro_17 | 9.0e-69 | 8 | 331 | 325 | + Glycosyl hydrolases family 17. |
Gene Ontology | |
---|---|
GO Term | Description |
GO:0003676 | nucleic acid binding |
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds |
GO:0005622 | intracellular |
GO:0005975 | carbohydrate metabolic process |
GO:0008270 | zinc ion binding |
Annotations - PDB Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 3f55_D | 0 | 8 | 332 | 2 | 316 | A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes Takasagoensis, At Ph 2.5. |
PDB | 3f55_C | 0 | 8 | 332 | 2 | 316 | A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes Takasagoensis, At Ph 2.5. |
PDB | 3f55_B | 0 | 8 | 332 | 2 | 316 | A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes Takasagoensis, At Ph 2.5. |
PDB | 3f55_A | 0 | 8 | 332 | 2 | 316 | A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes Takasagoensis, At Ph 2.5. |
PDB | 3em5_D | 0 | 8 | 332 | 2 | 316 | A Chain A, Crystal Structure Of A Native Endo Beta-1,3-Glucanase (Hev B 2), A Major Allergen From Hevea Brasiliensis |
Sequence Alignments (This image is cropped. Click for full image.) |
---|