Basic Information | |
---|---|
Species | Thellungiella halophila |
Cazyme ID | Thhalv10012621m |
Family | CBM43 |
Protein Properties | Length: 905 Molecular Weight: 94912.6 Isoelectric Point: 8.6133 |
Chromosome | Chromosome/Scaffold: 2 Start: 8862058 End: 8867295 |
Description | O-Glycosyl hydrolases family 17 protein |
View CDS |
External Links |
---|
CAZyDB |
Signature Domain Download full data set without filtering | |||
---|---|---|---|
Family | Start | End | Evalue |
CBM43 | 365 | 444 | 2.6e-22 |
WCVVNNNMDLSNASAKALEACSVADCTSVLPGGSCSGIRWPGNVSYAFNSLYQQNDHRADSCDFGGLGLITTVDPSEDNC | |||
GH17 | 26 | 345 | 0 |
VGVNWGTDASHPLPPSKVVELLKSNGIAKVKLFDADPKVLGALSGSNIGVTVGIPNSMIKSLNASRKAAESWVHDNVTRYFDGGNRVRIEYVAVGDEPFL QSYGNQYRPFVIGAAMNIQNALAKASLASEVKVVVPSSFDSFISESGRPSSGHFRADLNKTMIELLSFLTKHHSPFFVTLSPFLSFHQNKNFSLDFSLFK ESAQAHKDGRKNYRNSFDLSYDTLVSALSSIGFPGMDIVVSKIGWPTDGAANATSPTSEAFLKGLMGHLEKKTGSMLRPPVETYIESLLDEDQRSVSFGN FERHWGVFTFDGQAKYSFSF |
Full Sequence |
---|
Protein Sequence Length: 905 Download |
MARRLFLLLL AVTAELSLTR FTAGAVGVNW GTDASHPLPP SKVVELLKSN GIAKVKLFDA 60 DPKVLGALSG SNIGVTVGIP NSMIKSLNAS RKAAESWVHD NVTRYFDGGN RVRIEYVAVG 120 DEPFLQSYGN QYRPFVIGAA MNIQNALAKA SLASEVKVVV PSSFDSFISE SGRPSSGHFR 180 ADLNKTMIEL LSFLTKHHSP FFVTLSPFLS FHQNKNFSLD FSLFKESAQA HKDGRKNYRN 240 SFDLSYDTLV SALSSIGFPG MDIVVSKIGW PTDGAANATS PTSEAFLKGL MGHLEKKTGS 300 MLRPPVETYI ESLLDEDQRS VSFGNFERHW GVFTFDGQAK YSFSFNHNSK KLVNAQNVQY 360 LSPKWCVVNN NMDLSNASAK ALEACSVADC TSVLPGGSCS GIRWPGNVSY AFNSLYQQND 420 HRADSCDFGG LGLITTVDPS EDNCRFWIQL DTSHSSSHIP IFFQCCMIKR QIQLFWKLVM 480 AGMYNQDGGG GAPLPSYGGD GYGGGGGYGG GDAGYGGRGS SGGGGYGGRG GYGGGGGRGN 540 RGGGGYQGGG GGGRGGGGRD GGGRDGDWRC PNPSCGNVNF ARRVECNKCG APAPSGTGDR 600 GGGGYSRGGG ASDRGGGRGG RNDSGRSYES SRYDGGSRSG GGSYGSNSQH RDNGSYGQAP 660 PPPPAAAIPS YDGSGSYPPP SGYGMEAVPP PSSYAGGPPS YGGPTGGYGG DAPSIGGRGG 720 RGGGYDGGNA PRRQEASYGE EGAEKVKQCD ADCDDTCDNT RIYITNLPLD VTTDELKDLF 780 GGIGQVGRQK QKRGYKDQWP YNIKIYTDEK GKNKGDACLT YEDPSAAHSA GGFFNNYEMR 840 GHKIGVSMAA KSAPRAPAFD QRGGGGRGGG GGGGGGYGGG DRRRDNYGSG PDRNYHGGNR 900 SRPY* |
Functional Domains Download unfiltered results here | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
cd12281 | RRM1_TatSF1_like | 1.0e-13 | 759 | 849 | 94 | + RNA recognition motif 1 in HIV Tat-specific factor 1 (Tat-SF1) and similar proteins. This subfamily corresponds to the RRM1 of Tat-SF1 and CUS2. Tat-SF1 is the cofactor for stimulation of transcriptional elongation by human immunodeficiency virus-type 1 (HIV-1) Tat. It is a substrate of an associated cellular kinase. Tat-SF1 contains two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a highly acidic carboxyl-terminal half. The family also includes CUS2, a yeast homolog of human Tat-SF1. CUS2 interacts with U2 RNA in splicing extracts and functions as a splicing factor that aids assembly of the splicing-competent U2 snRNP in vivo. CUS2 also associates with PRP11 that is a subunit of the conserved splicing factor SF3a. Like Tat-SF1, CUS2 contains two RRMs as well. | ||
cd12534 | RRM_SARFH | 5.0e-17 | 762 | 849 | 89 | + RNA recognition motif in Drosophila melanogaster RNA-binding protein cabeza and similar proteins. This subgroup corresponds to the RRM in cabeza, also termed P19, or sarcoma-associated RNA-binding fly homolog (SARFH). It is a putative homolog of human RNA-binding proteins FUS (also termed TLS or Pigpen or hnRNP P2), EWS (also termed EWSR1), TAF15 (also termed hTAFII68 or TAF2N or RPB56), and belongs to the of the FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA- and DNA-binding proteins whose expression is altered in cancer. It is a nuclear RNA binding protein that may play an important role in the regulation of RNA metabolism during fly development. Cabeza contains one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). | ||
smart00768 | X8 | 6.0e-26 | 364 | 446 | 85 | + Possibly involved in carbohydrate binding. The X8 domain, which may be involved in carbohydrate binding, is found in an Olive pollen antigen as well as at the C terminus of family 17 glycosyl hydrolases. It contains 6 conserved cysteine residues which presumably form three disulfide bridges. | ||
cd12280 | RRM_FET | 3.0e-37 | 762 | 848 | 87 | + RNA recognition motif in the FET family of RNA-binding proteins. This subfamily corresponds to the RRM of FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA-binding proteins. This ubiquitously expressed family of similarly structured proteins predominantly localizing to the nuclear, includes FUS (also known as TLS or Pigpen or hnRNP P2), EWS (also known as EWSR1), TAF15 (also known as hTAFII68 or TAF2N or RPB56), and Drosophila Cabeza (also known as SARFH). The corresponding coding genes of these proteins are involved in deleterious genomic rearrangements with transcription factor genes in a variety of human sarcomas and acute leukemias. All FET proteins interact with each other and are therefore likely to be part of the very same protein complexes, which suggests a general bridging role for FET proteins coupling RNA transcription, processing, transport, and DNA repair. The FET proteins contain multiple copies of a degenerate hexapeptide repeat motif at the N-terminus. The C-terminal region consists of a conserved nuclear import and retention signal (C-NLS), a putative zinc-finger domain, and a conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), which is flanked by 3 arginine-glycine-glycine (RGG) boxes. FUS and EWS might have similar sequence specificity; both bind preferentially to GGUG-containing RNAs. FUS has also been shown to bind strongly to human telomeric RNA and to small low-copy-number RNAs tethered to the promoter of cyclin D1. To date, nothing is known about the RNA binding specificity of TAF15. | ||
pfam00332 | Glyco_hydro_17 | 4.0e-66 | 26 | 345 | 321 | + Glycosyl hydrolases family 17. |
Gene Ontology | |
---|---|
GO Term | Description |
GO:0003676 | nucleic acid binding |
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds |
GO:0005622 | intracellular |
GO:0005975 | carbohydrate metabolic process |
GO:0008270 | zinc ion binding |
Annotations - NR Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
EMBL | CBI28434.1 | 0 | 23 | 483 | 20 | 481 | unnamed protein product [Vitis vinifera] |
EMBL | CBI28434.1 | 0 | 683 | 904 | 693 | 873 | unnamed protein product [Vitis vinifera] |
RefSeq | NP_200656.2 | 0 | 1 | 464 | 1 | 462 | glycosyl hydrolase family 17 protein [Arabidopsis thaliana] |
RefSeq | XP_002269108.1 | 0 | 14 | 468 | 14 | 474 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002313970.1 | 0 | 19 | 480 | 15 | 476 | predicted protein [Populus trichocarpa] |
Annotations - PDB Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 3f55_D | 7.9874e-44 | 26 | 345 | 2 | 315 | A Chain A, The Structure Of Soybean Peroxidase |
PDB | 3f55_C | 7.9874e-44 | 26 | 345 | 2 | 315 | A Chain A, The Structure Of Soybean Peroxidase |
PDB | 3f55_B | 7.9874e-44 | 26 | 345 | 2 | 315 | A Chain A, The Structure Of Soybean Peroxidase |
PDB | 3f55_A | 7.9874e-44 | 26 | 345 | 2 | 315 | A Chain A, The Structure Of Soybean Peroxidase |
PDB | 3em5_D | 7.9874e-44 | 26 | 345 | 2 | 315 | A Chain A, Crystal Structure Of A Native Endo Beta-1,3-Glucanase (Hev B 2), A Major Allergen From Hevea Brasiliensis |