y
Basic Information | |
---|---|
Species | Capsella rubella |
Cazyme ID | Carubv10028073m |
Family | CBM43 |
Protein Properties | Length: 919 Molecular Weight: 96345.1 Isoelectric Point: 9.1429 |
Chromosome | Chromosome/Scaffold: 8 Start: 9838303 End: 9842758 |
Description | O-Glycosyl hydrolases family 17 protein |
View CDS |
External Links |
---|
CAZyDB |
Signature Domain Download full data set without filtering | |||
---|---|---|---|
Family | Start | End | Evalue |
CBM43 | 344 | 423 | 2.5e-22 |
WCVVNNNKDLSNASARLLEACAVADCTSILPGGTCAGIGWPGNVSYAFNSLYQQNDHSAESCSFGGLGLITTVDPSVDNC | |||
GH17 | 5 | 324 | 0 |
VGVNWGTEASHPLPPSKVVELLKSNSIAKVKLFDADPKVLRALSGSNIGVTVGIPNSMVKSLNASRKVAESWVHDNVTRYFNGGNRVRIEYVAVGDEPFL QSYGNQYRPFVIGAAMNIQNALAKASLASEVKVVVPSSFDSFLSKSGRPSSGHFRADLNKTMIELLSFLTKHHSPFFVTISPFLSFHQNKNISLDFSLFK ETAQAHKDGRKTYRNSYDLSYDTLASALSSIGFSDVDIVVSKIGWPTDGAANATSPTAEVFLKGLMGHLEKKTGSLLRPPIETYIGSLLDEDQRNISSGN FERHWGVFTFDGQAKYGFSF |
Full Sequence |
---|
Protein Sequence Length: 919 Download |
TAGAVGVNWG TEASHPLPPS KVVELLKSNS IAKVKLFDAD PKVLRALSGS NIGVTVGIPN 60 SMVKSLNASR KVAESWVHDN VTRYFNGGNR VRIEYVAVGD EPFLQSYGNQ YRPFVIGAAM 120 NIQNALAKAS LASEVKVVVP SSFDSFLSKS GRPSSGHFRA DLNKTMIELL SFLTKHHSPF 180 FVTISPFLSF HQNKNISLDF SLFKETAQAH KDGRKTYRNS YDLSYDTLAS ALSSIGFSDV 240 DIVVSKIGWP TDGAANATSP TAEVFLKGLM GHLEKKTGSL LRPPIETYIG SLLDEDQRNI 300 SSGNFERHWG VFTFDGQAKY GFSFNHNSKK LVNAQNVQYL PPKWCVVNNN KDLSNASARL 360 LEACAVADCT SILPGGTCAG IGWPGNVSYA FNSLYQQNDH SAESCSFGGL GLITTVDPSV 420 DNCRFSIQLD TSHSSSQNPI FCQRWPLLLL LFLLCEVYRP IRNDAASNNK KIKTKKVSPV 480 IFWKPVMAGM YNQDGGGGAP IPSYGGDGYG GGGGYGGGDS GYGGRSSSGG GGYGGRGGYG 540 GGGGRGNRGG GYQGGDRGGR GSGGGGRGGG GRDGGGKDGD WRCPNPSCGN VNFARRVECN 600 KCGAPAPSGA GAGAGDRGGG GYSRGGGASD RGGARGGRND SGRSYESSRY DGGSRVGGSY 660 GTGSQQRDNG SYGQAPPPPA AIPSYDGSGT YPPPSGYGME AVPPPSSYSG GPPSYGGPRG 720 GYGSDAPSPG GRGGRAGGYD GSSAPRRQES SYEDAAAENR TQVKQCDADC DDTCDNARIY 780 ISNLPPNVTT DELKDLFGGI GQVGRIKQKR GYKDQWPYNI KIYTDEKGNH KGDACLAYED 840 PSAAHSAGGF FNNYEMRGSK ISVTMAEKSA PRAPTFDQRG GGRGGGGGYG GGGGDRRRDN 900 YGSGPDRNHH GGNRSRPY* 960 |
Functional Domains Download unfiltered results here | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam07983 | X8 | 5.0e-14 | 343 | 411 | 76 | + X8 domain. The X8 domain domain contains at least 6 conserved cysteine residues that presumably form three disulphide bridges. The domain is found in an Olive pollen allergen as well as at the C-terminus of several families of glycosyl hydrolases. This domain may be involved in carbohydrate binding. This domain is characteristic of GPI-anchored domains. | ||
cd12534 | RRM_SARFH | 2.0e-17 | 779 | 866 | 89 | + RNA recognition motif in Drosophila melanogaster RNA-binding protein cabeza and similar proteins. This subgroup corresponds to the RRM in cabeza, also termed P19, or sarcoma-associated RNA-binding fly homolog (SARFH). It is a putative homolog of human RNA-binding proteins FUS (also termed TLS or Pigpen or hnRNP P2), EWS (also termed EWSR1), TAF15 (also termed hTAFII68 or TAF2N or RPB56), and belongs to the of the FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA- and DNA-binding proteins whose expression is altered in cancer. It is a nuclear RNA binding protein that may play an important role in the regulation of RNA metabolism during fly development. Cabeza contains one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). | ||
smart00768 | X8 | 5.0e-27 | 343 | 425 | 85 | + Possibly involved in carbohydrate binding. The X8 domain, which may be involved in carbohydrate binding, is found in an Olive pollen antigen as well as at the C terminus of family 17 glycosyl hydrolases. It contains 6 conserved cysteine residues which presumably form three disulfide bridges. | ||
cd12280 | RRM_FET | 6.0e-37 | 779 | 865 | 87 | + RNA recognition motif in the FET family of RNA-binding proteins. This subfamily corresponds to the RRM of FET (previously TET) (FUS/TLS, EWS, TAF15) family of RNA-binding proteins. This ubiquitously expressed family of similarly structured proteins predominantly localizing to the nuclear, includes FUS (also known as TLS or Pigpen or hnRNP P2), EWS (also known as EWSR1), TAF15 (also known as hTAFII68 or TAF2N or RPB56), and Drosophila Cabeza (also known as SARFH). The corresponding coding genes of these proteins are involved in deleterious genomic rearrangements with transcription factor genes in a variety of human sarcomas and acute leukemias. All FET proteins interact with each other and are therefore likely to be part of the very same protein complexes, which suggests a general bridging role for FET proteins coupling RNA transcription, processing, transport, and DNA repair. The FET proteins contain multiple copies of a degenerate hexapeptide repeat motif at the N-terminus. The C-terminal region consists of a conserved nuclear import and retention signal (C-NLS), a putative zinc-finger domain, and a conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), which is flanked by 3 arginine-glycine-glycine (RGG) boxes. FUS and EWS might have similar sequence specificity; both bind preferentially to GGUG-containing RNAs. FUS has also been shown to bind strongly to human telomeric RNA and to small low-copy-number RNAs tethered to the promoter of cyclin D1. To date, nothing is known about the RNA binding specificity of TAF15. | ||
pfam00332 | Glyco_hydro_17 | 6.0e-64 | 5 | 324 | 321 | + Glycosyl hydrolases family 17. |
Gene Ontology | |
---|---|
GO Term | Description |
GO:0003676 | nucleic acid binding |
GO:0004553 | hydrolase activity, hydrolyzing O-glycosyl compounds |
GO:0005622 | intracellular |
GO:0005975 | carbohydrate metabolic process |
GO:0008270 | zinc ion binding |
Annotations - NR Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
EMBL | CBI28434.1 | 0 | 2 | 454 | 20 | 478 | unnamed protein product [Vitis vinifera] |
EMBL | CBI28434.1 | 0 | 697 | 918 | 693 | 873 | unnamed protein product [Vitis vinifera] |
RefSeq | NP_200656.2 | 0 | 1 | 446 | 22 | 465 | glycosyl hydrolase family 17 protein [Arabidopsis thaliana] |
RefSeq | XP_002269108.1 | 0 | 4 | 448 | 25 | 475 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002313970.1 | 0 | 1 | 451 | 18 | 472 | predicted protein [Populus trichocarpa] |
Annotations - PDB Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 3f55_D | 1.96182e-44 | 5 | 324 | 2 | 315 | A Chain A, Pectin Methylesterase From Yersinia Enterocolitica |
PDB | 3f55_C | 1.96182e-44 | 5 | 324 | 2 | 315 | A Chain A, Pectin Methylesterase From Yersinia Enterocolitica |
PDB | 3f55_B | 1.96182e-44 | 5 | 324 | 2 | 315 | A Chain A, Pectin Methylesterase From Yersinia Enterocolitica |
PDB | 3f55_A | 1.96182e-44 | 5 | 324 | 2 | 315 | A Chain A, Pectin Methylesterase From Yersinia Enterocolitica |
PDB | 3em5_D | 1.96182e-44 | 5 | 324 | 2 | 315 | A Chain A, Crystal Structure Of A Native Endo Beta-1,3-Glucanase (Hev B 2), A Major Allergen From Hevea Brasiliensis |
EST Download unfiltered results here | ||||
---|---|---|---|---|
Hit | Length | Start | End | EValue |
DK488212 | 251 | 1 | 251 | 0 |
DK466382 | 256 | 1 | 256 | 0 |
DK503228 | 233 | 1 | 233 | 0 |
DK547796 | 233 | 208 | 440 | 0 |
DK526005 | 235 | 206 | 440 | 0 |
Sequence Alignments (This image is cropped. Click for full image.) |
---|