y
Basic Information | |
---|---|
Species | Carica papaya |
Cazyme ID | evm.TU.contig_30133.1 |
Family | GT31 |
Protein Properties | Length: 568 Molecular Weight: 63468.1 Isoelectric Point: 9.7988 |
Chromosome | Chromosome/Scaffold: 30133 Start: 703 End: 5006 |
Description | Galactosyltransferase family protein |
View CDS |
External Links |
---|
NCBI Taxonomy |
CAZyDB |
Signature Domain Download full data set without filtering | |||
---|---|---|---|
Family | Start | End | Evalue |
GT31 | 330 | 526 | 0 |
KRRDSVRETWMPGGAKLKKLEREKGIVIRFVIGHSATPGGILDKAIDAEEVEHGDFLRLKHVEGYHELSTKTRLYFSTAVSIWDAEFYLKVDDDVHLNLG TLVSRLARYRWKPRVYIGCMKSGAVLSQKGVKYHEPEYWKFGEEGNKYFRHATGQIYAISKDLAAYISINQPILHRYANEDVSLGSWFIGLEVQHVD |
Full Sequence |
---|
Protein Sequence Length: 568 Download |
MSVTQEPTQP PQSSSTCTHL ESAPVINSRS NKRTRQTDTP FRGVRKRSWG RYVSEIRLPG 60 QKTRIWLGSF RSPDMAARAY DSAAFFLKGN SASLNFPNSI DSLPRPLSSS RRDIQSAAAQ 120 AALVSVGSDP KNEWREGREN TTSFKEDSEV DYFPPLSPLT FDSVNRRGIF ALLKKKIMMR 180 GKAVLLSGKA ILMACIGSFL AGSLFGSRNW STRSANYDLP RDNHHRLPII PYHVNRNMVQ 240 LPSDSRHDHG HNNPGLPQGV AGDVMGEVVK THRAIQSLEK TISMLEMELA VARTSSSNGG 300 DGIQNPPPTN HTLQKAFVVI GINTAFSSKK RRDSVRETWM PGGAKLKKLE REKGIVIRFV 360 IGHSATPGGI LDKAIDAEEV EHGDFLRLKH VEGYHELSTK TRLYFSTAVS IWDAEFYLKV 420 DDDVHLNLGT LVSRLARYRW KPRVYIGCMK SGAVLSQKGV KYHEPEYWKF GEEGNKYFRH 480 ATGQIYAISK DLAAYISINQ PILHRYANED VSLGSWFIGL EVQHVDERSM CCGTPPDCEW 540 KAQAGNVCVA SFDWSCSGIC KSVERIKH 600 |
Functional Domains Download unfiltered results here | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam13334 | DUF4094 | 8.0e-22 | 187 | 295 | 109 | + Domain of unknown function (DUF4094). This domain is found in plant proteins that often carry a galactosyltransferase domain, pfam01762, at their C-terminus. | ||
cd00018 | AP2 | 5.0e-23 | 41 | 98 | 58 | + DNA-binding domain found in transcription regulators in plants such as APETALA2 and EREBP (ethylene responsive element binding protein). In EREBPs the domain specifically binds to the 11bp GCC box of the ethylene response element (ERE), a promotor element essential for ethylene responsiveness. EREBPs and the C-repeat binding factor CBF1, which is involved in stress response, contain a single copy of the AP2 domain. APETALA2-like proteins, which play a role in plant development contain two copies. | ||
smart00380 | AP2 | 3.0e-25 | 41 | 98 | 58 | + DNA-binding domain in plant proteins such as APETALA2 and EREBPs. | ||
pfam01762 | Galactosyl_T | 3.0e-53 | 330 | 526 | 203 | + Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2. | ||
PLN03193 | PLN03193 | 0 | 185 | 567 | 394 | + beta-1,3-galactosyltransferase; Provisional |
Gene Ontology | |
---|---|
GO Term | Description |
GO:0003700 | sequence-specific DNA binding transcription factor activity |
GO:0006355 | regulation of transcription, DNA-dependent |
GO:0006486 | protein glycosylation |
GO:0008378 | galactosyltransferase activity |
GO:0016020 | membrane |
Annotations - NR Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
RefSeq | NP_174609.1 | 0 | 179 | 566 | 1 | 374 | galactosyltransferase family protein [Arabidopsis thaliana] |
RefSeq | XP_002265159.1 | 0 | 179 | 567 | 1 | 372 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002269415.1 | 0 | 191 | 567 | 15 | 379 | PREDICTED: hypothetical protein [Vitis vinifera] |
RefSeq | XP_002513511.1 | 0 | 179 | 567 | 43 | 418 | transferase, transferring glycosyl groups, putative [Ricinus communis] |
RefSeq | XP_002513842.1 | 0 | 186 | 567 | 9 | 378 | Beta-1,3-galactosyltransferase sqv-2, putative [Ricinus communis] |
Annotations - PDB Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 3gcc_A | 0.0000000000002 | 41 | 102 | 6 | 68 | A Chain A, Crystal Structure Of A Native Endo Beta-1,3-Glucanase (Hev B 2), A Major Allergen From Hevea Brasiliensis |
PDB | 2gcc_A | 0.0000000000002 | 41 | 102 | 6 | 68 | A Chain A, Solution Structure Of The Gcc-Box Binding Domain, Nmr, Minimized Mean Structure |
PDB | 1gcc_A | 0.0000000000005 | 41 | 97 | 3 | 60 | A Chain A, Solution Nmr Structure Of The Complex Of Gcc-Box Binding Domain Of Aterf1 And Gcc-Box Dna, Minimized Average Structure |