Basic Information | |
---|---|
Species | Thellungiella halophila |
Cazyme ID | Thhalv10003634m |
Family | CBM20 |
Protein Properties | Length: 881 Molecular Weight: 96927.5 Isoelectric Point: 6.4206 |
Chromosome | Chromosome/Scaffold: 6 Start: 3885960 End: 3892027 |
Description | catalytics;carbohydrate kinases;phosphoglucan, water dikinases |
View CDS |
External Links |
---|
CAZyDB |
Signature Domain Download full data set without filtering | |||
---|---|---|---|
Family | Start | End | Evalue |
CBM20 | 69 | 153 | 1.8e-21 |
KVKLNVRLNHQVNFGEHVAMFGSAKEIGSWKKKSPLTWTENGWVCELELSGGQVLEYKFVIVHNDGSLSWESGDNRVLKLPNSGS |
Full Sequence |
---|
Protein Sequence Length: 881 Download |
MESIGSHCCG SPFTFITRNS SLPRIVNFTR SVHLGRQSQR LRNSPSRLTC AASSTIEEQR 60 KKKDGSGTKV KLNVRLNHQV NFGEHVAMFG SAKEIGSWKK KSPLTWTENG WVCELELSGG 120 QVLEYKFVIV HNDGSLSWES GDNRVLKLPN SGSFSVVCHW DATGETLDFP QEVASDGHDD 180 SDKHDVVDDR VVGSENGAQL HKSTLAGQWQ GKDASFMRSN EHGNREVGRN WDTTGLEGSA 240 LKMVEGDRNS RNWWRKLEMV REVIVGSLER EERLKALIYS AVYLKWINTG QIPCFEDGGH 300 HRPNRHAEIS RLIFRELEHI CSKKDATAEE VLVSRKIHPC LPSFKAEFTA SVPLTRIRDI 360 AHRNDIPHDL KQEIKHTIQN KLHRNAGPED LIATEAMLER ITETPGQYSG DFVEQFKIFH 420 NELKDFFNAG SLTEQLDSMK ISMDERGLSA LSLFFECKSG LDASGESTNV LELIKTMHSL 480 ASLRETIIKE LNSGLRNDAP DAAIAMRQKW RLCEIGLEDY FFVLLSRFLN ALETIGGAVQ 540 LAKDVESGNV ASWNDPLDAL VLGVHQVGLS GWKKEECLAI GNELLAWRER DLLEKEGEED 600 GRTIWAMRLK ATLDRARRLT AEYSDLLLQI FPPNVEILGK ALGIPENSVR TYTEAEIRAG 660 IIFQISKLCT VLLKAVRNSL GSEGWDVIVP GSTSGTLVQV ESIVPGSFPS TGGGPIILLV 720 NKADGDEEVS AANGNIAGVM LLQELPHLSH LGVRARQEKI VFVTCDDDDK VADIRRLVGK 780 YVRLEASPSH VNLILSAEGK SRTSKSNANK KTDKKSSSVD DEESKPVSSS SDSLLNSSKD 840 IPSGGIISLA DSDASTSGSK AAACGLLASL AAASSRGKIF * |
Functional Domains Download unfiltered results here | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
cd02853 | E_set_MTHase_like_N | 3.0e-10 | 82 | 156 | 78 | + N-terminal Early set domain associated with the catalytic domain of Maltooligosyl trehalose trehalohydrolase (also called Glycosyltrehalose trehalohydrolase) and similar proteins. E or "early" set domains are associated with the catalytic domain of Maltooligosyl trehalose trehalohydrolase (MTHase) and similar proteins at the N-terminal end. This subfamily also includes bacterial alpha amylases and 1,4-alpha-glucan branching enzymes which are highly similar to MTHase. Maltooligosyl trehalose synthase (MTSase) and MTHase work together to produce trehalose. MTSase is responsible for converting the alpha-1,4-glucosidic linkage to an alpha,alpha-1,1-glucosidic linkage at the reducing end of the maltooligosaccharide through an intramolecular transglucosylation reaction, while MTHase hydrolyzes the penultimate alpha-1,4 linkage of the reducing end, resulting in the release of trehalose. The N-terminal domain of MTHase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase, among others. | ||
pfam00686 | CBM_20 | 8.0e-17 | 69 | 158 | 96 | + Starch binding domain. | ||
smart01065 | CBM_2 | 1.0e-17 | 68 | 150 | 89 | + Starch binding domain. | ||
cd05467 | CBM20 | 2.0e-23 | 75 | 160 | 92 | + The family 20 carbohydrate-binding module (CBM20), also known as the starch-binding domain, is found in a large number of starch degrading enzymes including alpha-amylase, beta-amylase, glucoamylase, and CGTase (cyclodextrin glucanotransferase). CBM20 is also present in proteins that have a regulatory role in starch metabolism in plants (e.g. alpha-amylase) or glycogen metabolism in mammals (e.g. laforin). CBM20 folds as an antiparallel beta-barrel structure with two starch binding sites. These two sites are thought to differ functionally with site 1 acting as the initial starch recognition site and site 2 involved in the specific recognition of appropriate regions of starch. | ||
cd05818 | CBM20_water_dikinase | 6.0e-53 | 69 | 160 | 92 | + Phosphoglucan water dikinase (also known as alpha-glucan water dikinase), N-terminal CBM20 (carbohydrate-binding module, family 20) domain. This domain is found in the chloroplast-encoded phosphoglucan water dikinase, one of two enzymes involved in the phosphorylation of plant starches. In addition to the CBM20 domain, phosphoglucan water dikinase contains a C-terminal pyruvate binding domain. The CBM20 domain is found in a large number of starch degrading enzymes including alpha-amylase, beta-amylase, glucoamylase, and CGTase (cyclodextrin glucanotransferase). CBM20 is also present in proteins that have a regulatory role in starch metabolism in plants (e.g. alpha-amylase) or glycogen metabolism in mammals (e.g. laforin). CBM20 folds as an antiparallel beta-barrel structure with two starch binding sites. These two sites are thought to differ functionally with site 1 acting as the initial starch recognition site and site 2 involved in the specific recognition of appropriate regions of starch. |
Gene Ontology | |
---|---|
GO Term | Description |
GO:0003824 | catalytic activity |
GO:0005975 | carbohydrate metabolic process |
GO:2001070 | starch binding |
Annotations - NR Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
GenBank | AAU93516.1 | 0 | 1 | 876 | 1 | 895 | chloroplast alpha-glucan water dikinase isoform 3 [Arabidopsis thaliana] |
EMBL | CBI39424.1 | 0 | 47 | 876 | 51 | 887 | unnamed protein product [Vitis vinifera] |
RefSeq | NP_001119280.1 | 0 | 1 | 826 | 1 | 845 | ATGWD3; carbohydrate kinase/ catalytic/ phosphoglucan, water dikinase [Arabidopsis thaliana] |
RefSeq | NP_198009.3 | 0 | 1 | 876 | 1 | 895 | ATGWD3; carbohydrate kinase/ catalytic/ phosphoglucan, water dikinase [Arabidopsis thaliana] |
RefSeq | XP_002265211.1 | 0 | 47 | 876 | 51 | 887 | PREDICTED: hypothetical protein [Vitis vinifera] |
Annotations - PDB Download unfiltered results here | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 1kum_A | 0.000001 | 68 | 151 | 5 | 95 | A Chain A, Crystal Structure Of The Nasturtium Seedling Mutant Xyloglucanase Isoform Nxg1-Delta-Yniig |
PDB | 1kul_A | 0.000001 | 68 | 151 | 5 | 95 | A Chain A, Crystal Structure Of The Nasturtium Seedling Mutant Xyloglucanase Isoform Nxg1-Delta-Yniig |
PDB | 1acz_A | 0.000001 | 68 | 151 | 5 | 95 | A Chain A, Glucoamylase, Granular Starch-Binding Domain Complex With Cyclodextrin, Nmr, 5 Structures |
PDB | 1ac0_A | 0.000001 | 68 | 151 | 5 | 95 | A Chain A, Glucoamylase, Granular Starch-Binding Domain Complex With Cyclodextrin, Nmr, Minimized Average Structure |
PDB | 1cyg_A | 0.000004 | 68 | 160 | 576 | 678 | A Chain A, Cyclodextrin Glucanotransferase (E.C.2.4.1.19) (Cgtase) |
EST Download unfiltered results here | ||||
---|---|---|---|---|
Hit | Length | Start | End | EValue |
HO778000 | 505 | 308 | 797 | 0 |
EV093738 | 261 | 545 | 805 | 0 |
DK535862 | 252 | 552 | 803 | 0 |
HO794745 | 342 | 461 | 797 | 0 |
HO794745 | 44 | 415 | 458 | 0.00000002 |
Sequence Alignments (This image is cropped. Click for full image.) |
---|