Basic Information | |
---|---|
Species | Glycine max |
Cazyme ID | Glyma20g32720.1 |
Family | CBM57 |
Protein Properties | Length: 627; Molecular Weight: 68894.1; Isoelectric Point: 5.981 |
Chromosome | Chromosome/Scaffold: 20; Start: 41349894; End: 41355859 |
Description | Di-glucose binding protein with Leucine-rich repeat domain |
External Links |
---|
NCBI Taxonomy |
Plaza |
CAZyDB |
Signature Domain | |||
---|---|---|---|
Family | Start | End | Evalue |
CBM57 | 210 | 360 | 4.6e-25 |
AGFTNHTDRFSRSWQPDYDFRTIPEDRDEVRSLSTDNSISGADEAPNYFPMKLYQSAVTTEGPLGYELSVDAKLDYTVWLHFAEIDSSVNKAGERVFDIFINDDNVTRLDIYNHVGAFAALTLNFTVKNLSDNVLTLKLVPAVGAPLICAI |
Full Sequence |
---|
Protein Sequence Length: 627 |
MTRAYSFLVS LVFITMTPST PQVEAFSYHI NCGASTDSTD SFNTTWLSDR FFSAGSSALV 60 |
SEPLHFPLPS EKTLRFFPPS SSGKRNCYTF PSLPSPSRYL LRTFTVYDNY DAKSRPPSFD 120 |
VSLSSTVLFS WRSPWPESTA RNGAYSDLFA SLPNTSSLDL CFYGFATDSP LVSSIELVQV 180 |
HPAAYTNSNN LILVNYGRIS CGAAAKPWGA GFTNHTDRFS RSWQPDYDFR TIPEDRDEVR 240 |
SLSTDNSISG ADEAPNYFPM KLYQSAVTTE GPLGYELSVD AKLDYTVWLH FAEIDSSVNK 300 |
AGERVFDIFI NDDNVTRLDI YNHVGAFAAL TLNFTVKNLS DNVLTLKLVP AVGAPLICAI 360 |
ENYALVPVDP STLPLQVSAM KALKESLRVP DRMGWNGDPC APTNWDAWEG VTCRMTNDKT 420 |
AHVISQIDLG SQGLKGFISD QISLLSDLVS LNLSSNSLGG EIPPGLGQKS LIQVDLSNNQ 480 |
LMGFIPDSLA SSNLKLVLLN GNLLEGRVPE QLYSVGVHGG AIDLSGNKGL CGVPSLPSCP 540 |
MFWEHGRLST RGKIAIALSC LFVFCVVLLV AYIYIRRKRN DYDFALPHEL MSLAAKRNRY 600 |
QRQKSLMLLE LESQHAKGLP SPFTPQ* |
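The signature-domain coordinates in this record (210 to 360) appear to be 1-based and inclusive against the full sequence. The following is a minimal Python sketch of how such coordinates could be checked against a sequence copied from the record; the helper names `clean_sequence` and `slice_1based` are hypothetical and not part of any database tooling. It uses only the 181-240 block of the sequence as input.

```python
import re

def clean_sequence(raw: str) -> str:
    """Strip position counters, spaces, and the trailing '*' from a copied sequence."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()

def slice_1based(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the sequence position of seq[0]; it is an assumption made here
    so that a partial excerpt can be sliced with full-length coordinates.
    """
    if start < offset or end < start or end - offset + 1 > len(seq):
        raise ValueError("coordinates fall outside the supplied sequence")
    return seq[start - offset : end - offset + 1]

# Residues 181-240 as shown in the record, including the position counter.
excerpt = "HPAAYTNSNN LILVNYGRIS CGAAAKPWGA GFTNHTDRFS RSWQPDYDFR TIPEDRDEVR 240"
block = clean_sequence(excerpt)

# First 11 residues of the CBM57 signature domain (positions 210-220).
domain_start = slice_1based(block, 210, 220, offset=181)
print(domain_start)  # AGFTNHTDRFS
```

The printed residues match the start of the signature-domain sequence listed above, which is consistent with the 1-based, inclusive reading of the coordinates.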
Functional Domains | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam11721 | Malectin | 0.003 | 27 | 95 | 77 | Di-glucose binding within endoplasmic reticulum. Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan. It carries a signal peptide from residues 1-26, a C-terminal transmembrane helix from residues 255-274, and a highly conserved central part of approximately 190 residues followed by an acidic, glutamate-rich region. Carbohydrate-binding is mediated by the four aromatic residues, Y67, Y89, Y116, and F117 and the aspartate at D186. NMR-based ligand-screening studies have shown binding of the protein to maltose and related oligosaccharides, on the basis of which the protein has been designated "malectin", and its endogenous ligand is found to be Glc2-high-mannose N-glycan. | ||
pfam11721 | Malectin | 2.0e-17 | 210 | 360 | 168 | Di-glucose binding within endoplasmic reticulum. Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan. It carries a signal peptide from residues 1-26, a C-terminal transmembrane helix from residues 255-274, and a highly conserved central part of approximately 190 residues followed by an acidic, glutamate-rich region. Carbohydrate-binding is mediated by the four aromatic residues, Y67, Y89, Y116, and F117 and the aspartate at D186. NMR-based ligand-screening studies have shown binding of the protein to maltose and related oligosaccharides, on the basis of which the protein has been designated "malectin", and its endogenous ligand is found to be Glc2-high-mannose N-glycan. | ||
pfam12819 | Malectin_like | 2.0e-57 | 30 | 363 | 357 | Carbohydrate-binding protein of the ER. Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan. The domain is found on a number of plant receptor kinases. | ||
PLN03150 | PLN03150 | 5.0e-97 | 7 | 580 | 590 | hypothetical protein; Provisional | ||
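To see how the Cdd hits relate to the CBM57 signature domain (210 to 360), the start and end columns can be compared as closed intervals. The following is a rough Python sketch under that assumption; the `Hit` type and `contains` helper are hypothetical names introduced only for illustration, and the coordinates are copied from the Functional Domains table.

```python
from typing import NamedTuple

class Hit(NamedTuple):
    cdd_id: str
    domain: str
    start: int  # 1-based, inclusive (assumed)
    end: int    # 1-based, inclusive (assumed)

def contains(hit: Hit, start: int, end: int) -> bool:
    """True if the hit fully covers the closed interval [start, end]."""
    return hit.start <= start and end <= hit.end

# Rows copied from the Functional Domains table of this record.
HITS = [
    Hit("pfam11721", "Malectin", 27, 95),
    Hit("pfam11721", "Malectin", 210, 360),
    Hit("pfam12819", "Malectin_like", 30, 363),
    Hit("PLN03150", "PLN03150", 7, 580),
]

# Which hits span the whole CBM57 signature domain?
covering = [h.domain for h in HITS if contains(h, 210, 360)]
print(covering)  # ['Malectin', 'Malectin_like', 'PLN03150']
```

Only the weaker Malectin hit at 27 to 95 falls outside the signature region; the other three hits each cover it in full.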
Annotations - NR | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
RefSeq | NP_001145734.1 | 0 | 27 | 626 | 31 | 634 | hypothetical protein LOC100279241 [Zea mays] |
RefSeq | NP_564237.2 | 0 | 27 | 621 | 29 | 622 | leucine-rich repeat protein-related [Arabidopsis thaliana] |
RefSeq | XP_002281668.1 | 0 | 27 | 621 | 30 | 621 | PREDICTED: similar to leucine-rich repeat protein-related [Vitis vinifera] |
RefSeq | XP_002457393.1 | 0 | 27 | 626 | 29 | 632 | hypothetical protein SORBIDRAFT_03g006620 [Sorghum bicolor] |
RefSeq | XP_002524408.1 | 0 | 26 | 597 | 22 | 591 | serine-threonine protein kinase, plant-type, putative [Ricinus communis] |
Annotations - PDB | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 4hq1_A | 0.000004 | 377 | 480 | 29 | 122 | Chain A, Crystal Structure Of A Rice Os3bglu6 Beta-glucosidase |
PDB | 3rgz_A | 0.0005 | 427 | 539 | 637 | 748 | Chain A, Crystal Structure Of A Rice Os3bglu6 Beta-glucosidase |
PDB | 3rgz_A | 0.001 | 423 | 509 | 224 | 311 | Chain A, Crystal Structure Of A Rice Os3bglu6 Beta-glucosidase |
PDB | 3rgz_A | 0.008 | 448 | 515 | 396 | 465 | Chain A, Crystal Structure Of A Rice Os3bglu6 Beta-glucosidase |