Basic Information | |
---|---|
Species | Glycine max |
Cazyme ID | Glyma01g22406.1 |
Family | CBM57 |
Protein Properties | Length: 643 Molecular Weight: 70444 Isoelectric Point: 6.8254 |
Chromosome | Chromosome/Scaffold: 01 Start: 28104832 End: 28117466 |
Description | Di-glucose binding protein with Leucine-rich repeat domain |
External Links |
---|
NCBI Taxonomy |
Plaza |
CAZyDB |
Signature Domain | |||
---|---|---|---|
Family | Start | End | Evalue |
CBM57 | 228 | 376 | 3.7e-22 |
SDRFGRSWQSDSDFRTGRSKVRAVSTRSGISGTEQKPNYFPEKLYQSAAMTAVTAEEGDGVLEYELSVDAKLDYLVWLHFAEIEGRVRRVGERVFDVYINNDNLTRIDIYKQVGGFAAFTWHHTVKNLSSSVLSVKLVGVVGAPLICGI |
Full Sequence |
---|
Protein Sequence Length: 643 |
MSLFHTLTLL IFLFVILFTT STTTSTPLPL PTPSFPSGLS YHIDCGSPTN STDQFNTTWL 60 SDRYFSGGAT GIVSEPLRFR HGHEKTLRFF PISSGKKNCY TVPNLPPSRY LLRTFVVYDN 120 YDGRSHPPSF DVAVAATVVF SWRSPWPQSL ARNGAYADLF ATIASSEALI CFYSFATDPP 180 VVSSIELFAA DPASYDAAAI GKNDIVLVNY GRLSCGSNQW GPGFSNDSDR FGRSWQSDSD 240 FRTGRSKVRA VSTRSGISGT EQKPNYFPEK LYQSAAMTAV TAEEGDGVLE YELSVDAKLD 300 YLVWLHFAEI EGRVRRVGER VFDVYINNDN LTRIDIYKQV GGFAAFTWHH TVKNLSSSVL 360 SVKLVGVVGA PLICGIENYA LVPSDPSTVP EQVVAMKALK DSFRVPERMG WNGDPCAPTN 420 WDAWEGVTCR TSKNSTTLVI SQIDLGSQGL KGSISDQISL LSDLVSLNLS SNLLVGEIPS 480 GLGQKSLIHL DLSNNQLTGP IPDSIASSSL QLVLLNGNLL EGRVPEQLYS IGVHGGAIDL 540 SGNKGLCGVP SLPDCPMFWE NGKLSTQGKI AIGLSCLFVF CVILLLVYIY IRRRRNDYDF 600 ALPHELTSLA AKRNRYQRQK SLMVLEMESQ HAKGLPSHFT TQ* 660 |
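The sequence above is printed in blocks of ten residues with running position counters and a trailing `*` stop marker, while the domain tables use 1-based, inclusive start and end coordinates. The following is a minimal sketch of how such a formatted sequence could be cleaned and sliced by those coordinates. The helper names `clean_sequence` and `subsequence` are hypothetical and not part of any database tooling; only the first 60 residues of the record are used here to keep the example short.

```python
import re


def clean_sequence(raw: str) -> str:
    """Drop spaces, position counters, and the '*' stop marker (hypothetical helper)."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}-{end} fall outside 1-{len(seq)}")
    return seq[start - 1:end]


# First 60 residues as displayed in the record, including the position counter.
first_line = "MSLFHTLTLL IFLFVILFTT STTTSTPLPL PTPSFPSGLS YHIDCGSPTN STDQFNTTWL 60"
seq = clean_sequence(first_line)

print(len(seq))                  # 60
print(subsequence(seq, 41, 50))  # YHIDCGSPTN
```

Applied to the full cleaned sequence, `subsequence(seq, 228, 376)` would select the span listed for the CBM57 signature domain.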
Functional Domains | ||||||||
---|---|---|---|---|---|---|---|---|
Cdd ID | Domain | E-Value | Start | End | Length | Domain Description | ||
pfam11721 | Malectin | 0.008 | 41 | 113 | 81 | Di-glucose binding within endoplasmic reticulum. Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan. It carries a signal peptide from residues 1-26, a C-terminal transmembrane helix from residues 255-274, and a highly conserved central part of approximately 190 residues followed by an acidic, glutamate-rich region. Carbohydrate binding is mediated by four aromatic residues (Y67, Y89, Y116, and F117) and the aspartate D186. NMR-based ligand-screening studies have shown binding of the protein to maltose and related oligosaccharides, on the basis of which the protein has been designated "malectin", and its endogenous ligand is found to be Glc2-high-mannose N-glycan. | ||
PLN00113 | PLN00113 | 0.001 | 419 | 502 | 92 | leucine-rich repeat receptor-like protein kinase; Provisional | ||
pfam11721 | Malectin | 8.0e-10 | 212 | 343 | 138 | Di-glucose binding within endoplasmic reticulum. Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan. It carries a signal peptide from residues 1-26, a C-terminal transmembrane helix from residues 255-274, and a highly conserved central part of approximately 190 residues followed by an acidic, glutamate-rich region. Carbohydrate binding is mediated by four aromatic residues (Y67, Y89, Y116, and F117) and the aspartate D186. NMR-based ligand-screening studies have shown binding of the protein to maltose and related oligosaccharides, on the basis of which the protein has been designated "malectin", and its endogenous ligand is found to be Glc2-high-mannose N-glycan. | ||
pfam12819 | Malectin_like | 3.0e-44 | 43 | 347 | 320 | Carbohydrate-binding protein of the ER. Malectin is a membrane-anchored protein of the endoplasmic reticulum that recognises and binds Glc2-N-glycan. The domain is found on a number of plant receptor kinases. | ||
PLN03150 | PLN03150 | 4.0e-84 | 43 | 575 | 548 | hypothetical protein; Provisional |
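The domain hits above span a wide range of E-values, from 0.008 down to 4.0e-84. As a rough illustration of how the rows could be ranked and the weaker hits set aside, the sketch below copies the five rows into a list and filters them. The cutoff of 1e-5 is an assumption chosen for illustration, not a threshold stated by the database, and the `DomainHit` type is hypothetical.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the Functional Domains table (hypothetical container)."""
    cdd_id: str
    domain: str
    evalue: float
    start: int
    end: int


# Rows copied from the table above.
hits = [
    DomainHit("pfam11721", "Malectin", 0.008, 41, 113),
    DomainHit("PLN00113", "PLN00113", 0.001, 419, 502),
    DomainHit("pfam11721", "Malectin", 8.0e-10, 212, 343),
    DomainHit("pfam12819", "Malectin_like", 3.0e-44, 43, 347),
    DomainHit("PLN03150", "PLN03150", 4.0e-84, 43, 575),
]

EVALUE_CUTOFF = 1e-5  # assumed cutoff, for illustration only

# Keep hits at or below the cutoff, most significant first.
confident = sorted(
    (h for h in hits if h.evalue <= EVALUE_CUTOFF),
    key=lambda h: h.evalue,
)

for h in confident:
    print(f"{h.domain}\t{h.start}-{h.end}\t{h.evalue:.1e}")
```

With this cutoff, the N-terminal Malectin hit (residues 41-113) and the PLN00113 hit are dropped, leaving PLN03150, Malectin_like, and the Malectin hit at residues 212-343.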
Gene Ontology | |
---|---|
GO Term | Description |
GO:0005515 | protein binding |
Annotations - NR | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
GenBank | AAG50525.1 | 0 | 34 | 615 | 23 | 550 | AC084221_7 hypothetical protein [Arabidopsis thaliana] |
RefSeq | NP_564237.2 | 0 | 34 | 637 | 23 | 622 | leucine-rich repeat protein-related [Arabidopsis thaliana] |
RefSeq | XP_002281668.1 | 0 | 35 | 637 | 25 | 621 | PREDICTED: similar to leucine-rich repeat protein-related [Vitis vinifera] |
RefSeq | XP_002457393.1 | 0 | 40 | 635 | 29 | 625 | hypothetical protein SORBIDRAFT_03g006620 [Sorghum bicolor] |
RefSeq | XP_002524408.1 | 0 | 36 | 613 | 19 | 591 | serine-threonine protein kinase, plant-type, putative [Ricinus communis] |
Annotations - PDB | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-Value | Query Start | Query End | Hit Start | Hit End | Description |
PDB | 4hq1_A | 0.0000003 | 395 | 528 | 31 | 156 | B Chain B, Crystal Structure Of Recombinant Foot-and-mouth-disease Virus A22- H2093c Empty Capsid |
PDB | 3rgz_A | 0.00002 | 439 | 525 | 224 | 311 | B Chain B, Crystal Structure Of Recombinant Foot-and-mouth-disease Virus A22- H2093c Empty Capsid |
PDB | 3rgz_A | 0.00004 | 443 | 503 | 661 | 722 | B Chain B, Crystal Structure Of Recombinant Foot-and-mouth-disease Virus A22- H2093c Empty Capsid |