PUL ID

PUL0584

PubMed

23530958, BMC Microbiol. 2013 Mar 26;13:68. doi: 10.1186/1471-2180-13-68.

Characterization method

microarray

Genomic accession number

CP001834.1

Nucelotide position range

2317224-2332669

Substrate

melibiose

Loci

LLKF_2259-LLKF_2269

Species

Lactococcus lactis/1358

Degradation or Biosynthesis

degradation

Cluster number

1

Gene name

Gene position

Gene type

Found by CGCFinder?

sucP 1 - 1446 (-) CAZyme: GH13|GH13_18 Yes
- 1533 - 2360 (-) TC: gnl|TC-DB|Q00751|3.A.1.1.28 Yes
- 2373 - 3242 (-) TC: gnl|TC-DB|Q00750|3.A.1.1.28 Yes
- 3343 - 4587 (-) TC: gnl|TC-DB|Q00749|3.A.1.1.28 Yes
- 4737 - 5606 (+) TF: DBD-Pfam|HTH_AraC,DBD-Pfam|HTH_AraC,DBD-SUPERFAMILY|0035607,DBD-SUPERFAMILY|0035607 Yes
purH 5678 - 7222 (-) other Yes
galT 7347 - 8831 (-) other Yes
galK 8828 - 10018 (-) other Yes
aga 10015 - 12231 (-) CAZyme: GH36 Yes
galR 12362 - 13399 (-) TF: DBD-Pfam|LacI,DBD-SUPERFAMILY|0044558 No
fbp 13524 - 15446 (-) CDS No

PUL ID

PUL0584

PubMed

23530958, BMC Microbiol. 2013 Mar 26;13:68. doi: 10.1186/1471-2180-13-68.

Title

Genotype-phenotype matching analysis of 38 Lactococcus lactis strains using random forest methods.

Author

Bayjanov JR, Starrenburg MJ, van der Sijde MR, Siezen RJ, van Hijum SA

Abstract

BACKGROUND: Lactococcus lactis is used in dairy food fermentation and for the efficient production of industrially relevant enzymes. The genome content and different phenotypes have been determined for multiple L. lactis strains in order to understand intra-species genotype and phenotype diversity and annotate gene functions. In this study, we identified relations between gene presence and a collection of 207 phenotypes across 38 L. lactis strains of dairy and plant origin. Gene occurrence and phenotype data were used in an iterative gene selection procedure, based on the Random Forest algorithm, to identify genotype-phenotype relations. RESULTS: A total of 1388 gene-phenotype relations were found, of which some confirmed known gene-phenotype relations, such as the importance of arabinose utilization genes only for strains of plant origin. We also identified a gene cluster related to growth on melibiose, a plant disaccharide; this cluster is present only in melibiose-positive strains and can be used as a genetic marker in trait improvement. Additionally, several novel gene-phenotype relations were uncovered, for instance, genes related to arsenite resistance or arginine metabolism. CONCLUSIONS: Our results indicate that genotype-phenotype matching by integrating large data sets provides the possibility to identify gene-phenotype relations, possibly improve gene function annotation and identified relations can be used for screening bacterial culture collections for desired phenotypes. In addition to all gene-phenotype relations, we also provide coherent phenotype data for 38 Lactococcus strains assessed in 207 different phenotyping experiments, which to our knowledge is the largest to date for the Lactococcus lactis species.