Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.

<h4>Background</h4>To predict gene expressions is an important endeavour within computational systems biology. It can both be a way to explore how drugs affect the system, as well as providing a framework for finding which genes are interrelated in a certain process. A practical problem,...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Mika Gustafsson, Michael Hörnquist
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2010
Materias:
R
Q
Acceso en línea:https://doaj.org/article/8c771880a8b749598b339952ffade835
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:8c771880a8b749598b339952ffade835
record_format dspace
spelling oai:doaj.org-article:8c771880a8b749598b339952ffade8352021-11-25T06:25:50ZGene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.1932-620310.1371/journal.pone.0009134https://doaj.org/article/8c771880a8b749598b339952ffade8352010-02-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/20169069/?tool=EBIhttps://doaj.org/toc/1932-6203<h4>Background</h4>To predict gene expressions is an important endeavour within computational systems biology. It can both be a way to explore how drugs affect the system, as well as providing a framework for finding which genes are interrelated in a certain process. A practical problem, however, is how to assess and discriminate among the various algorithms which have been developed for this purpose. Therefore, the DREAM project invited the year 2008 to a challenge for predicting gene expression values, and here we present the algorithm with best performance.<h4>Methodology/principal findings</h4>We develop an algorithm by exploring various regression schemes with different model selection procedures. It turns out that the most effective scheme is based on least squares, with a penalty term of a recently developed form called the "elastic net". Key components in the algorithm are the integration of expression data from other experimental conditions than those presented for the challenge and the utilization of transcription factor binding data for guiding the inference process towards known interactions. Of importance is also a cross-validation procedure where each form of external data is used only to the extent it increases the expected performance.<h4>Conclusions/significance</h4>Our algorithm proves both the possibility to extract information from large-scale expression data concerning prediction of gene levels, as well as the benefits of integrating different data sources for improving the inference. We believe the former is an important message to those still hesitating on the possibilities for computational approaches, while the latter is part of an important way forward for the future development of the field of computational systems biology.Mika GustafssonMichael HörnquistPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 5, Iss 2, p e9134 (2010)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Mika Gustafsson
Michael Hörnquist
Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.
description <h4>Background</h4>To predict gene expressions is an important endeavour within computational systems biology. It can both be a way to explore how drugs affect the system, as well as providing a framework for finding which genes are interrelated in a certain process. A practical problem, however, is how to assess and discriminate among the various algorithms which have been developed for this purpose. Therefore, the DREAM project invited the year 2008 to a challenge for predicting gene expression values, and here we present the algorithm with best performance.<h4>Methodology/principal findings</h4>We develop an algorithm by exploring various regression schemes with different model selection procedures. It turns out that the most effective scheme is based on least squares, with a penalty term of a recently developed form called the "elastic net". Key components in the algorithm are the integration of expression data from other experimental conditions than those presented for the challenge and the utilization of transcription factor binding data for guiding the inference process towards known interactions. Of importance is also a cross-validation procedure where each form of external data is used only to the extent it increases the expected performance.<h4>Conclusions/significance</h4>Our algorithm proves both the possibility to extract information from large-scale expression data concerning prediction of gene levels, as well as the benefits of integrating different data sources for improving the inference. We believe the former is an important message to those still hesitating on the possibilities for computational approaches, while the latter is part of an important way forward for the future development of the field of computational systems biology.
format article
author Mika Gustafsson
Michael Hörnquist
author_facet Mika Gustafsson
Michael Hörnquist
author_sort Mika Gustafsson
title Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.
title_short Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.
title_full Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.
title_fullStr Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.
title_full_unstemmed Gene expression prediction by soft integration and the elastic net-best performance of the DREAM3 gene expression challenge.
title_sort gene expression prediction by soft integration and the elastic net-best performance of the dream3 gene expression challenge.
publisher Public Library of Science (PLoS)
publishDate 2010
url https://doaj.org/article/8c771880a8b749598b339952ffade835
work_keys_str_mv AT mikagustafsson geneexpressionpredictionbysoftintegrationandtheelasticnetbestperformanceofthedream3geneexpressionchallenge
AT michaelhornquist geneexpressionpredictionbysoftintegrationandtheelasticnetbestperformanceofthedream3geneexpressionchallenge
_version_ 1718413756840542208