A novel probabilistic generator for large-scale gene association networks.

<h4>Motivation</h4>Gene expression data provide an opportunity for reverse-engineering gene-gene associations using network inference methods. However, it is difficult to assess the performance of these methods because the true underlying network is unknown in real data. Current benchmar...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Tyler Grimes, Somnath Datta
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/c3d08e3a71be49e1b6f07ff421531729
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:c3d08e3a71be49e1b6f07ff421531729
record_format dspace
spelling oai:doaj.org-article:c3d08e3a71be49e1b6f07ff4215317292021-12-02T20:13:17ZA novel probabilistic generator for large-scale gene association networks.1932-620310.1371/journal.pone.0259193https://doaj.org/article/c3d08e3a71be49e1b6f07ff4215317292021-01-01T00:00:00Zhttps://doi.org/10.1371/journal.pone.0259193https://doaj.org/toc/1932-6203<h4>Motivation</h4>Gene expression data provide an opportunity for reverse-engineering gene-gene associations using network inference methods. However, it is difficult to assess the performance of these methods because the true underlying network is unknown in real data. Current benchmarks address this problem by subsampling a known regulatory network to conduct simulations. But the topology of regulatory networks can vary greatly across organisms or tissues, and reference-based generators-such as GeneNetWeaver-are not designed to capture this heterogeneity. This means, for example, benchmark results from the E. coli regulatory network will not carry over to other organisms or tissues. In contrast, probabilistic generators do not require a reference network, and they have the potential to capture a rich distribution of topologies. This makes probabilistic generators an ideal approach for obtaining a robust benchmarking of network inference methods.<h4>Results</h4>We propose a novel probabilistic network generator that (1) provides an alternative to address the inherent limitation of reference-based generators and (2) is able to create realistic gene association networks, and (3) captures the heterogeneity found across gold-standard networks better than existing generators used in practice. Eight organism-specific and 12 human tissue-specific gold-standard association networks are considered. Several measures of global topology are used to determine the similarity of generated networks to the gold-standards. Along with demonstrating the variability of network structure across organisms and tissues, we show that the commonly used "scale-free" model is insufficient for replicating these structures.<h4>Availability</h4>This generator is implemented in the R package "SeqNet" and is available on CRAN (https://cran.r-project.org/web/packages/SeqNet/index.html).Tyler GrimesSomnath DattaPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 16, Iss 11, p e0259193 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Tyler Grimes
Somnath Datta
A novel probabilistic generator for large-scale gene association networks.
description <h4>Motivation</h4>Gene expression data provide an opportunity for reverse-engineering gene-gene associations using network inference methods. However, it is difficult to assess the performance of these methods because the true underlying network is unknown in real data. Current benchmarks address this problem by subsampling a known regulatory network to conduct simulations. But the topology of regulatory networks can vary greatly across organisms or tissues, and reference-based generators-such as GeneNetWeaver-are not designed to capture this heterogeneity. This means, for example, benchmark results from the E. coli regulatory network will not carry over to other organisms or tissues. In contrast, probabilistic generators do not require a reference network, and they have the potential to capture a rich distribution of topologies. This makes probabilistic generators an ideal approach for obtaining a robust benchmarking of network inference methods.<h4>Results</h4>We propose a novel probabilistic network generator that (1) provides an alternative to address the inherent limitation of reference-based generators and (2) is able to create realistic gene association networks, and (3) captures the heterogeneity found across gold-standard networks better than existing generators used in practice. Eight organism-specific and 12 human tissue-specific gold-standard association networks are considered. Several measures of global topology are used to determine the similarity of generated networks to the gold-standards. Along with demonstrating the variability of network structure across organisms and tissues, we show that the commonly used "scale-free" model is insufficient for replicating these structures.<h4>Availability</h4>This generator is implemented in the R package "SeqNet" and is available on CRAN (https://cran.r-project.org/web/packages/SeqNet/index.html).
format article
author Tyler Grimes
Somnath Datta
author_facet Tyler Grimes
Somnath Datta
author_sort Tyler Grimes
title A novel probabilistic generator for large-scale gene association networks.
title_short A novel probabilistic generator for large-scale gene association networks.
title_full A novel probabilistic generator for large-scale gene association networks.
title_fullStr A novel probabilistic generator for large-scale gene association networks.
title_full_unstemmed A novel probabilistic generator for large-scale gene association networks.
title_sort novel probabilistic generator for large-scale gene association networks.
publisher Public Library of Science (PLoS)
publishDate 2021
url https://doaj.org/article/c3d08e3a71be49e1b6f07ff421531729
work_keys_str_mv AT tylergrimes anovelprobabilisticgeneratorforlargescalegeneassociationnetworks
AT somnathdatta anovelprobabilisticgeneratorforlargescalegeneassociationnetworks
AT tylergrimes novelprobabilisticgeneratorforlargescalegeneassociationnetworks
AT somnathdatta novelprobabilisticgeneratorforlargescalegeneassociationnetworks
_version_ 1718374789860556800