A Novel Calibration Step in Gene Co-Expression Network Construction
High-throughput technologies such as DNA microarrays and RNA-sequencing are used to measure the expression levels of large numbers of genes simultaneously. To support the extraction of biological knowledge, individual gene expression levels are transformed to Gene Co-expression Networks (GCNs). In a...
Guardado en:
Autores principales: | , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/590b63cca351477fad86afbcecf91498 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:590b63cca351477fad86afbcecf91498 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:590b63cca351477fad86afbcecf914982021-11-30T14:07:49ZA Novel Calibration Step in Gene Co-Expression Network Construction2673-764710.3389/fbinf.2021.704817https://doaj.org/article/590b63cca351477fad86afbcecf914982021-11-01T00:00:00Zhttps://www.frontiersin.org/articles/10.3389/fbinf.2021.704817/fullhttps://doaj.org/toc/2673-7647High-throughput technologies such as DNA microarrays and RNA-sequencing are used to measure the expression levels of large numbers of genes simultaneously. To support the extraction of biological knowledge, individual gene expression levels are transformed to Gene Co-expression Networks (GCNs). In a GCN, nodes correspond to genes, and the weight of the connection between two nodes is a measure of similarity in the expression behavior of the two genes. In general, GCN construction and analysis includes three steps; 1) calculating a similarity value for each pair of genes 2) using these similarity values to construct a fully connected weighted network 3) finding clusters of genes in the network, commonly called modules. The specific implementation of these three steps can significantly impact the final output and the downstream biological analysis. GCN construction is a well-studied topic. Existing algorithms rely on relatively simple statistical and mathematical tools to implement these steps. Currently, software package WGCNA appears to be the most widely accepted standard. We hypothesize that the raw features provided by sequencing data can be leveraged to extract modules of higher quality. A novel preprocessing step of the gene expression data set is introduced that in effect calibrates the expression levels of individual genes, before computing pairwise similarities. Further, the similarity is computed as an inner-product of positive vectors. In experiments, this provides a significant improvement over WGCNA, as measured by aggregate p-values of the gene ontology term enrichment of the computed modules.Niloofar AghaieabianeIoannis KoutisFrontiers Media S.A.articleGene co-expression networksSimilarity functionClusteringGene OntologyTopological Overlap MeasureComputer applications to medicine. Medical informaticsR858-859.7ENFrontiers in Bioinformatics, Vol 1 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Gene co-expression networks Similarity function Clustering Gene Ontology Topological Overlap Measure Computer applications to medicine. Medical informatics R858-859.7 |
spellingShingle |
Gene co-expression networks Similarity function Clustering Gene Ontology Topological Overlap Measure Computer applications to medicine. Medical informatics R858-859.7 Niloofar Aghaieabiane Ioannis Koutis A Novel Calibration Step in Gene Co-Expression Network Construction |
description |
High-throughput technologies such as DNA microarrays and RNA-sequencing are used to measure the expression levels of large numbers of genes simultaneously. To support the extraction of biological knowledge, individual gene expression levels are transformed to Gene Co-expression Networks (GCNs). In a GCN, nodes correspond to genes, and the weight of the connection between two nodes is a measure of similarity in the expression behavior of the two genes. In general, GCN construction and analysis includes three steps; 1) calculating a similarity value for each pair of genes 2) using these similarity values to construct a fully connected weighted network 3) finding clusters of genes in the network, commonly called modules. The specific implementation of these three steps can significantly impact the final output and the downstream biological analysis. GCN construction is a well-studied topic. Existing algorithms rely on relatively simple statistical and mathematical tools to implement these steps. Currently, software package WGCNA appears to be the most widely accepted standard. We hypothesize that the raw features provided by sequencing data can be leveraged to extract modules of higher quality. A novel preprocessing step of the gene expression data set is introduced that in effect calibrates the expression levels of individual genes, before computing pairwise similarities. Further, the similarity is computed as an inner-product of positive vectors. In experiments, this provides a significant improvement over WGCNA, as measured by aggregate p-values of the gene ontology term enrichment of the computed modules. |
format |
article |
author |
Niloofar Aghaieabiane Ioannis Koutis |
author_facet |
Niloofar Aghaieabiane Ioannis Koutis |
author_sort |
Niloofar Aghaieabiane |
title |
A Novel Calibration Step in Gene Co-Expression Network Construction |
title_short |
A Novel Calibration Step in Gene Co-Expression Network Construction |
title_full |
A Novel Calibration Step in Gene Co-Expression Network Construction |
title_fullStr |
A Novel Calibration Step in Gene Co-Expression Network Construction |
title_full_unstemmed |
A Novel Calibration Step in Gene Co-Expression Network Construction |
title_sort |
novel calibration step in gene co-expression network construction |
publisher |
Frontiers Media S.A. |
publishDate |
2021 |
url |
https://doaj.org/article/590b63cca351477fad86afbcecf91498 |
work_keys_str_mv |
AT niloofaraghaieabiane anovelcalibrationstepingenecoexpressionnetworkconstruction AT ioanniskoutis anovelcalibrationstepingenecoexpressionnetworkconstruction AT niloofaraghaieabiane novelcalibrationstepingenecoexpressionnetworkconstruction AT ioanniskoutis novelcalibrationstepingenecoexpressionnetworkconstruction |
_version_ |
1718406504745271296 |