Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis
Determining the number of clusters in high-dimensional real-life datasets and interpreting the final outcome are among the challenging problems in data science. Discovering the number of classes in cancer and microarray data plays a vital role in the treatment and diagnosis of cancers and other rela...
Guardado en:
Autor principal: | |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Hindawi Limited
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/62f6f6e5f3de45cd9b6d725e02fff1e9 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:62f6f6e5f3de45cd9b6d725e02fff1e9 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:62f6f6e5f3de45cd9b6d725e02fff1e92021-11-22T01:10:25ZCoordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis1687-004210.1155/2021/6675829https://doaj.org/article/62f6f6e5f3de45cd9b6d725e02fff1e92021-01-01T00:00:00Zhttp://dx.doi.org/10.1155/2021/6675829https://doaj.org/toc/1687-0042Determining the number of clusters in high-dimensional real-life datasets and interpreting the final outcome are among the challenging problems in data science. Discovering the number of classes in cancer and microarray data plays a vital role in the treatment and diagnosis of cancers and other related diseases. Nonnegative matrix factorization (NMF) plays a paramount role as an efficient data exploratory tool for extracting basis features inherent in massive data. Some algorithms which are based on incorporating sparsity constraints in the nonconvex NMF optimization problem are applied in the past for analyzing microarray datasets. However, to the best of our knowledge, none of these algorithms use block coordinate descent method which is known for providing closed form solutions. In this paper, we apply an algorithm developed based on columnwise partitioning and rank-one matrix approximation. We test this algorithm on two well-known cancer datasets: leukemia and multiple myeloma. The numerical results indicate that the proposed algorithm performs significantly better than related state-of-the-art methods. In particular, it is shown that this method is capable of robust clustering and discovering larger cancer classes in which the cluster splits are stable.Melisew Tefera BelachewHindawi LimitedarticleMathematicsQA1-939ENJournal of Applied Mathematics, Vol 2021 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Mathematics QA1-939 |
spellingShingle |
Mathematics QA1-939 Melisew Tefera Belachew Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis |
description |
Determining the number of clusters in high-dimensional real-life datasets and interpreting the final outcome are among the challenging problems in data science. Discovering the number of classes in cancer and microarray data plays a vital role in the treatment and diagnosis of cancers and other related diseases. Nonnegative matrix factorization (NMF) plays a paramount role as an efficient data exploratory tool for extracting basis features inherent in massive data. Some algorithms which are based on incorporating sparsity constraints in the nonconvex NMF optimization problem are applied in the past for analyzing microarray datasets. However, to the best of our knowledge, none of these algorithms use block coordinate descent method which is known for providing closed form solutions. In this paper, we apply an algorithm developed based on columnwise partitioning and rank-one matrix approximation. We test this algorithm on two well-known cancer datasets: leukemia and multiple myeloma. The numerical results indicate that the proposed algorithm performs significantly better than related state-of-the-art methods. In particular, it is shown that this method is capable of robust clustering and discovering larger cancer classes in which the cluster splits are stable. |
format |
article |
author |
Melisew Tefera Belachew |
author_facet |
Melisew Tefera Belachew |
author_sort |
Melisew Tefera Belachew |
title |
Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis |
title_short |
Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis |
title_full |
Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis |
title_fullStr |
Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis |
title_full_unstemmed |
Coordinate Descent-Based Sparse Nonnegative Matrix Factorization for Robust Cancer-Class Discovery and Microarray Data Analysis |
title_sort |
coordinate descent-based sparse nonnegative matrix factorization for robust cancer-class discovery and microarray data analysis |
publisher |
Hindawi Limited |
publishDate |
2021 |
url |
https://doaj.org/article/62f6f6e5f3de45cd9b6d725e02fff1e9 |
work_keys_str_mv |
AT melisewteferabelachew coordinatedescentbasedsparsenonnegativematrixfactorizationforrobustcancerclassdiscoveryandmicroarraydataanalysis |
_version_ |
1718418341430820864 |