Utilization of K-nearest neighbor algorithm for classification of white blood cells in AML M4, M5, and M7
Acute myeloid leukemia (AML) M4, M5, and M7 are subtypes of leukemia derived from myeloid cell derivatives that influences the results of the identification of AMLs, which includes myeloblast, monoblast, and megakaryoblast. Furthermore, they are divided into more specific types, including myeloblast...
Guardado en:
Autores principales: | , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
De Gruyter
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/dff44182cbd54bc3b70102fece130250 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:dff44182cbd54bc3b70102fece130250 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:dff44182cbd54bc3b70102fece1302502021-12-05T14:10:46ZUtilization of K-nearest neighbor algorithm for classification of white blood cells in AML M4, M5, and M72391-543910.1515/eng-2021-0065https://doaj.org/article/dff44182cbd54bc3b70102fece1302502021-05-01T00:00:00Zhttps://doi.org/10.1515/eng-2021-0065https://doaj.org/toc/2391-5439Acute myeloid leukemia (AML) M4, M5, and M7 are subtypes of leukemia derived from myeloid cell derivatives that influences the results of the identification of AMLs, which includes myeloblast, monoblast, and megakaryoblast. Furthermore, they are divided into more specific types, including myeloblasts, promyelocytes, monoblasts, promonocytes, monocytes, and megakaryoblasts, which must be clearly identified in order to further calculate the ratio value in the blood. Therefore, this research aims to classify these cell types using the K-nearest neighbor (KNN) algorithm. Three distance metrics are tested, namely, Euclidean, Chebychev, and Minkowski, and both the weighted and unweighted were tested. The features used as parameters are area, nucleus ratio, circularity, perimeter, mean, and standard deviation, and about 1,450 objects are used as training and testing data. In addition, to ensure that the classification is not overfitting, K-fold cross validation was conducted. The results show that the unweighted Minkowski distance acquired about 240 of 290 objects at K = 19, which is the best. Therefore, the unweighted Minkowski distance is selected for further analysis. The accuracy, recall, and precision values of KNN with unweighted Minkowski distance obtained from fivefold cross validation are 80.552, 44.145, and 42.592%, respectively.Prakisya Nurcahya Pradana TaufikLiantoni FebriHatta PuspandaAristyagama Yusfia HafidSetiawan AndikaDe Gruyterarticleacute myeloid leukemiaclassificationk-nearest neighborwhite blood cellsEngineering (General). Civil engineering (General)TA1-2040ENOpen Engineering, Vol 11, Iss 1, Pp 662-668 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
acute myeloid leukemia classification k-nearest neighbor white blood cells Engineering (General). Civil engineering (General) TA1-2040 |
spellingShingle |
acute myeloid leukemia classification k-nearest neighbor white blood cells Engineering (General). Civil engineering (General) TA1-2040 Prakisya Nurcahya Pradana Taufik Liantoni Febri Hatta Puspanda Aristyagama Yusfia Hafid Setiawan Andika Utilization of K-nearest neighbor algorithm for classification of white blood cells in AML M4, M5, and M7 |
description |
Acute myeloid leukemia (AML) M4, M5, and M7 are subtypes of leukemia derived from myeloid cell derivatives that influences the results of the identification of AMLs, which includes myeloblast, monoblast, and megakaryoblast. Furthermore, they are divided into more specific types, including myeloblasts, promyelocytes, monoblasts, promonocytes, monocytes, and megakaryoblasts, which must be clearly identified in order to further calculate the ratio value in the blood. Therefore, this research aims to classify these cell types using the K-nearest neighbor (KNN) algorithm. Three distance metrics are tested, namely, Euclidean, Chebychev, and Minkowski, and both the weighted and unweighted were tested. The features used as parameters are area, nucleus ratio, circularity, perimeter, mean, and standard deviation, and about 1,450 objects are used as training and testing data. In addition, to ensure that the classification is not overfitting, K-fold cross validation was conducted. The results show that the unweighted Minkowski distance acquired about 240 of 290 objects at K = 19, which is the best. Therefore, the unweighted Minkowski distance is selected for further analysis. The accuracy, recall, and precision values of KNN with unweighted Minkowski distance obtained from fivefold cross validation are 80.552, 44.145, and 42.592%, respectively. |
format |
article |
author |
Prakisya Nurcahya Pradana Taufik Liantoni Febri Hatta Puspanda Aristyagama Yusfia Hafid Setiawan Andika |
author_facet |
Prakisya Nurcahya Pradana Taufik Liantoni Febri Hatta Puspanda Aristyagama Yusfia Hafid Setiawan Andika |
author_sort |
Prakisya Nurcahya Pradana Taufik |
title |
Utilization of K-nearest neighbor algorithm for classification of white blood cells in AML M4, M5, and M7 |
title_short |
Utilization of K-nearest neighbor algorithm for classification of white blood cells in AML M4, M5, and M7 |
title_full |
Utilization of K-nearest neighbor algorithm for classification of white blood cells in AML M4, M5, and M7 |
title_fullStr |
Utilization of K-nearest neighbor algorithm for classification of white blood cells in AML M4, M5, and M7 |
title_full_unstemmed |
Utilization of K-nearest neighbor algorithm for classification of white blood cells in AML M4, M5, and M7 |
title_sort |
utilization of k-nearest neighbor algorithm for classification of white blood cells in aml m4, m5, and m7 |
publisher |
De Gruyter |
publishDate |
2021 |
url |
https://doaj.org/article/dff44182cbd54bc3b70102fece130250 |
work_keys_str_mv |
AT prakisyanurcahyapradanataufik utilizationofknearestneighboralgorithmforclassificationofwhitebloodcellsinamlm4m5andm7 AT liantonifebri utilizationofknearestneighboralgorithmforclassificationofwhitebloodcellsinamlm4m5andm7 AT hattapuspanda utilizationofknearestneighboralgorithmforclassificationofwhitebloodcellsinamlm4m5andm7 AT aristyagamayusfiahafid utilizationofknearestneighboralgorithmforclassificationofwhitebloodcellsinamlm4m5andm7 AT setiawanandika utilizationofknearestneighboralgorithmforclassificationofwhitebloodcellsinamlm4m5andm7 |
_version_ |
1718371749215600640 |