Estimation of Diabetes in a High-Risk Adult Chinese Population Using J48 Decision Tree Model

Dongmei Pei, Tengfei Yang, Chengpu Zhang
Department of Health Management, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China
Correspondence: Dongmei Pei, Department of Health Management, Shengjing Hospital of China Medical University, No. 36, Sanhao Street, H...

Bibliographic Details
Main Authors: Pei D, Yang T, Zhang C
Format: article
Language: EN
Published: Dove Medical Press 2020
Subjects: diabetes, J48 algorithm, decision tree, risk factors, Specialties of internal medicine, RC581-951
Online Access: https://doaj.org/article/a8a13ee4911048bd90981e44e173c7cd
id oai:doaj.org-article:a8a13ee4911048bd90981e44e173c7cd
record_format dspace
spelling oai:doaj.org-article:a8a13ee4911048bd90981e44e173c7cd
2021-12-02T11:21:38Z
Estimation of Diabetes in a High-Risk Adult Chinese Population Using J48 Decision Tree Model
1178-7007
https://doaj.org/article/a8a13ee4911048bd90981e44e173c7cd
2020-11-01T00:00:00Z
https://www.dovepress.com/estimation-of-diabetes-in-a-high-risk-adult-chinese-population-using-j-peer-reviewed-article-DMSO
https://doaj.org/toc/1178-7007
Pei D
Yang T
Zhang C
Dove Medical Press
article
diabetes
j48 algorithm
decision tree
risk factors
Specialties of internal medicine
RC581-951
EN
Diabetes, Metabolic Syndrome and Obesity: Targets and Therapy, Vol 13, Pp 4621-4630 (2020)
institution DOAJ
collection DOAJ
language EN
topic diabetes
j48 algorithm
decision tree
risk factors
Specialties of internal medicine
RC581-951
spellingShingle diabetes
j48 algorithm
decision tree
risk factors
Specialties of internal medicine
RC581-951
Pei D
Yang T
Zhang C
Estimation of Diabetes in a High-Risk Adult Chinese Population Using J48 Decision Tree Model
description Dongmei Pei, Tengfei Yang, Chengpu Zhang
Department of Health Management, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China
Correspondence: Dongmei Pei, Department of Health Management, Shengjing Hospital of China Medical University, No. 36, Sanhao Street, Heping District, Shenyang 110004, People’s Republic of China. Email: peidm1111@hotmail.com
Background: Early prediction and diagnosis of diabetes, one of the most devastating diseases globally, is critical in high-risk populations. Conventional blood tests are recommended for screening suspected patients; however, these tests can have adverse health effects and are costly. The goal of this study was to establish a simple and reliable predictive model based on the risk factors associated with diabetes using a decision tree algorithm.
Methods: This was a retrospective cross-sectional study. A total of 10,436 participants who had a health check-up from January 2017 to July 2017 were recruited. After data mining approaches were applied, 3454 participants remained in the final dataset for further analysis. These participants were randomly allocated to either the training dataset (70%, 2420 cases), used to construct the decision tree, or the testing dataset (30%, 1034 cases), used to evaluate its performance. The cost-sensitive J48 algorithm was used to develop the decision tree model.
Results: Utilizing all the key features of the dataset, which consisted of 14 input variables and two output variables, the constructed decision tree model identified several key factors that are closely linked to the development of diabetes and are also modifiable. Furthermore, the model achieved a classification accuracy of 90.3%, with a precision of 89.7% and a recall of 90.3%.
Conclusion: By applying simple and cost-effective classification rules, the decision tree model estimates the development of diabetes in a high-risk adult Chinese population, with strong potential for use in diabetes management.
Keywords: diabetes, J48 algorithm, decision tree, risk factors
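The split and evaluation described in the abstract can be illustrated with a short Python sketch. This is not the authors' code: the abstract names the cost-sensitive J48 algorithm but gives no implementation details, so no classifier is trained here. The function names `split_indices` and `weighted_metrics`, the random seed, and the confusion matrix are all hypothetical. The sketch also assumes that the reported precision and recall are class-weighted averages, which the record does not confirm. Under that assumption, weighted recall always equals accuracy, which would be consistent with both being reported as 90.3%.

```python
import random
from typing import Dict, List, Tuple


def split_indices(n: int, n_train: int, seed: int = 0) -> Tuple[List[int], List[int]]:
    """Randomly split n case indices into training and testing index lists."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)  # seed is an arbitrary, hypothetical choice
    return indices[:n_train], indices[n_train:]


def weighted_metrics(cm: List[List[int]]) -> Dict[str, float]:
    """Accuracy plus class-weighted precision and recall.

    cm[i][j] is the number of cases whose true class is i and predicted class is j.
    """
    n_classes = len(cm)
    total = sum(sum(row) for row in cm)
    correct = sum(cm[k][k] for k in range(n_classes))
    precision = 0.0
    recall = 0.0
    for k in range(n_classes):
        support = sum(cm[k])                                  # true cases of class k
        predicted = sum(cm[i][k] for i in range(n_classes))   # cases predicted as class k
        class_precision = cm[k][k] / predicted if predicted else 0.0
        class_recall = cm[k][k] / support if support else 0.0
        weight = support / total                              # class share of the test set
        precision += weight * class_precision
        recall += weight * class_recall
    return {"accuracy": correct / total, "precision": precision, "recall": recall}


# Split sizes taken from the abstract: 3454 cases, 2420 for training, 1034 for testing.
train_idx, test_idx = split_indices(3454, 2420)

# Hypothetical confusion matrix for a 1034-case test set (NOT the study's results).
example_cm = [[850, 60],
              [50, 74]]
metrics = weighted_metrics(example_cm)
```

With any confusion matrix, `metrics["recall"]` matches `metrics["accuracy"]` up to floating-point rounding, because the class weights cancel the per-class denominators.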
format article
author Pei D
Yang T
Zhang C
author_facet Pei D
Yang T
Zhang C
author_sort Pei D
title Estimation of Diabetes in a High-Risk Adult Chinese Population Using J48 Decision Tree Model
title_short Estimation of Diabetes in a High-Risk Adult Chinese Population Using J48 Decision Tree Model
title_full Estimation of Diabetes in a High-Risk Adult Chinese Population Using J48 Decision Tree Model
title_fullStr Estimation of Diabetes in a High-Risk Adult Chinese Population Using J48 Decision Tree Model
title_full_unstemmed Estimation of Diabetes in a High-Risk Adult Chinese Population Using J48 Decision Tree Model
title_sort estimation of diabetes in a high-risk adult chinese population using j48 decision tree model
publisher Dove Medical Press
publishDate 2020
url https://doaj.org/article/a8a13ee4911048bd90981e44e173c7cd
work_keys_str_mv AT peid estimationofdiabetesinahighriskadultchinesepopulationusingj48decisiontreemodel
AT yangt estimationofdiabetesinahighriskadultchinesepopulationusingj48decisiontreemodel
AT zhangc estimationofdiabetesinahighriskadultchinesepopulationusingj48decisiontreemodel
_version_ 1718395955924959232