A Stacking Ensemble Model to Predict Daily Number of Hospital Admissions for Cardiovascular Diseases

With lifestyle and environmental changes, the prevalence of cardiovascular diseases (CVDs) is trending upwards, putting pressure on the limited medical resources. Accurate forecasting of daily counts of hospital admissions (HAs) for CVDs is helpful to optimize medical resources. In this study, we pr...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Zhixu Hu, Hang Qiu, Ziqi Su, Minghui Shen, Ziyu Chen
Formato: article
Lenguaje:EN
Publicado: IEEE 2020
Materias:
Acceso en línea:https://doaj.org/article/10c4a7c71c6e4c85b917bf094e2a989e
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
Descripción
Sumario:With lifestyle and environmental changes, the prevalence of cardiovascular diseases (CVDs) is trending upwards, putting pressure on the limited medical resources. Accurate forecasting of daily counts of hospital admissions (HAs) for CVDs is helpful to optimize medical resources. In this study, we proposed a stacking ensemble model with direct prediction strategy to predict the daily number of CVDs admissions using HAs data, air pollution data, and meteorological data. The sequential forward floating selection method with early stopping was applied for feature selection. Five machine learning models, including linear regression (LR), support vector regression (SVR), extreme gradient boosting (XGBoost), random forest (RF), and gradient boosting decision tree (GBDT), were utilized as base learners to construct the stacking model. We compared the performance of the proposed stacking model with the five base learners in three datasets. The experimental results indicated that our model performed best in three datasets under four evaluation criteria, including mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE), and coefficient of determination (R<sup>2</sup>). Particularly, in the CVDs dataset, the MAPE is 15.103 for LR, 11.862 for SVR, 10.571 for XGBoost, 10.378 for GBDT, 10.333 for RF, and 9.679 for the stacking model. Compared with the best base learner RF, the MAPE, RMSE, and MAE of the stacking model decreased by 6.3&#x0025;, 7.4&#x0025;, and 6.3&#x0025;, respectively, and the R<sup>2</sup> improved by 1.7&#x0025;. It is evident that the proposed stacking model can effectively forecast the daily number of hospitalizations for CVDs and provide decision support for hospital managers.