Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database

Background: Mechanically ventilated patients are susceptible to nosocomial infections such as ventilator-associated pneumonia. To treat ventilated patients with suspected infection, clinicians select appropriate antibiotics. However, decision-making regarding the use of antibiotics for methicillin-r...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Yohei Hirano, Keito Shinmoto, Yohei Okada, Kazuhiro Suga, Jeffrey Bombard, Shogo Murahata, Manoj Shrestha, Patrick Ocheja, Aiko Tanaka
Formato: article
Lenguaje:EN
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://doaj.org/article/98c4d1d586b148859b63baa43b559719
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:98c4d1d586b148859b63baa43b559719
record_format dspace
spelling oai:doaj.org-article:98c4d1d586b148859b63baa43b5597192021-11-16T05:24:24ZMachine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database2296-858X10.3389/fmed.2021.694520https://doaj.org/article/98c4d1d586b148859b63baa43b5597192021-11-01T00:00:00Zhttps://www.frontiersin.org/articles/10.3389/fmed.2021.694520/fullhttps://doaj.org/toc/2296-858XBackground: Mechanically ventilated patients are susceptible to nosocomial infections such as ventilator-associated pneumonia. To treat ventilated patients with suspected infection, clinicians select appropriate antibiotics. However, decision-making regarding the use of antibiotics for methicillin-resistant Staphylococcus aureus (MRSA) is challenging, because of the lack of evidence-supported criteria. This study aims to derive a machine learning model to predict MRSA as a possible pathogen responsible for infection in mechanically ventilated patients.Methods: Data were collected from the Medical Information Mart for Intensive Care (MIMIC)-IV database (an openly available database of patients treated at the Beth Israel Deaconess Medical Center in the period 2008–2019). Of 26,409 mechanically ventilated patients, 809 were screened for MRSA during the mechanical ventilation period and included in the study. The outcome was positivity to MRSA on screening, which was highly imbalanced in the dataset, with 93.9% positive outcomes. Therefore, after dividing the dataset into a training set (n = 566) and a test set (n = 243) for validation by stratified random sampling with a 7:3 allocation ratio, synthetic datasets with 50% positive outcomes were created by synthetic minority over-sampling for both sets individually (synthetic training set: n = 1,064; synthetic test set: n = 456). Using these synthetic datasets, we trained and validated an XGBoost machine learning model using 28 predictor variables for outcome prediction. Model performance was evaluated by area under the receiver operating characteristic (AUROC), sensitivity, specificity, and other statistical measurements. Feature importance was computed by the Gini method.Results: In validation, the XGBoost model demonstrated reliable outcome prediction with an AUROC value of 0.89 [95% confidence interval (CI): 0.83–0.95]. The model showed a high sensitivity of 0.98 [CI: 0.95–0.99], but a low specificity of 0.47 [CI: 0.41–0.54] and a positive predictive value of 0.65 [CI: 0.62–0.68]. Important predictor variables included admission from the emergency department, insertion of arterial lines, prior quinolone use, hemodialysis, and admission to a surgical intensive care unit.Conclusions: We were able to develop an effective machine learning model to predict positive MRSA screening during mechanical ventilation using synthetic datasets, thus encouraging further research to develop a clinically relevant machine learning model for antibiotics stewardship.Yohei HiranoKeito ShinmotoYohei OkadaKazuhiro SugaJeffrey BombardShogo MurahataManoj ShresthaPatrick OchejaAiko TanakaFrontiers Media S.A.articlepredictionmachine learningmechanical ventilationMethicillin-Resistant Staphylococcus aureus—MRSAoutcomeMedicine (General)R5-920ENFrontiers in Medicine, Vol 8 (2021)
institution DOAJ
collection DOAJ
language EN
topic prediction
machine learning
mechanical ventilation
Methicillin-Resistant Staphylococcus aureus—MRSA
outcome
Medicine (General)
R5-920
spellingShingle prediction
machine learning
mechanical ventilation
Methicillin-Resistant Staphylococcus aureus—MRSA
outcome
Medicine (General)
R5-920
Yohei Hirano
Keito Shinmoto
Yohei Okada
Kazuhiro Suga
Jeffrey Bombard
Shogo Murahata
Manoj Shrestha
Patrick Ocheja
Aiko Tanaka
Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
description Background: Mechanically ventilated patients are susceptible to nosocomial infections such as ventilator-associated pneumonia. To treat ventilated patients with suspected infection, clinicians select appropriate antibiotics. However, decision-making regarding the use of antibiotics for methicillin-resistant Staphylococcus aureus (MRSA) is challenging, because of the lack of evidence-supported criteria. This study aims to derive a machine learning model to predict MRSA as a possible pathogen responsible for infection in mechanically ventilated patients.Methods: Data were collected from the Medical Information Mart for Intensive Care (MIMIC)-IV database (an openly available database of patients treated at the Beth Israel Deaconess Medical Center in the period 2008–2019). Of 26,409 mechanically ventilated patients, 809 were screened for MRSA during the mechanical ventilation period and included in the study. The outcome was positivity to MRSA on screening, which was highly imbalanced in the dataset, with 93.9% positive outcomes. Therefore, after dividing the dataset into a training set (n = 566) and a test set (n = 243) for validation by stratified random sampling with a 7:3 allocation ratio, synthetic datasets with 50% positive outcomes were created by synthetic minority over-sampling for both sets individually (synthetic training set: n = 1,064; synthetic test set: n = 456). Using these synthetic datasets, we trained and validated an XGBoost machine learning model using 28 predictor variables for outcome prediction. Model performance was evaluated by area under the receiver operating characteristic (AUROC), sensitivity, specificity, and other statistical measurements. Feature importance was computed by the Gini method.Results: In validation, the XGBoost model demonstrated reliable outcome prediction with an AUROC value of 0.89 [95% confidence interval (CI): 0.83–0.95]. The model showed a high sensitivity of 0.98 [CI: 0.95–0.99], but a low specificity of 0.47 [CI: 0.41–0.54] and a positive predictive value of 0.65 [CI: 0.62–0.68]. Important predictor variables included admission from the emergency department, insertion of arterial lines, prior quinolone use, hemodialysis, and admission to a surgical intensive care unit.Conclusions: We were able to develop an effective machine learning model to predict positive MRSA screening during mechanical ventilation using synthetic datasets, thus encouraging further research to develop a clinically relevant machine learning model for antibiotics stewardship.
format article
author Yohei Hirano
Keito Shinmoto
Yohei Okada
Kazuhiro Suga
Jeffrey Bombard
Shogo Murahata
Manoj Shrestha
Patrick Ocheja
Aiko Tanaka
author_facet Yohei Hirano
Keito Shinmoto
Yohei Okada
Kazuhiro Suga
Jeffrey Bombard
Shogo Murahata
Manoj Shrestha
Patrick Ocheja
Aiko Tanaka
author_sort Yohei Hirano
title Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title_short Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title_full Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title_fullStr Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title_full_unstemmed Machine Learning Approach to Predict Positive Screening of Methicillin-Resistant Staphylococcus aureus During Mechanical Ventilation Using Synthetic Dataset From MIMIC-IV Database
title_sort machine learning approach to predict positive screening of methicillin-resistant staphylococcus aureus during mechanical ventilation using synthetic dataset from mimic-iv database
publisher Frontiers Media S.A.
publishDate 2021
url https://doaj.org/article/98c4d1d586b148859b63baa43b559719
work_keys_str_mv AT yoheihirano machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT keitoshinmoto machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT yoheiokada machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT kazuhirosuga machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT jeffreybombard machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT shogomurahata machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT manojshrestha machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT patrickocheja machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
AT aikotanaka machinelearningapproachtopredictpositivescreeningofmethicillinresistantstaphylococcusaureusduringmechanicalventilationusingsyntheticdatasetfrommimicivdatabase
_version_ 1718426683270234112