Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach

Background: Preterm birth (PTB) is a growing health issue worldwide, currently considered the leading cause of newborn deaths. To address this challenge, the present work aims to develop an algorithm capable of accurately predicting the week of delivery supporting the identification of a PTB in Braz...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Thiago Augusto Hernandes Rocha, Erika Bárbara Abreu Fonseca de Thomaz, Dante Grapiuna de Almeida, Núbia Cristina da Silva, Rejane Christine de Sousa Queiroz, Luciano Andrade, Luiz Augusto Facchini, Marcos Luiggi Lemos Sartori, Dalton Breno Costa, Marcos Adriano Garcia Campos, Antônio Augusto Moura da Silva, Catherine Staton, João Ricardo Nickenig Vissoci
Formato: article
Lenguaje:EN
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://doaj.org/article/a7a8f971ee9d4de898f1f5399ba79c3e
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:a7a8f971ee9d4de898f1f5399ba79c3e
record_format dspace
spelling oai:doaj.org-article:a7a8f971ee9d4de898f1f5399ba79c3e2021-11-12T04:50:19ZData-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach2667-193X10.1016/j.lana.2021.100053https://doaj.org/article/a7a8f971ee9d4de898f1f5399ba79c3e2021-11-01T00:00:00Zhttp://www.sciencedirect.com/science/article/pii/S2667193X21000454https://doaj.org/toc/2667-193XBackground: Preterm birth (PTB) is a growing health issue worldwide, currently considered the leading cause of newborn deaths. To address this challenge, the present work aims to develop an algorithm capable of accurately predicting the week of delivery supporting the identification of a PTB in Brazil. Methods: This a population-based study analyzing data from 3,876,666 mothers with live births distributed across the 3,929 Brazilian municipalities. Using indicators comprising delivery characteristics, primary care work processes, and physical infrastructure, and sociodemographic data we applied a machine learning-based approach to estimate the week of delivery at the point of care level. We tested six algorithms: eXtreme Gradient Boosting, Elastic Net, Quantile Ordinal Regression - LASSO, Linear Regression, Ridge Regression and Decision Tree. We used the root-mean-square error (RMSE) as a precision. Findings: All models obtained RMSE indexes close to each other. The lower levels of RMSE were obtained using the eXtreme Gradient Boosting approach which was able to estimate the week of delivery within a 2.09 window 95%IC (2.090–2.097). The five most important variables to predict the week of delivery were: number of previous deliveries through Cesarean-Section, number of prenatal consultations, age of the mother, existence of ultrasound exam available in the care network, and proportion of primary care teams in the municipality registering the oral care consultation. Interpretation: Using simple data describing the prenatal care offered, as well as minimal characteristics of the pregnant, our approach was capable of achieving a relevant predictive performance regarding the week of delivery. Funding: Bill and Melinda Gates Foundation, and National Council for Scientific and Technological Development – Brazil, (Conselho Nacional de Desenvolvimento Científico e Tecnológico - CNPQ acronym in portuguese) Support of the research project named: Data-Driven Risk Stratification for Preterm Birth in Brazil: Development of a Machine Learning-Based Innovation for Health Care- Grant: OPP1202186Thiago Augusto Hernandes RochaErika Bárbara Abreu Fonseca de ThomazDante Grapiuna de AlmeidaNúbia Cristina da SilvaRejane Christine de Sousa QueirozLuciano AndradeLuiz Augusto FacchiniMarcos Luiggi Lemos SartoriDalton Breno CostaMarcos Adriano Garcia CamposAntônio Augusto Moura da SilvaCatherine StatonJoão Ricardo Nickenig VissociElsevierarticlePreterm BirthPredictive Value of TestsPrimary Health CareAppraisal, Health Risk, Machine LearningPublic aspects of medicineRA1-1270ENThe Lancet Regional Health. Americas, Vol 3, Iss , Pp 100053- (2021)
institution DOAJ
collection DOAJ
language EN
topic Preterm Birth
Predictive Value of Tests
Primary Health Care
Appraisal, Health Risk, Machine Learning
Public aspects of medicine
RA1-1270
spellingShingle Preterm Birth
Predictive Value of Tests
Primary Health Care
Appraisal, Health Risk, Machine Learning
Public aspects of medicine
RA1-1270
Thiago Augusto Hernandes Rocha
Erika Bárbara Abreu Fonseca de Thomaz
Dante Grapiuna de Almeida
Núbia Cristina da Silva
Rejane Christine de Sousa Queiroz
Luciano Andrade
Luiz Augusto Facchini
Marcos Luiggi Lemos Sartori
Dalton Breno Costa
Marcos Adriano Garcia Campos
Antônio Augusto Moura da Silva
Catherine Staton
João Ricardo Nickenig Vissoci
Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach
description Background: Preterm birth (PTB) is a growing health issue worldwide, currently considered the leading cause of newborn deaths. To address this challenge, the present work aims to develop an algorithm capable of accurately predicting the week of delivery supporting the identification of a PTB in Brazil. Methods: This a population-based study analyzing data from 3,876,666 mothers with live births distributed across the 3,929 Brazilian municipalities. Using indicators comprising delivery characteristics, primary care work processes, and physical infrastructure, and sociodemographic data we applied a machine learning-based approach to estimate the week of delivery at the point of care level. We tested six algorithms: eXtreme Gradient Boosting, Elastic Net, Quantile Ordinal Regression - LASSO, Linear Regression, Ridge Regression and Decision Tree. We used the root-mean-square error (RMSE) as a precision. Findings: All models obtained RMSE indexes close to each other. The lower levels of RMSE were obtained using the eXtreme Gradient Boosting approach which was able to estimate the week of delivery within a 2.09 window 95%IC (2.090–2.097). The five most important variables to predict the week of delivery were: number of previous deliveries through Cesarean-Section, number of prenatal consultations, age of the mother, existence of ultrasound exam available in the care network, and proportion of primary care teams in the municipality registering the oral care consultation. Interpretation: Using simple data describing the prenatal care offered, as well as minimal characteristics of the pregnant, our approach was capable of achieving a relevant predictive performance regarding the week of delivery. Funding: Bill and Melinda Gates Foundation, and National Council for Scientific and Technological Development – Brazil, (Conselho Nacional de Desenvolvimento Científico e Tecnológico - CNPQ acronym in portuguese) Support of the research project named: Data-Driven Risk Stratification for Preterm Birth in Brazil: Development of a Machine Learning-Based Innovation for Health Care- Grant: OPP1202186
format article
author Thiago Augusto Hernandes Rocha
Erika Bárbara Abreu Fonseca de Thomaz
Dante Grapiuna de Almeida
Núbia Cristina da Silva
Rejane Christine de Sousa Queiroz
Luciano Andrade
Luiz Augusto Facchini
Marcos Luiggi Lemos Sartori
Dalton Breno Costa
Marcos Adriano Garcia Campos
Antônio Augusto Moura da Silva
Catherine Staton
João Ricardo Nickenig Vissoci
author_facet Thiago Augusto Hernandes Rocha
Erika Bárbara Abreu Fonseca de Thomaz
Dante Grapiuna de Almeida
Núbia Cristina da Silva
Rejane Christine de Sousa Queiroz
Luciano Andrade
Luiz Augusto Facchini
Marcos Luiggi Lemos Sartori
Dalton Breno Costa
Marcos Adriano Garcia Campos
Antônio Augusto Moura da Silva
Catherine Staton
João Ricardo Nickenig Vissoci
author_sort Thiago Augusto Hernandes Rocha
title Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach
title_short Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach
title_full Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach
title_fullStr Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach
title_full_unstemmed Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach
title_sort data-driven risk stratification for preterm birth in brazil: a population-based study to develop of a machine learning risk assessment approach
publisher Elsevier
publishDate 2021
url https://doaj.org/article/a7a8f971ee9d4de898f1f5399ba79c3e
work_keys_str_mv AT thiagoaugustohernandesrocha datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT erikabarbaraabreufonsecadethomaz datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT dantegrapiunadealmeida datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT nubiacristinadasilva datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT rejanechristinedesousaqueiroz datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT lucianoandrade datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT luizaugustofacchini datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT marcosluiggilemossartori datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT daltonbrenocosta datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT marcosadrianogarciacampos datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT antonioaugustomouradasilva datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT catherinestaton datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
AT joaoricardonickenigvissoci datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach
_version_ 1718431189427027968