Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach
Background: Preterm birth (PTB) is a growing health issue worldwide, currently considered the leading cause of newborn deaths. To address this challenge, the present work aims to develop an algorithm capable of accurately predicting the week of delivery supporting the identification of a PTB in Braz...
Guardado en:
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/a7a8f971ee9d4de898f1f5399ba79c3e |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:a7a8f971ee9d4de898f1f5399ba79c3e |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:a7a8f971ee9d4de898f1f5399ba79c3e2021-11-12T04:50:19ZData-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach2667-193X10.1016/j.lana.2021.100053https://doaj.org/article/a7a8f971ee9d4de898f1f5399ba79c3e2021-11-01T00:00:00Zhttp://www.sciencedirect.com/science/article/pii/S2667193X21000454https://doaj.org/toc/2667-193XBackground: Preterm birth (PTB) is a growing health issue worldwide, currently considered the leading cause of newborn deaths. To address this challenge, the present work aims to develop an algorithm capable of accurately predicting the week of delivery supporting the identification of a PTB in Brazil. Methods: This a population-based study analyzing data from 3,876,666 mothers with live births distributed across the 3,929 Brazilian municipalities. Using indicators comprising delivery characteristics, primary care work processes, and physical infrastructure, and sociodemographic data we applied a machine learning-based approach to estimate the week of delivery at the point of care level. We tested six algorithms: eXtreme Gradient Boosting, Elastic Net, Quantile Ordinal Regression - LASSO, Linear Regression, Ridge Regression and Decision Tree. We used the root-mean-square error (RMSE) as a precision. Findings: All models obtained RMSE indexes close to each other. The lower levels of RMSE were obtained using the eXtreme Gradient Boosting approach which was able to estimate the week of delivery within a 2.09 window 95%IC (2.090–2.097). The five most important variables to predict the week of delivery were: number of previous deliveries through Cesarean-Section, number of prenatal consultations, age of the mother, existence of ultrasound exam available in the care network, and proportion of primary care teams in the municipality registering the oral care consultation. Interpretation: Using simple data describing the prenatal care offered, as well as minimal characteristics of the pregnant, our approach was capable of achieving a relevant predictive performance regarding the week of delivery. Funding: Bill and Melinda Gates Foundation, and National Council for Scientific and Technological Development – Brazil, (Conselho Nacional de Desenvolvimento Científico e Tecnológico - CNPQ acronym in portuguese) Support of the research project named: Data-Driven Risk Stratification for Preterm Birth in Brazil: Development of a Machine Learning-Based Innovation for Health Care- Grant: OPP1202186Thiago Augusto Hernandes RochaErika Bárbara Abreu Fonseca de ThomazDante Grapiuna de AlmeidaNúbia Cristina da SilvaRejane Christine de Sousa QueirozLuciano AndradeLuiz Augusto FacchiniMarcos Luiggi Lemos SartoriDalton Breno CostaMarcos Adriano Garcia CamposAntônio Augusto Moura da SilvaCatherine StatonJoão Ricardo Nickenig VissociElsevierarticlePreterm BirthPredictive Value of TestsPrimary Health CareAppraisal, Health Risk, Machine LearningPublic aspects of medicineRA1-1270ENThe Lancet Regional Health. Americas, Vol 3, Iss , Pp 100053- (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Preterm Birth Predictive Value of Tests Primary Health Care Appraisal, Health Risk, Machine Learning Public aspects of medicine RA1-1270 |
spellingShingle |
Preterm Birth Predictive Value of Tests Primary Health Care Appraisal, Health Risk, Machine Learning Public aspects of medicine RA1-1270 Thiago Augusto Hernandes Rocha Erika Bárbara Abreu Fonseca de Thomaz Dante Grapiuna de Almeida Núbia Cristina da Silva Rejane Christine de Sousa Queiroz Luciano Andrade Luiz Augusto Facchini Marcos Luiggi Lemos Sartori Dalton Breno Costa Marcos Adriano Garcia Campos Antônio Augusto Moura da Silva Catherine Staton João Ricardo Nickenig Vissoci Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach |
description |
Background: Preterm birth (PTB) is a growing health issue worldwide, currently considered the leading cause of newborn deaths. To address this challenge, the present work aims to develop an algorithm capable of accurately predicting the week of delivery supporting the identification of a PTB in Brazil. Methods: This a population-based study analyzing data from 3,876,666 mothers with live births distributed across the 3,929 Brazilian municipalities. Using indicators comprising delivery characteristics, primary care work processes, and physical infrastructure, and sociodemographic data we applied a machine learning-based approach to estimate the week of delivery at the point of care level. We tested six algorithms: eXtreme Gradient Boosting, Elastic Net, Quantile Ordinal Regression - LASSO, Linear Regression, Ridge Regression and Decision Tree. We used the root-mean-square error (RMSE) as a precision. Findings: All models obtained RMSE indexes close to each other. The lower levels of RMSE were obtained using the eXtreme Gradient Boosting approach which was able to estimate the week of delivery within a 2.09 window 95%IC (2.090–2.097). The five most important variables to predict the week of delivery were: number of previous deliveries through Cesarean-Section, number of prenatal consultations, age of the mother, existence of ultrasound exam available in the care network, and proportion of primary care teams in the municipality registering the oral care consultation. Interpretation: Using simple data describing the prenatal care offered, as well as minimal characteristics of the pregnant, our approach was capable of achieving a relevant predictive performance regarding the week of delivery. Funding: Bill and Melinda Gates Foundation, and National Council for Scientific and Technological Development – Brazil, (Conselho Nacional de Desenvolvimento Científico e Tecnológico - CNPQ acronym in portuguese) Support of the research project named: Data-Driven Risk Stratification for Preterm Birth in Brazil: Development of a Machine Learning-Based Innovation for Health Care- Grant: OPP1202186 |
format |
article |
author |
Thiago Augusto Hernandes Rocha Erika Bárbara Abreu Fonseca de Thomaz Dante Grapiuna de Almeida Núbia Cristina da Silva Rejane Christine de Sousa Queiroz Luciano Andrade Luiz Augusto Facchini Marcos Luiggi Lemos Sartori Dalton Breno Costa Marcos Adriano Garcia Campos Antônio Augusto Moura da Silva Catherine Staton João Ricardo Nickenig Vissoci |
author_facet |
Thiago Augusto Hernandes Rocha Erika Bárbara Abreu Fonseca de Thomaz Dante Grapiuna de Almeida Núbia Cristina da Silva Rejane Christine de Sousa Queiroz Luciano Andrade Luiz Augusto Facchini Marcos Luiggi Lemos Sartori Dalton Breno Costa Marcos Adriano Garcia Campos Antônio Augusto Moura da Silva Catherine Staton João Ricardo Nickenig Vissoci |
author_sort |
Thiago Augusto Hernandes Rocha |
title |
Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach |
title_short |
Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach |
title_full |
Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach |
title_fullStr |
Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach |
title_full_unstemmed |
Data-driven risk stratification for preterm birth in Brazil: a population-based study to develop of a machine learning risk assessment approach |
title_sort |
data-driven risk stratification for preterm birth in brazil: a population-based study to develop of a machine learning risk assessment approach |
publisher |
Elsevier |
publishDate |
2021 |
url |
https://doaj.org/article/a7a8f971ee9d4de898f1f5399ba79c3e |
work_keys_str_mv |
AT thiagoaugustohernandesrocha datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT erikabarbaraabreufonsecadethomaz datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT dantegrapiunadealmeida datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT nubiacristinadasilva datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT rejanechristinedesousaqueiroz datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT lucianoandrade datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT luizaugustofacchini datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT marcosluiggilemossartori datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT daltonbrenocosta datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT marcosadrianogarciacampos datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT antonioaugustomouradasilva datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT catherinestaton datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach AT joaoricardonickenigvissoci datadrivenriskstratificationforpretermbirthinbrazilapopulationbasedstudytodevelopofamachinelearningriskassessmentapproach |
_version_ |
1718431189427027968 |