Privacy-first health research with federated learning
Abstract Privacy protection is paramount in conducting health research. However, studies often rely on data stored in a centralized repository, where analysis is done with full access to the sensitive underlying content. Recent advances in federated learning enable building complex machine-learned m...
Guardado en:
Autores principales: | , , , , , , , , , , , , , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Nature Portfolio
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/decac40d76f04fc18e73402b619ffedc |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:decac40d76f04fc18e73402b619ffedc |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:decac40d76f04fc18e73402b619ffedc2021-12-02T17:19:10ZPrivacy-first health research with federated learning10.1038/s41746-021-00489-22398-6352https://doaj.org/article/decac40d76f04fc18e73402b619ffedc2021-09-01T00:00:00Zhttps://doi.org/10.1038/s41746-021-00489-2https://doaj.org/toc/2398-6352Abstract Privacy protection is paramount in conducting health research. However, studies often rely on data stored in a centralized repository, where analysis is done with full access to the sensitive underlying content. Recent advances in federated learning enable building complex machine-learned models that are trained in a distributed fashion. These techniques facilitate the calculation of research study endpoints such that private data never leaves a given device or healthcare system. We show—on a diverse set of single and multi-site health studies—that federated models can achieve similar accuracy, precision, and generalizability, and lead to the same interpretation as standard centralized statistical models while achieving considerably stronger privacy protections and without significantly raising computational costs. This work is the first to apply modern and general federated learning methods that explicitly incorporate differential privacy to clinical and epidemiological research—across a spectrum of units of federation, model architectures, complexity of learning tasks and diseases. As a result, it enables health research participants to remain in control of their data and still contribute to advancing science—aspects that used to be at odds with each other.Adam SadilekLuyang LiuDung NguyenMethun KamruzzamanStylianos SerghiouBenjamin RaderAlex IngermanStefan MellemPeter KairouzElaine O. NsoesieJamie MacFarlaneAnil VullikantiMadhav MarathePaul EasthamJohn S. BrownsteinBlaise Aguera y. ArcasMichael D. HowellJohn HernandezNature PortfolioarticleComputer applications to medicine. Medical informaticsR858-859.7ENnpj Digital Medicine, Vol 4, Iss 1, Pp 1-8 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Computer applications to medicine. Medical informatics R858-859.7 |
spellingShingle |
Computer applications to medicine. Medical informatics R858-859.7 Adam Sadilek Luyang Liu Dung Nguyen Methun Kamruzzaman Stylianos Serghiou Benjamin Rader Alex Ingerman Stefan Mellem Peter Kairouz Elaine O. Nsoesie Jamie MacFarlane Anil Vullikanti Madhav Marathe Paul Eastham John S. Brownstein Blaise Aguera y. Arcas Michael D. Howell John Hernandez Privacy-first health research with federated learning |
description |
Abstract Privacy protection is paramount in conducting health research. However, studies often rely on data stored in a centralized repository, where analysis is done with full access to the sensitive underlying content. Recent advances in federated learning enable building complex machine-learned models that are trained in a distributed fashion. These techniques facilitate the calculation of research study endpoints such that private data never leaves a given device or healthcare system. We show—on a diverse set of single and multi-site health studies—that federated models can achieve similar accuracy, precision, and generalizability, and lead to the same interpretation as standard centralized statistical models while achieving considerably stronger privacy protections and without significantly raising computational costs. This work is the first to apply modern and general federated learning methods that explicitly incorporate differential privacy to clinical and epidemiological research—across a spectrum of units of federation, model architectures, complexity of learning tasks and diseases. As a result, it enables health research participants to remain in control of their data and still contribute to advancing science—aspects that used to be at odds with each other. |
format |
article |
author |
Adam Sadilek Luyang Liu Dung Nguyen Methun Kamruzzaman Stylianos Serghiou Benjamin Rader Alex Ingerman Stefan Mellem Peter Kairouz Elaine O. Nsoesie Jamie MacFarlane Anil Vullikanti Madhav Marathe Paul Eastham John S. Brownstein Blaise Aguera y. Arcas Michael D. Howell John Hernandez |
author_facet |
Adam Sadilek Luyang Liu Dung Nguyen Methun Kamruzzaman Stylianos Serghiou Benjamin Rader Alex Ingerman Stefan Mellem Peter Kairouz Elaine O. Nsoesie Jamie MacFarlane Anil Vullikanti Madhav Marathe Paul Eastham John S. Brownstein Blaise Aguera y. Arcas Michael D. Howell John Hernandez |
author_sort |
Adam Sadilek |
title |
Privacy-first health research with federated learning |
title_short |
Privacy-first health research with federated learning |
title_full |
Privacy-first health research with federated learning |
title_fullStr |
Privacy-first health research with federated learning |
title_full_unstemmed |
Privacy-first health research with federated learning |
title_sort |
privacy-first health research with federated learning |
publisher |
Nature Portfolio |
publishDate |
2021 |
url |
https://doaj.org/article/decac40d76f04fc18e73402b619ffedc |
work_keys_str_mv |
AT adamsadilek privacyfirsthealthresearchwithfederatedlearning AT luyangliu privacyfirsthealthresearchwithfederatedlearning AT dungnguyen privacyfirsthealthresearchwithfederatedlearning AT methunkamruzzaman privacyfirsthealthresearchwithfederatedlearning AT stylianosserghiou privacyfirsthealthresearchwithfederatedlearning AT benjaminrader privacyfirsthealthresearchwithfederatedlearning AT alexingerman privacyfirsthealthresearchwithfederatedlearning AT stefanmellem privacyfirsthealthresearchwithfederatedlearning AT peterkairouz privacyfirsthealthresearchwithfederatedlearning AT elaineonsoesie privacyfirsthealthresearchwithfederatedlearning AT jamiemacfarlane privacyfirsthealthresearchwithfederatedlearning AT anilvullikanti privacyfirsthealthresearchwithfederatedlearning AT madhavmarathe privacyfirsthealthresearchwithfederatedlearning AT pauleastham privacyfirsthealthresearchwithfederatedlearning AT johnsbrownstein privacyfirsthealthresearchwithfederatedlearning AT blaiseaguerayarcas privacyfirsthealthresearchwithfederatedlearning AT michaeldhowell privacyfirsthealthresearchwithfederatedlearning AT johnhernandez privacyfirsthealthresearchwithfederatedlearning |
_version_ |
1718381114859454464 |