A general method for estimating the prevalence of influenza-like-symptoms with Wikipedia data.

Influenza is an acute respiratory seasonal disease that affects millions of people worldwide and causes thousands of deaths in Europe alone. Estimating in a fast and reliable way the impact of an illness on a given country is essential to plan and organize effective countermeasures, which is now pos...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Giovanni De Toni, Cristian Consonni, Alberto Montresor
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/0e4158c4df474311860df96d7caec1ee
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:0e4158c4df474311860df96d7caec1ee
record_format dspace
spelling oai:doaj.org-article:0e4158c4df474311860df96d7caec1ee2021-12-02T20:14:51ZA general method for estimating the prevalence of influenza-like-symptoms with Wikipedia data.1932-620310.1371/journal.pone.0256858https://doaj.org/article/0e4158c4df474311860df96d7caec1ee2021-01-01T00:00:00Zhttps://doi.org/10.1371/journal.pone.0256858https://doaj.org/toc/1932-6203Influenza is an acute respiratory seasonal disease that affects millions of people worldwide and causes thousands of deaths in Europe alone. Estimating in a fast and reliable way the impact of an illness on a given country is essential to plan and organize effective countermeasures, which is now possible by leveraging unconventional data sources like web searches and visits. In this study, we show the feasibility of exploiting machine learning models and information about Wikipedia's page views of a selected group of articles to obtain accurate estimates of influenza-like illnesses incidence in four European countries: Italy, Germany, Belgium, and the Netherlands. We propose a novel language-agnostic method, based on two algorithms, Personalized PageRank and CycleRank, to automatically select the most relevant Wikipedia pages to be monitored without the need for expert supervision. We then show how our model can reach state-of-the-art results by comparing it with previous solutions.Giovanni De ToniCristian ConsonniAlberto MontresorPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 16, Iss 8, p e0256858 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Giovanni De Toni
Cristian Consonni
Alberto Montresor
A general method for estimating the prevalence of influenza-like-symptoms with Wikipedia data.
description Influenza is an acute respiratory seasonal disease that affects millions of people worldwide and causes thousands of deaths in Europe alone. Estimating in a fast and reliable way the impact of an illness on a given country is essential to plan and organize effective countermeasures, which is now possible by leveraging unconventional data sources like web searches and visits. In this study, we show the feasibility of exploiting machine learning models and information about Wikipedia's page views of a selected group of articles to obtain accurate estimates of influenza-like illnesses incidence in four European countries: Italy, Germany, Belgium, and the Netherlands. We propose a novel language-agnostic method, based on two algorithms, Personalized PageRank and CycleRank, to automatically select the most relevant Wikipedia pages to be monitored without the need for expert supervision. We then show how our model can reach state-of-the-art results by comparing it with previous solutions.
format article
author Giovanni De Toni
Cristian Consonni
Alberto Montresor
author_facet Giovanni De Toni
Cristian Consonni
Alberto Montresor
author_sort Giovanni De Toni
title A general method for estimating the prevalence of influenza-like-symptoms with Wikipedia data.
title_short A general method for estimating the prevalence of influenza-like-symptoms with Wikipedia data.
title_full A general method for estimating the prevalence of influenza-like-symptoms with Wikipedia data.
title_fullStr A general method for estimating the prevalence of influenza-like-symptoms with Wikipedia data.
title_full_unstemmed A general method for estimating the prevalence of influenza-like-symptoms with Wikipedia data.
title_sort general method for estimating the prevalence of influenza-like-symptoms with wikipedia data.
publisher Public Library of Science (PLoS)
publishDate 2021
url https://doaj.org/article/0e4158c4df474311860df96d7caec1ee
work_keys_str_mv AT giovannidetoni ageneralmethodforestimatingtheprevalenceofinfluenzalikesymptomswithwikipediadata
AT cristianconsonni ageneralmethodforestimatingtheprevalenceofinfluenzalikesymptomswithwikipediadata
AT albertomontresor ageneralmethodforestimatingtheprevalenceofinfluenzalikesymptomswithwikipediadata
AT giovannidetoni generalmethodforestimatingtheprevalenceofinfluenzalikesymptomswithwikipediadata
AT cristianconsonni generalmethodforestimatingtheprevalenceofinfluenzalikesymptomswithwikipediadata
AT albertomontresor generalmethodforestimatingtheprevalenceofinfluenzalikesymptomswithwikipediadata
_version_ 1718374628474224640