Evaluation of Twitter data for an emerging crisis: an application to the first wave of COVID-19 in the UK

Abstract In the absence of nationwide mass testing for an emerging health crisis, alternative approaches could provide necessary information efficiently to aid policy makers and health bodies when dealing with a pandemic. The following work presents a methodology by which Twitter data surrounding th...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: I Kit Cheng, Johannes Heyl, Nisha Lad, Gabriel Facini, Zara Grout
Formato: article
Lenguaje:EN
Publicado: Nature Portfolio 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/1a6bd9d44d1942bab92e50d0d2b4a5c4
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:1a6bd9d44d1942bab92e50d0d2b4a5c4
record_format dspace
spelling oai:doaj.org-article:1a6bd9d44d1942bab92e50d0d2b4a5c42021-12-02T17:27:03ZEvaluation of Twitter data for an emerging crisis: an application to the first wave of COVID-19 in the UK10.1038/s41598-021-98396-92045-2322https://doaj.org/article/1a6bd9d44d1942bab92e50d0d2b4a5c42021-09-01T00:00:00Zhttps://doi.org/10.1038/s41598-021-98396-9https://doaj.org/toc/2045-2322Abstract In the absence of nationwide mass testing for an emerging health crisis, alternative approaches could provide necessary information efficiently to aid policy makers and health bodies when dealing with a pandemic. The following work presents a methodology by which Twitter data surrounding the first wave of the COVID-19 pandemic in the UK is harvested and analysed using two main approaches. The first is an investigation into localized outbreak predictions by developing a prototype early-warning system using the distribution of total tweet volume. The temporal lag between the rises in the number of COVID-19 related tweets and officially reported deaths by Public Health England (PHE) is observed to be 6–27 days for various UK cities which matches the temporal lag values found in the literature. To better understand the topics of discussion and attitudes of people surrounding the pandemic, the second approach is an in-depth behavioural analysis assessing the public opinion and response to government policies such as the introduction of face-coverings. Using topic modelling, nine distinct topics are identified within the corpus of COVID-19 tweets, of which the themes ranged from retail to government bodies. Sentiment analysis on a subset of mask related tweets revealed sentiment spikes corresponding to major news and announcements. A Named Entity Recognition (NER) algorithm is trained and applied in a semi-supervised manner to recognise tweets containing location keywords within the unlabelled corpus and achieved a precision of 81.6%. Overall, these approaches allowed extraction of temporal trends relating to PHE case numbers, popular locations in relation to the use of face-coverings, and attitudes towards face-coverings, vaccines and the national ‘Test and Trace’ scheme.I Kit ChengJohannes HeylNisha LadGabriel FaciniZara GroutNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 11, Iss 1, Pp 1-13 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
I Kit Cheng
Johannes Heyl
Nisha Lad
Gabriel Facini
Zara Grout
Evaluation of Twitter data for an emerging crisis: an application to the first wave of COVID-19 in the UK
description Abstract In the absence of nationwide mass testing for an emerging health crisis, alternative approaches could provide necessary information efficiently to aid policy makers and health bodies when dealing with a pandemic. The following work presents a methodology by which Twitter data surrounding the first wave of the COVID-19 pandemic in the UK is harvested and analysed using two main approaches. The first is an investigation into localized outbreak predictions by developing a prototype early-warning system using the distribution of total tweet volume. The temporal lag between the rises in the number of COVID-19 related tweets and officially reported deaths by Public Health England (PHE) is observed to be 6–27 days for various UK cities which matches the temporal lag values found in the literature. To better understand the topics of discussion and attitudes of people surrounding the pandemic, the second approach is an in-depth behavioural analysis assessing the public opinion and response to government policies such as the introduction of face-coverings. Using topic modelling, nine distinct topics are identified within the corpus of COVID-19 tweets, of which the themes ranged from retail to government bodies. Sentiment analysis on a subset of mask related tweets revealed sentiment spikes corresponding to major news and announcements. A Named Entity Recognition (NER) algorithm is trained and applied in a semi-supervised manner to recognise tweets containing location keywords within the unlabelled corpus and achieved a precision of 81.6%. Overall, these approaches allowed extraction of temporal trends relating to PHE case numbers, popular locations in relation to the use of face-coverings, and attitudes towards face-coverings, vaccines and the national ‘Test and Trace’ scheme.
format article
author I Kit Cheng
Johannes Heyl
Nisha Lad
Gabriel Facini
Zara Grout
author_facet I Kit Cheng
Johannes Heyl
Nisha Lad
Gabriel Facini
Zara Grout
author_sort I Kit Cheng
title Evaluation of Twitter data for an emerging crisis: an application to the first wave of COVID-19 in the UK
title_short Evaluation of Twitter data for an emerging crisis: an application to the first wave of COVID-19 in the UK
title_full Evaluation of Twitter data for an emerging crisis: an application to the first wave of COVID-19 in the UK
title_fullStr Evaluation of Twitter data for an emerging crisis: an application to the first wave of COVID-19 in the UK
title_full_unstemmed Evaluation of Twitter data for an emerging crisis: an application to the first wave of COVID-19 in the UK
title_sort evaluation of twitter data for an emerging crisis: an application to the first wave of covid-19 in the uk
publisher Nature Portfolio
publishDate 2021
url https://doaj.org/article/1a6bd9d44d1942bab92e50d0d2b4a5c4
work_keys_str_mv AT ikitcheng evaluationoftwitterdataforanemergingcrisisanapplicationtothefirstwaveofcovid19intheuk
AT johannesheyl evaluationoftwitterdataforanemergingcrisisanapplicationtothefirstwaveofcovid19intheuk
AT nishalad evaluationoftwitterdataforanemergingcrisisanapplicationtothefirstwaveofcovid19intheuk
AT gabrielfacini evaluationoftwitterdataforanemergingcrisisanapplicationtothefirstwaveofcovid19intheuk
AT zaragrout evaluationoftwitterdataforanemergingcrisisanapplicationtothefirstwaveofcovid19intheuk
_version_ 1718380776710471680