The added value of online user-generated content in traditional methods for influenza surveillance

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

The added value of online user-generated content in traditional methods for influenza surveillance. / Wagner, Moritz; Lampos, Vasileios; Cox, Ingemar J.; Pebody, Richard.

I: Scientific Reports, Bind 8, Nr. 1, 13963, 2018.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Wagner, M, Lampos, V, Cox, IJ & Pebody, R 2018, 'The added value of online user-generated content in traditional methods for influenza surveillance', Scientific Reports, bind 8, nr. 1, 13963. https://doi.org/10.1038/s41598-018-32029-6

APA

Wagner, M., Lampos, V., Cox, I. J., & Pebody, R. (2018). The added value of online user-generated content in traditional methods for influenza surveillance. Scientific Reports, 8(1), [13963]. https://doi.org/10.1038/s41598-018-32029-6

Vancouver

Wagner M, Lampos V, Cox IJ, Pebody R. The added value of online user-generated content in traditional methods for influenza surveillance. Scientific Reports. 2018;8(1). 13963. https://doi.org/10.1038/s41598-018-32029-6

Author

Wagner, Moritz ; Lampos, Vasileios ; Cox, Ingemar J. ; Pebody, Richard. / The added value of online user-generated content in traditional methods for influenza surveillance. I: Scientific Reports. 2018 ; Bind 8, Nr. 1.

Bibtex

@article{ecc012d03b884bf9ab9fb7a53068198a,
title = "The added value of online user-generated content in traditional methods for influenza surveillance",
abstract = "There has been considerable work in evaluating the efficacy of using online data for health surveillance. Often comparisons with baseline data involve various squared error and correlation metrics. While useful, these overlook a variety of other factors important to public health bodies considering the adoption of such methods. In this paper, a proposed surveillance system that incorporates models based on recent research efforts is evaluated in terms of its added value for influenza surveillance at Public Health England. The system comprises of two supervised learning approaches trained on influenza-like illness (ILI) rates provided by the Royal College of General Practitioners (RCGP) and produces ILI estimates using Twitter posts or Google search queries. RCGP ILI rates for different age groups and laboratory confirmed cases by influenza type are used to evaluate the models with a particular focus on predicting the onset, overall intensity, peak activity and duration of the 2015/16 influenza season. We show that the Twitter-based models perform poorly and hypothesise that this is mostly due to the sparsity of the data available and a limited training period. Conversely, the Google-based model provides accurate estimates with timeliness of approximately one week and has the potential to complement current surveillance systems.",
author = "Moritz Wagner and Vasileios Lampos and Cox, {Ingemar J.} and Richard Pebody",
year = "2018",
doi = "10.1038/s41598-018-32029-6",
language = "English",
volume = "8",
journal = "Scientific Reports",
issn = "2045-2322",
publisher = "nature publishing group",
number = "1",

}

RIS

TY - JOUR

T1 - The added value of online user-generated content in traditional methods for influenza surveillance

AU - Wagner, Moritz

AU - Lampos, Vasileios

AU - Cox, Ingemar J.

AU - Pebody, Richard

PY - 2018

Y1 - 2018

N2 - There has been considerable work in evaluating the efficacy of using online data for health surveillance. Often comparisons with baseline data involve various squared error and correlation metrics. While useful, these overlook a variety of other factors important to public health bodies considering the adoption of such methods. In this paper, a proposed surveillance system that incorporates models based on recent research efforts is evaluated in terms of its added value for influenza surveillance at Public Health England. The system comprises of two supervised learning approaches trained on influenza-like illness (ILI) rates provided by the Royal College of General Practitioners (RCGP) and produces ILI estimates using Twitter posts or Google search queries. RCGP ILI rates for different age groups and laboratory confirmed cases by influenza type are used to evaluate the models with a particular focus on predicting the onset, overall intensity, peak activity and duration of the 2015/16 influenza season. We show that the Twitter-based models perform poorly and hypothesise that this is mostly due to the sparsity of the data available and a limited training period. Conversely, the Google-based model provides accurate estimates with timeliness of approximately one week and has the potential to complement current surveillance systems.

AB - There has been considerable work in evaluating the efficacy of using online data for health surveillance. Often comparisons with baseline data involve various squared error and correlation metrics. While useful, these overlook a variety of other factors important to public health bodies considering the adoption of such methods. In this paper, a proposed surveillance system that incorporates models based on recent research efforts is evaluated in terms of its added value for influenza surveillance at Public Health England. The system comprises of two supervised learning approaches trained on influenza-like illness (ILI) rates provided by the Royal College of General Practitioners (RCGP) and produces ILI estimates using Twitter posts or Google search queries. RCGP ILI rates for different age groups and laboratory confirmed cases by influenza type are used to evaluate the models with a particular focus on predicting the onset, overall intensity, peak activity and duration of the 2015/16 influenza season. We show that the Twitter-based models perform poorly and hypothesise that this is mostly due to the sparsity of the data available and a limited training period. Conversely, the Google-based model provides accurate estimates with timeliness of approximately one week and has the potential to complement current surveillance systems.

U2 - 10.1038/s41598-018-32029-6

DO - 10.1038/s41598-018-32029-6

M3 - Journal article

C2 - 30228285

VL - 8

JO - Scientific Reports

JF - Scientific Reports

SN - 2045-2322

IS - 1

M1 - 13963

ER -

ID: 203242201