Automatic identification of web-based risk markers for health events

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Automatic identification of web-based risk markers for health events. / Yom-Tov, Elad; Borsa, Diana; Hayward, Andrew C.; McKendry, Rachel A.; Cox, Ingemar Johansson.

I: Journal of Medical Internet Research, Bind 17, Nr. 1, e29, 2015.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Yom-Tov, E, Borsa, D, Hayward, AC, McKendry, RA & Cox, IJ 2015, 'Automatic identification of web-based risk markers for health events', Journal of Medical Internet Research, bind 17, nr. 1, e29. https://doi.org/10.2196/jmir.4082

APA

Yom-Tov, E., Borsa, D., Hayward, A. C., McKendry, R. A., & Cox, I. J. (2015). Automatic identification of web-based risk markers for health events. Journal of Medical Internet Research, 17(1), [e29]. https://doi.org/10.2196/jmir.4082

Vancouver

Yom-Tov E, Borsa D, Hayward AC, McKendry RA, Cox IJ. Automatic identification of web-based risk markers for health events. Journal of Medical Internet Research. 2015;17(1). e29. https://doi.org/10.2196/jmir.4082

Author

Yom-Tov, Elad ; Borsa, Diana ; Hayward, Andrew C. ; McKendry, Rachel A. ; Cox, Ingemar Johansson. / Automatic identification of web-based risk markers for health events. I: Journal of Medical Internet Research. 2015 ; Bind 17, Nr. 1.

Bibtex

@article{5c9259263efe4bb19e1fd6ac78480833,
title = "Automatic identification of web-based risk markers for health events",
abstract = "Background: The escalating cost of global health care is driving the development of new technologies to identify early indicators of an individual's risk of disease. Traditionally, epidemiologists have identified such risk factors using medical databases and lengthy clinical studies but these are often limited in size and cost and can fail to take full account of diseases where there are social stigmas or to identify transient acute risk factors. Objective: Here we report that Web search engine queries coupled with information on Wikipedia access patterns can be used to infer health events associated with an individual user and automatically generate Web-based risk markers for some of the common medical conditions worldwide, from cardiovascular disease to sexually transmitted infections and mental health conditions, as well as pregnancy. Methods: Using anonymized datasets, we present methods to first distinguish individuals likely to have experienced specific health events, and classify them into distinct categories. We then use the self-controlled case series method to find the incidence of health events in risk periods directly following a user's search for a query category, and compare to the incidence during other periods for the same individuals. Results: Searches for pet stores were risk markers for allergy. We also identified some possible new risk markers; for example: searching for fast food and theme restaurants was associated with a transient increase in risk of myocardial infarction, suggesting this exposure goes beyond a long-term risk factor but may also act as an acute trigger of myocardial infarction. Dating and adult content websites were risk markers for sexually transmitted infections, such as human immunodeficiency virus (HIV). Conclusions: Web-based methods provide a powerful, low-cost approach to automatically identify risk factors, and support more timely and personalized public health efforts to bring human and economic benefits.",
keywords = "Epidemiology, Information retrieval query processing, Machine Learning, Self-controlled case series, Web search engines",
author = "Elad Yom-Tov and Diana Borsa and Hayward, {Andrew C.} and McKendry, {Rachel A.} and Cox, {Ingemar Johansson}",
year = "2015",
doi = "10.2196/jmir.4082",
language = "English",
volume = "17",
journal = "Journal of Medical Internet Research",
issn = "1439-4456",
publisher = "JMIR Publications",
number = "1",

}

RIS

TY - JOUR

T1 - Automatic identification of web-based risk markers for health events

AU - Yom-Tov, Elad

AU - Borsa, Diana

AU - Hayward, Andrew C.

AU - McKendry, Rachel A.

AU - Cox, Ingemar Johansson

PY - 2015

Y1 - 2015

N2 - Background: The escalating cost of global health care is driving the development of new technologies to identify early indicators of an individual's risk of disease. Traditionally, epidemiologists have identified such risk factors using medical databases and lengthy clinical studies but these are often limited in size and cost and can fail to take full account of diseases where there are social stigmas or to identify transient acute risk factors. Objective: Here we report that Web search engine queries coupled with information on Wikipedia access patterns can be used to infer health events associated with an individual user and automatically generate Web-based risk markers for some of the common medical conditions worldwide, from cardiovascular disease to sexually transmitted infections and mental health conditions, as well as pregnancy. Methods: Using anonymized datasets, we present methods to first distinguish individuals likely to have experienced specific health events, and classify them into distinct categories. We then use the self-controlled case series method to find the incidence of health events in risk periods directly following a user's search for a query category, and compare to the incidence during other periods for the same individuals. Results: Searches for pet stores were risk markers for allergy. We also identified some possible new risk markers; for example: searching for fast food and theme restaurants was associated with a transient increase in risk of myocardial infarction, suggesting this exposure goes beyond a long-term risk factor but may also act as an acute trigger of myocardial infarction. Dating and adult content websites were risk markers for sexually transmitted infections, such as human immunodeficiency virus (HIV). Conclusions: Web-based methods provide a powerful, low-cost approach to automatically identify risk factors, and support more timely and personalized public health efforts to bring human and economic benefits.

AB - Background: The escalating cost of global health care is driving the development of new technologies to identify early indicators of an individual's risk of disease. Traditionally, epidemiologists have identified such risk factors using medical databases and lengthy clinical studies but these are often limited in size and cost and can fail to take full account of diseases where there are social stigmas or to identify transient acute risk factors. Objective: Here we report that Web search engine queries coupled with information on Wikipedia access patterns can be used to infer health events associated with an individual user and automatically generate Web-based risk markers for some of the common medical conditions worldwide, from cardiovascular disease to sexually transmitted infections and mental health conditions, as well as pregnancy. Methods: Using anonymized datasets, we present methods to first distinguish individuals likely to have experienced specific health events, and classify them into distinct categories. We then use the self-controlled case series method to find the incidence of health events in risk periods directly following a user's search for a query category, and compare to the incidence during other periods for the same individuals. Results: Searches for pet stores were risk markers for allergy. We also identified some possible new risk markers; for example: searching for fast food and theme restaurants was associated with a transient increase in risk of myocardial infarction, suggesting this exposure goes beyond a long-term risk factor but may also act as an acute trigger of myocardial infarction. Dating and adult content websites were risk markers for sexually transmitted infections, such as human immunodeficiency virus (HIV). Conclusions: Web-based methods provide a powerful, low-cost approach to automatically identify risk factors, and support more timely and personalized public health efforts to bring human and economic benefits.

KW - Epidemiology

KW - Information retrieval query processing

KW - Machine Learning

KW - Self-controlled case series

KW - Web search engines

U2 - 10.2196/jmir.4082

DO - 10.2196/jmir.4082

M3 - Journal article

C2 - 25626480

AN - SCOPUS:84922311499

VL - 17

JO - Journal of Medical Internet Research

JF - Journal of Medical Internet Research

SN - 1439-4456

IS - 1

M1 - e29

ER -

ID: 154523128