Algorithmic prediction of HIV status using nation-wide electronic registry data

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Algorithmic prediction of HIV status using nation-wide electronic registry data. / Ahlström, Magnus G; Ronit, Andreas; Omland, Lars Haukali; Vedel, Søren; Obel, Niels.

I: EClinicalMedicine, Bind 17, 100203, 2019.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Ahlström, MG, Ronit, A, Omland, LH, Vedel, S & Obel, N 2019, 'Algorithmic prediction of HIV status using nation-wide electronic registry data', EClinicalMedicine, bind 17, 100203. https://doi.org/10.1016/j.eclinm.2019.10.016

APA

Ahlström, M. G., Ronit, A., Omland, L. H., Vedel, S., & Obel, N. (2019). Algorithmic prediction of HIV status using nation-wide electronic registry data. EClinicalMedicine, 17, [100203]. https://doi.org/10.1016/j.eclinm.2019.10.016

Vancouver

Ahlström MG, Ronit A, Omland LH, Vedel S, Obel N. Algorithmic prediction of HIV status using nation-wide electronic registry data. EClinicalMedicine. 2019;17. 100203. https://doi.org/10.1016/j.eclinm.2019.10.016

Author

Ahlström, Magnus G ; Ronit, Andreas ; Omland, Lars Haukali ; Vedel, Søren ; Obel, Niels. / Algorithmic prediction of HIV status using nation-wide electronic registry data. I: EClinicalMedicine. 2019 ; Bind 17.

Bibtex

@article{977e6c12d2594ce2a7069aab7318f545,
title = "Algorithmic prediction of HIV status using nation-wide electronic registry data",
abstract = "Background: Late HIV diagnosis is detrimental both to the individual and to society. Strategies to improve early diagnosis of HIV must be a key health care priority. We examined whether nation-wide electronic registry data could be used to predict HIV status using machine learning algorithms.Methods: We extracted individual level data from Danish registries and used algorithms to predict HIV status. We used various algorithms to train prediction models and validated these models. We calibrated the models to mimic different clinical scenarios and created confusion matrices based on the calibrated models.Findings: A total 4,384,178 individuals, including 4,350 with incident HIV, were included in the analyses. The full model that included all variables that included demographic variables and information on past medical history had the highest area under the receiver operating characteristics curves of 88·4% (95%CI: 87·5% - 89·4%) in the validation dataset. Performance measures did not differ substantially with regards to which machine learning algorithm was used. When we calibrated the models to a specificity of 99·9% (pre-exposure prophylaxis (PrEP) scenario), we found a positive predictive value (PPV) of 8·3% in the full model. When we calibrated the models to a sensitivity of 90% (screening scenario), 384 individuals would have to be tested to find one undiagnosed person with HIV.Interpretation: Machine learning algorithms can learn from electronic registry data and help to predict HIV status with a fairly high level of accuracy. Integration of prediction models into clinical software systems may complement existing strategies such as indicator condition-guided HIV testing and prove useful for identifying individuals suitable for PrEP.Funding: The study was supported by funds from the Preben and Anne Simonsens Foundation, the Novo Nordisk Foundation, Rigshospitalet, Copenhagen University, the Danish AIDS Foundation, the Augustinus Foundation and the Danish Health Foundation.",
author = "Ahlstr{\"o}m, {Magnus G} and Andreas Ronit and Omland, {Lars Haukali} and S{\o}ren Vedel and Niels Obel",
year = "2019",
doi = "10.1016/j.eclinm.2019.10.016",
language = "English",
volume = "17",
journal = "EClinicalMedicine",
issn = "2589-5370",
publisher = "The Lancet Publishing Group",

}

RIS

TY - JOUR

T1 - Algorithmic prediction of HIV status using nation-wide electronic registry data

AU - Ahlström, Magnus G

AU - Ronit, Andreas

AU - Omland, Lars Haukali

AU - Vedel, Søren

AU - Obel, Niels

PY - 2019

Y1 - 2019

N2 - Background: Late HIV diagnosis is detrimental both to the individual and to society. Strategies to improve early diagnosis of HIV must be a key health care priority. We examined whether nation-wide electronic registry data could be used to predict HIV status using machine learning algorithms.Methods: We extracted individual level data from Danish registries and used algorithms to predict HIV status. We used various algorithms to train prediction models and validated these models. We calibrated the models to mimic different clinical scenarios and created confusion matrices based on the calibrated models.Findings: A total 4,384,178 individuals, including 4,350 with incident HIV, were included in the analyses. The full model that included all variables that included demographic variables and information on past medical history had the highest area under the receiver operating characteristics curves of 88·4% (95%CI: 87·5% - 89·4%) in the validation dataset. Performance measures did not differ substantially with regards to which machine learning algorithm was used. When we calibrated the models to a specificity of 99·9% (pre-exposure prophylaxis (PrEP) scenario), we found a positive predictive value (PPV) of 8·3% in the full model. When we calibrated the models to a sensitivity of 90% (screening scenario), 384 individuals would have to be tested to find one undiagnosed person with HIV.Interpretation: Machine learning algorithms can learn from electronic registry data and help to predict HIV status with a fairly high level of accuracy. Integration of prediction models into clinical software systems may complement existing strategies such as indicator condition-guided HIV testing and prove useful for identifying individuals suitable for PrEP.Funding: The study was supported by funds from the Preben and Anne Simonsens Foundation, the Novo Nordisk Foundation, Rigshospitalet, Copenhagen University, the Danish AIDS Foundation, the Augustinus Foundation and the Danish Health Foundation.

AB - Background: Late HIV diagnosis is detrimental both to the individual and to society. Strategies to improve early diagnosis of HIV must be a key health care priority. We examined whether nation-wide electronic registry data could be used to predict HIV status using machine learning algorithms.Methods: We extracted individual level data from Danish registries and used algorithms to predict HIV status. We used various algorithms to train prediction models and validated these models. We calibrated the models to mimic different clinical scenarios and created confusion matrices based on the calibrated models.Findings: A total 4,384,178 individuals, including 4,350 with incident HIV, were included in the analyses. The full model that included all variables that included demographic variables and information on past medical history had the highest area under the receiver operating characteristics curves of 88·4% (95%CI: 87·5% - 89·4%) in the validation dataset. Performance measures did not differ substantially with regards to which machine learning algorithm was used. When we calibrated the models to a specificity of 99·9% (pre-exposure prophylaxis (PrEP) scenario), we found a positive predictive value (PPV) of 8·3% in the full model. When we calibrated the models to a sensitivity of 90% (screening scenario), 384 individuals would have to be tested to find one undiagnosed person with HIV.Interpretation: Machine learning algorithms can learn from electronic registry data and help to predict HIV status with a fairly high level of accuracy. Integration of prediction models into clinical software systems may complement existing strategies such as indicator condition-guided HIV testing and prove useful for identifying individuals suitable for PrEP.Funding: The study was supported by funds from the Preben and Anne Simonsens Foundation, the Novo Nordisk Foundation, Rigshospitalet, Copenhagen University, the Danish AIDS Foundation, the Augustinus Foundation and the Danish Health Foundation.

U2 - 10.1016/j.eclinm.2019.10.016

DO - 10.1016/j.eclinm.2019.10.016

M3 - Journal article

C2 - 31891137

VL - 17

JO - EClinicalMedicine

JF - EClinicalMedicine

SN - 2589-5370

M1 - 100203

ER -

ID: 241885894