Variational Open-Domain Question Answering

Publikation: Bidrag til tidsskrift › Konferenceartikel › Forskning › fagfællebedømt

Valentin Liévin
Andreas Geert Motzfeldt
Ida Riis Jensen
Winther, Ole

Retrieval-augmented models have proven to be effective in natural language processing tasks, yet there remains a lack of research on their optimization using variational inference. We introduce the Variational Open-Domain (VOD) framework for end-to-end training and evaluation of retrieval-augmented models, focusing on open-domain question answering and language modelling. The VOD objective, a self-normalized estimate of the Rényi variational bound, approximates the task marginal likelihood and is evaluated under samples drawn from an auxiliary sampling distribution (cached retriever and/or approximate posterior). It remains tractable, even for retriever distributions defined on large corpora. We demonstrate VOD's versatility by training reader-retriever BERT-sized models on multiple-choice medical exam questions. On the MedMCQA dataset, we outperform the domain-tuned Med-PaLM by +5.3% despite using 2.500× fewer parameters. Our retrieval-augmented BioLinkBERT model scored 62.9% on the MedMCQA and 55.0% on the MedQA-USMLE. Last, we show the effectiveness of our learned retriever component in the context of medical semantic search. © 2023 Proceedings of Machine Learning Research. All rights reserved.

Originalsprog	Engelsk
Tidsskrift	Proceedings of Machine Learning Research
Vol/bind	202
Sider (fra-til)	20950-20977
Antal sider	28
ISSN	2640-3498
Status	Udgivet - 2023
Begivenhed	40th International Conference on Machine Learning, ICML 2023 - Honolulu, USA Varighed: 23 jul. 2023 → 29 jul. 2023

Konference

Konference	40th International Conference on Machine Learning, ICML 2023
Land	USA
By	Honolulu
Periode	23/07/2023 → 29/07/2023

Bibliografisk note

Funding Information:
VL’s work was funded in part by Google DeepMind through a PhD grant. OW’s work was funded in part by the Novo Nordisk Foundation through the Center for Basic Machine Learning Research in Life Science (NNF20OC0062606). VL and OW acknowledge support from the Pioneer Centre for AI, DNRF grant number P1.

Publisher Copyright:
© 2023 Proceedings of Machine Learning Research. All rights reserved.

Forskning