Variational Open-Domain Question Answering

Publikation: Bidrag til tidsskriftKonferenceartikelForskningfagfællebedømt

  • Valentin Liévin
  • Andreas Geert Motzfeldt
  • Ida Riis Jensen
  • Winther, Ole
Retrieval-augmented models have proven to be effective in natural language processing tasks, yet there remains a lack of research on their optimization using variational inference. We introduce the Variational Open-Domain (VOD) framework for end-to-end training and evaluation of retrieval-augmented models, focusing on open-domain question answering and language modelling. The VOD objective, a self-normalized estimate of the Rényi variational bound, approximates the task marginal likelihood and is evaluated under samples drawn from an auxiliary sampling distribution (cached retriever and/or approximate posterior). It remains tractable, even for retriever distributions defined on large corpora. We demonstrate VOD's versatility by training reader-retriever BERT-sized models on multiple-choice medical exam questions. On the MedMCQA dataset, we outperform the domain-tuned Med-PaLM by +5.3% despite using 2.500× fewer parameters. Our retrieval-augmented BioLinkBERT model scored 62.9% on the MedMCQA and 55.0% on the MedQA-USMLE. Last, we show the effectiveness of our learned retriever component in the context of medical semantic search. © 2023 Proceedings of Machine Learning Research. All rights reserved.
OriginalsprogEngelsk
TidsskriftProceedings of Machine Learning Research
Vol/bind202
Sider (fra-til)20950-20977
Antal sider28
ISSN2640-3498
StatusUdgivet - 2023
Begivenhed40th International Conference on Machine Learning, ICML 2023 - Honolulu, USA
Varighed: 23 jul. 202329 jul. 2023

Konference

Konference40th International Conference on Machine Learning, ICML 2023
LandUSA
ByHonolulu
Periode23/07/202329/07/2023

Bibliografisk note

Funding Information:
VL’s work was funded in part by Google DeepMind through a PhD grant. OW’s work was funded in part by the Novo Nordisk Foundation through the Center for Basic Machine Learning Research in Life Science (NNF20OC0062606). VL and OW acknowledge support from the Pioneer Centre for AI, DNRF grant number P1.

Publisher Copyright:
© 2023 Proceedings of Machine Learning Research. All rights reserved.

Links

ID: 372185038