A Discriminative Latent-Variable Model for Bilingual Lexicon Induction

Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › fagfællebedømt

We introduce a novel discriminative latentvariablemodel for the task of bilingual lexiconinduction. Our model combines the bipartitematching dictionary prior of Haghighiet al. (2008) with a state-of-the-art embeddingbasedapproach. To train the model, we derivean efficient Viterbi EM algorithm. We provideempirical improvements on six language pairsunder two metrics and show that the prior theoreticallyand empirically helps to mitigate thehubness problem. We also demonstrate howprevious work may be viewed as a similarlyfashioned latent-variable model, albeit with adifferent prior.1

Originalsprog	Engelsk
Titel	Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Forlag	Association for Computational Linguistics
Publikationsdato	2020
Sider	458–468
Status	Udgivet - 2020
Begivenhed	2018 Conference on Empirical Methods in Natural Language Processing - Brussels, Belgien Varighed: 31 okt. 2018 → 4 nov. 2018

Konference

Konference	2018 Conference on Empirical Methods in Natural Language Processing
Land	Belgien
By	Brussels
Periode	31/10/2018 → 04/11/2018

ID: 214760286