Resources and Evaluations for Danish Entity Resolution

Publication: Contribution to book/anthology/report › Conference contribution in proceedings › Research › peer-reviewed

Documents

  • Fulltext

    Final published version, 229 KB, PDF document

Automatic coreference resolution is understudied in Danish even though most of the Danish Dependency Treebank (Buch-Kromann, 2003) is annotated with coreference relations. This paper describes a conversion of its partial, yet well-documented, coreference relations into coreference clusters and the training and evaluation of coreference models on this data. To the best of our knowledge, these are the first publicly available, neural coreference models for Danish. We also present a new entity linking annotation on the dataset using WikiData identifiers, a named entity disambiguation (NED) dataset, and a larger automatically created NED dataset enabling wikily supervised NED models. The entity linking annotation is benchmarked using a state-of-the-art neural entity disambiguation model.
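The abstract mentions converting pairwise coreference relations into coreference clusters. This record does not describe the procedure the authors used, so the following is only a minimal sketch of one common way to do such a conversion: treating each relation as an undirected link and grouping mentions with union-find. The function name `links_to_clusters` and the mention identifiers (`m1`, `m2`, ...) are hypothetical and do not come from the paper or from the Danish Dependency Treebank.

```python
from collections import defaultdict


def links_to_clusters(links):
    """Group mentions joined by pairwise coreference links into clusters.

    `links` is an iterable of (anaphor, antecedent) pairs. The identifiers
    are hypothetical; this is a generic union-find grouping, not the
    conversion described in the paper.
    """
    parent = {}

    def find(x):
        # Register unseen mentions as their own root, then walk to the root.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # Merge the two sets that each link connects.
    for anaphor, antecedent in links:
        root_a, root_b = find(anaphor), find(antecedent)
        if root_a != root_b:
            parent[root_a] = root_b

    # Collect mentions under their final root.
    clusters = defaultdict(set)
    for mention in parent:
        clusters[find(mention)].add(mention)

    # Sort for deterministic output.
    return sorted(sorted(cluster) for cluster in clusters.values())


if __name__ == "__main__":
    example_links = [("m2", "m1"), ("m3", "m2"), ("m5", "m4")]
    print(links_to_clusters(example_links))
```

In this sketch a chain of links (m3 to m2, m2 to m1) collapses into a single cluster, which is the usual effect of moving from a relation-based annotation to a cluster-based one. Mentions that take part in no link are not included in the output.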
Original language: English
Title: Proceedings of the Fourth Workshop on Computational Models of Reference, Anaphora and Coreference
Publisher: Association for Computational Linguistics
Publication date: 2021
Pages: 63-69
DOI
Status: Published - 2021
Event: 4th Workshop on Computational Models of Reference, Anaphora and Coreference - Punta Cana, Dominican Republic
Duration: 10 Nov 2021 to 11 Nov 2021

Conference

Conference: 4th Workshop on Computational Models of Reference, Anaphora and Coreference
City: Punta Cana, Dominican Republic
Period: 10 Nov 2021 to 11 Nov 2021

ID: 300081519