Joint Extraction and Classification of Danish Competences for Job Matching

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

The matching of competences, such as skills, occupations or knowledges, is a key desiderata for candidates to be fit for jobs. Automatic extraction of competences from CVs and Jobs can greatly promote recruiters’ productivity in locating relevant candidates for job vacancies. This work presents the first model that jointly extracts and classifies competence from Danish job postings. Different from existing works on skill extraction and skill classification, our model is trained on a large volume of annotated Danish corpora and is capable of extracting a wide range of danish competences, including skills, occupations and knowledges of different categories. More importantly, as a single BERT-like architecture for joint extraction and classification, our model is lightweight and efficient at inference. On a real-scenario job matching dataset, our model beats the state-of-the-art models in the overall performance of Danish competence extraction and classification, and saves over 50% time at inference.

OriginalsprogEngelsk
Titel Advances in Information Retrieval : 45th European Conference on Information Retrieval, ECIR 2023, Dublin, Ireland, April 2–6, 2023, Proceedings, Part II
RedaktørerJaap Kamps, Lorraine Goeuriot, Fabio Crestani, Maria Maistro, Hideo Joho, Brian Davis, Cathal Gurrin, Udo Kruschwitz, Annalina Caputo
Antal sider9
ForlagSpringer
Publikationsdato2023
Sider475-483
ISBN (Trykt)978-3-031-28237-9
ISBN (Elektronisk)978-3-031-28238-6
DOI
StatusUdgivet - 2023
Begivenhed45th European Conference on Information Retrieval, ECIR 2023 - Dublin, Irland
Varighed: 2 apr. 20236 apr. 2023

Konference

Konference45th European Conference on Information Retrieval, ECIR 2023
LandIrland
ByDublin
Periode02/04/202306/04/2023
NavnLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISSN0302-9743

Bibliografisk note

Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

ID: 373676951