Rhetorical relations for information retrieval

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Typically, every part in most coherent text has some plausible reason for its presence, some function that it performs to the overall semantics of the text. Rhetorical relations, e.g. contrast, cause, explanation, describe how the parts of a text are linked to each other. Knowledge about this so-called discourse structure has been applied successfully to several natural language processing tasks. This work studies the use of rhetorical relations for Information Retrieval (IR): Is there a correlation between certain rhetorical relations and retrieval performance? Can knowledge about a document’s rhetorical relations be useful to IR?

We present a language model modification that considers rhetorical relations when estimating the relevance of a document to a query. Empirical evaluation of different versions of our model on TREC settings shows that certain rhetorical relations can benefit retrieval effectiveness notably (> 10%
in mean average precision over a state-of-the-art baseline).
Original languageEnglish
Title of host publicationProceedings of the 35th international ACM SIGIR conference on Research and Development in Information Retrieval
Number of pages10
PublisherAssociation for Computing Machinery
Publication date2012
Pages931-940
ISBN (Electronic)978-1-4503-1472-5
DOIs
Publication statusPublished - 2012
Event35th International ACM SIGIR Conference on Research and Development in Information Retrieval - Oregon, United States
Duration: 12 Aug 201216 Aug 2012
Conference number: 35

Conference

Conference35th International ACM SIGIR Conference on Research and Development in Information Retrieval
Nummer35
LandUnited States
ByOregon
Periode12/08/201216/08/2012

ID: 38240033