In Information Retrieval (IR) evaluation, preference judgments are collected by presenting to the assessors a pair of documents and asking them to select which of the two, if any, is the most relevant. This is an alternative to the classic relevance judgment approach, in which human assessors judge the relevance of a single document on a scale; such an alternative allows to make relative rather than absolute judgments of relevance. While preference judgments are easier for human assessors to perform, the number of possible document pairs to be judged is usually so high that it makes it unfeasible to judge them all. Thus, following a similar idea to pooling strategies for single document relevance judgments where the goal is to sample the mo...
We consider the problem of optimally allocating a fixed budget to construct a test collection with a...
Offline evaluation of information retrieval systems typically focuses on a single effectiveness meas...
Crowdsourcing has become an alternative approach to collect relevance judgments at scale thanks to t...
Evaluation of information retrieval (IR) systems has recently been exploring the use of preference j...
We consider the problem of optimally allocating a limited budget to acquire relevance judgments when...
Information retrieval systems have traditionally been evaluated over absolute judgments of relevance...
License, which permits unrestricted use, distribution, and reproduction in any medium, provided the ...
The dominant approach to evaluate the effectiveness of information retrieval (IR) systems is by mean...
Abstract. We consider the problem of acquiring relevance judgements for in-formation retrieval (IR) ...
An information retrieval (IR) system assists people in consuming huge amount of data, where the eval...
Information Retrieval (IR) researchers have often used existing IR evaluation collections and transf...
Crowdsourcing has become an alternative approach to collect relevance judgments at scale thanks to t...
In recent years, gathering relevance judgments through non-topic originators has become an increasin...
The availability of test collections in Cranfield paradigm has significantly benefited the developme...
© 2019 Ziying YangBatch evaluation techniques are often used to measure and compare the performance ...
We consider the problem of optimally allocating a fixed budget to construct a test collection with a...
Offline evaluation of information retrieval systems typically focuses on a single effectiveness meas...
Crowdsourcing has become an alternative approach to collect relevance judgments at scale thanks to t...
Evaluation of information retrieval (IR) systems has recently been exploring the use of preference j...
We consider the problem of optimally allocating a limited budget to acquire relevance judgments when...
Information retrieval systems have traditionally been evaluated over absolute judgments of relevance...
License, which permits unrestricted use, distribution, and reproduction in any medium, provided the ...
The dominant approach to evaluate the effectiveness of information retrieval (IR) systems is by mean...
Abstract. We consider the problem of acquiring relevance judgements for in-formation retrieval (IR) ...
An information retrieval (IR) system assists people in consuming huge amount of data, where the eval...
Information Retrieval (IR) researchers have often used existing IR evaluation collections and transf...
Crowdsourcing has become an alternative approach to collect relevance judgments at scale thanks to t...
In recent years, gathering relevance judgments through non-topic originators has become an increasin...
The availability of test collections in Cranfield paradigm has significantly benefited the developme...
© 2019 Ziying YangBatch evaluation techniques are often used to measure and compare the performance ...
We consider the problem of optimally allocating a fixed budget to construct a test collection with a...
Offline evaluation of information retrieval systems typically focuses on a single effectiveness meas...
Crowdsourcing has become an alternative approach to collect relevance judgments at scale thanks to t...