Crowdsourcing has become an alternative approach to collecting relevance judgments at scale, thanks to the availability of crowdsourcing platforms and quality control techniques that make it possible to obtain reliable results. Previous work has used crowdsourcing to ask multiple crowd workers to judge the relevance of a document with respect to a query, and has studied how best to aggregate multiple judgments of the same topic-document pair. This paper addresses an aspect that has been largely overlooked so far: how the time available to express a relevance judgment affects its quality. We also discuss the quality loss incurred by making crowdsourced relevance judgments more efficient in terms of the time taken to judge the relevance of a document. We use standard...
Information Retrieval systems rely on large test collections to measure their effectiveness in retri...
Information Retrieval systems rely on large test collections to measure their effectiveness in retri...
Abstract. Crowdsourcing relevance judgments for test collection construction is attractive because ...
Crowdsourcing has become an alternative approach to collect relevance judgments at scale thanks to t...
Crowdsourcing has become an alternative approach to collect relevance judgments at large scale. In t...
© 2018 ACM. While crowdsourcing offers a low-cost, scalable way to collect relevance judgments, lack...
Crowdsourcing is a popular technique to collect large amounts of human-generated labels, such as rel...
Evaluation is instrumental in the development and management of effective information retrieval syst...
The performance of information retrieval (IR) systems is commonly evaluated using a test set with kn...
In recent years, gathering relevance judgments through non-topic originators has become an increasin...
Crowdsourcing relevance judgments for the evaluation of search engines is used increasingly to overc...
Abstract. We consider the problem of acquiring relevance judgements for information retrieval (IR) ...
When collecting item ratings from human judges, it can be difficult to measure and enforce data qual...
In Information Retrieval (IR) evaluation, preference judgments are collected by presenting to the as...