Evaluation is instrumental in developing and managing effective information retrieval systems and in ensuring high levels of user satisfaction. A number of publications have shown that using crowdsourcing to obtain relevance assessments is viable. Less well understood are the limits of crowdsourcing for the assessment task, particularly for domain-specific search. We present results comparing relevance assessments gathered through crowdsourcing with those gathered from a domain expert for evaluating different search engines in a large government archive. While crowdsourced judgments rank the tested search engines in the same order as expert judgments, crowdsourced workers appear unable to distinguish different levels ...
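The system-level comparison described above, i.e. scoring every search engine under each set of judgments and then correlating the two induced rankings, can be made concrete with a rank-correlation statistic such as Kendall's tau. The Python sketch below is a minimal illustration of that idea only; the data layout (per-topic ranked runs and sets of relevant documents), the choice of precision@10 as the effectiveness measure, and all function names are assumptions made for the example rather than the setup used in the study.

from scipy.stats import kendalltau

def precision_at_k(ranked_docs, relevant_docs, k=10):
    # Fraction of the top-k retrieved documents that are judged relevant.
    return sum(1 for d in ranked_docs[:k] if d in relevant_docs) / k

def mean_score(run, qrels, k=10):
    # Average precision@k of one engine's run over all judged topics.
    # run: {topic_id: [doc_id, ...]}; qrels: {topic_id: set of relevant doc_ids}.
    return sum(precision_at_k(run.get(t, []), rel, k) for t, rel in qrels.items()) / len(qrels)

def rank_agreement(runs, expert_qrels, crowd_qrels, k=10):
    # Score every engine under both judgment sets and correlate the two rankings.
    engines = sorted(runs)
    expert_scores = [mean_score(runs[e], expert_qrels, k) for e in engines]
    crowd_scores = [mean_score(runs[e], crowd_qrels, k) for e in engines]
    tau, p_value = kendalltau(expert_scores, crowd_scores)
    return tau, p_value

A tau close to 1 would indicate that the two sets of judgments order the engines almost identically, which is consistent with the system-level agreement reported in the abstract even where per-document agreement on relevance grades is weaker.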
Crowdsourcing has gained a lot of attention as a viable approach for conducting IR evaluations. This...
Crowdsourcing has become an alternative approach to collect relevance judgments at large scale. In t...
Information Retrieval (IR) researchers have often used existing IR evaluation collections and transf...
The performance of information retrieval (IR) systems is commonly evaluated using a test set with kn...
Crowdsourcing relevance judgments for the evaluation of search engines is used increasingly to overc...
Crowdsourcing has become an alternative approach to collect relevance judgments at scale thanks to t...
Test collections are extensively used to evaluate information retrieval systems in laboratory-based ev...
The primary problem confronting any new kind of search task is how to bootstrap a reliable and repe...
Information Retrieval systems rely on large test collections to measure their effectiveness in retri...
We consider the problem of acquiring relevance judgements for information retrieval (IR) ...