On aggregating labels from multiple crowd workers to infer relevance of documents

Mehdi Hosseini
Ingemar J. Cox
Nataša Milić-frayling
Gabriella Kazai
Vishwa Vinay

Open link

Publication date

January 2012

DOI

10.1007/978-3-642-28997-2_16

Publisher

Springer

ISSN

0302-9743

Citation count (estimate)

Abstract

Abstract. We consider the problem of acquiring relevance judgements for in-formation retrieval (IR) test collections through crowdsourcing when no true relevance labels are available. We collect multiple, possibly noisy relevance la-bels per document from workers of unknown labelling accuracy. We use these labels to infer the document relevance based on two methods. The first method is the commonly used majority voting (MV) which determines the document relevance based on the label that received the most votes, treating all the work-ers equally. The second is a probabilistic model that concurrently estimates the document relevance and the workers accuracy using expectation maximization (EM). We run simulations and conduct experiments with c...

Extracted data

We use cookies to provide a better user experience.

Data Protection

On aggregating labels from multiple crowd workers to infer relevance of documents

Abstract

Extracted data

On aggregating labels from multiple crowd workers to infer relevance of documents

Abstract

Extracted data

Related items

Related items