Evaluation methods for information retrieval systems come in three types: offline evaluation, using static data sets annotated for relevance by human judges; user studies, usually conducted in a lab-based setting; and online evaluation, using implicit signals such as clicks from actual users. For the latter, preferences between rankers are typically inferred from implicit signals via interleaved comparison methods, which combine a pair of rankings and display the result to the user. We propose a new approach to online evaluation called multileaved comparisons that is useful in the prevalent case where designers are interested in the relative performance of more than two rankers. Rather than combining only a pair of rankings, multileaved comparisons ...
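The interleaved comparison idea described above can be sketched in a few lines. The following is a minimal, illustrative implementation of team-draft interleaving, one common interleaving scheme: the two rankers alternately "draft" their highest not-yet-picked document into a combined list, and the ranker whose documents attract more clicks wins the comparison. Function names and the click-credit logic here are assumptions for illustration, not the method of any specific paper above.

```python
import random

def team_draft_interleave(ranking_a, ranking_b, seed=None):
    """Merge two rankings team-draft style (illustrative sketch).

    Each round, the ranker with fewer picks so far (coin flip on ties)
    drafts its top not-yet-picked document, then the other ranker does
    the same. Returns the interleaved list plus each ranker's "team"
    of contributed documents.
    """
    rng = random.Random(seed)
    interleaved, team_a, team_b = [], set(), set()
    all_docs = set(ranking_a) | set(ranking_b)
    while len(interleaved) < len(all_docs):
        if len(team_a) < len(team_b) or (
            len(team_a) == len(team_b) and rng.random() < 0.5
        ):
            order = ((ranking_a, team_a), (ranking_b, team_b))
        else:
            order = ((ranking_b, team_b), (ranking_a, team_a))
        for ranking, team in order:
            for doc in ranking:
                if doc not in interleaved:
                    interleaved.append(doc)
                    team.add(doc)
                    break
    return interleaved, team_a, team_b

def infer_preference(clicked, team_a, team_b):
    """Credit each click to the team that contributed the document."""
    a = sum(1 for d in clicked if d in team_a)
    b = sum(1 for d in clicked if d in team_b)
    return "A" if a > b else "B" if b > a else "tie"
```

Multileaved comparisons generalize this by drafting from more than two rankings into one result list, so a single user interaction yields evidence about all participating rankers at once.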
A result page of a modern web search engine is often much more complicated than a simple list of "te...
Interleaving is an increasingly popular technique for evaluating information retrieval systems based...
Online evaluation methods for information retrieval use implicit signals such as clicks from users t...
Ranker evaluation is central to the research into search engines, be it to compare rankers or to pro...
Modern search systems are based on dozens or even hundreds of ranking features. The dueling bandit g...
Interleaving is an online evaluation method to compare two alternative ranking functions based on th...
Evaluating rankers using implicit feedback, such as clicks on documents in a result list, is an incr...
The gold standard for online retrieval evaluation is AB testing. Rooted in the idea of a controlled ...