Ranker evaluation is central to the research into search engines, be it to compare rankers or to provide feedback for learning to rank. Traditional evaluation approaches do not scale well because they require explicit relevance judgments of document-query pairs, which are expensive to obtain. A promising alternative is the use of interleaved comparison methods, which compare rankers using click data obtained when interleaving their rankings. We propose a framework for analyzing interleaved comparison methods. An interleaved comparison method has fidelity if the expected outcome of ranker comparisons properly corresponds to the true relevance of the ranked documents. It is sound if its estimates of that expected outcome are unbiased and cons...
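To ground these definitions, the sketch below shows one common instance of an interleaved comparison, team-draft interleaving, in which two rankers alternately contribute documents to a single result list and each click is credited to the ranker that contributed the clicked document. The function names and the simple click-counting rule are illustrative assumptions for this sketch, not the specific comparison methods analyzed in the work above.

```python
import random

def team_draft_interleave(ranking_a, ranking_b, length=10):
    """Team-draft interleaving: in each round a coin flip decides which ranker
    picks first; each ranker then adds its highest-ranked document not already
    shown, and that document is credited to the ranker's team."""
    a, b = list(ranking_a), list(ranking_b)      # work on copies of the rankings
    interleaved, team_a, team_b = [], set(), set()
    while len(interleaved) < length and (a or b):
        order = [(a, team_a), (b, team_b)]
        if random.random() < 0.5:                # randomize which team picks first
            order.reverse()
        for ranking, team in order:
            while ranking and ranking[0] in interleaved:
                ranking.pop(0)                   # skip documents already shown
            if ranking and len(interleaved) < length:
                doc = ranking.pop(0)
                interleaved.append(doc)
                team.add(doc)
    return interleaved, team_a, team_b

def interleaved_outcome(clicked, team_a, team_b):
    """Credit each click to the team that contributed the clicked document;
    return +1 if ranker A wins the impression, -1 if B wins, 0 for a tie."""
    wins_a, wins_b = len(clicked & team_a), len(clicked & team_b)
    return (wins_a > wins_b) - (wins_a < wins_b)
```

Averaging the per-impression outcome over many queries yields the comparison's estimate of which ranker is preferred; fidelity and soundness, as defined above, concern whether that expected outcome and its estimate can be trusted.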
A key challenge in information retrieval is that of on-line ranker evaluation: determining which one...
The amount of digital data we produce every day far surpasses our ability to process this data, and ...
The gold standard for online retrieval evaluation is AB testing. Rooted in the idea of a controlled ...
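For contrast with interleaving, a minimal sketch of an AB comparison over click data might look as follows; the deterministic user bucketing and the click-through-rate metric are assumptions made purely for illustration, and any real deployment would add significance testing.

```python
import hashlib

def assign_bucket(user_id, n_buckets=2):
    """Deterministically hash a user id into a bucket (0 = ranker A, 1 = ranker B)."""
    digest = hashlib.sha256(user_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % n_buckets

def ab_compare(impressions):
    """impressions: iterable of (user_id, clicked) pairs observed online.

    Returns per-bucket click-through rates; the ranker whose bucket has the
    higher CTR is preferred in this simplified sketch."""
    clicks, views = [0, 0], [0, 0]
    for user_id, clicked in impressions:
        bucket = assign_bucket(user_id)
        views[bucket] += 1
        clicks[bucket] += int(clicked)
    return [c / v if v else 0.0 for c, v in zip(clicks, views)]
```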
Evaluating rankers using implicit feedback, such as clicks on documents in a result list, is an incr...
Evaluation methods for information retrieval systems come in three types: offline evaluation, using ...
Interleaved comparison methods, which compare rankers using click data, are a promising alternative ...
Interleaving is an online evaluation method to compare two alternative ranking functions based on th...
A result page of a modern web search engine is often much more complicated than a simple list of "te...
Online evaluation methods for information retrieval use implicit signals such as clicks from users t...
A result page of a modern search engine often goes beyond a simple list of "10 blue links." Many spe...
Interleaving is an increasingly popular technique for evaluating information retrieval systems based...