Interleaving is an increasingly popular technique for evaluating information retrieval systems based on implicit user feedback. While a number of isolated studies have analyzed how this technique agrees with conventional offline evaluation approaches and other online techniques, a complete picture of its efficiency and effectiveness is still lacking. In this paper we extend and combine the body of empirical evidence regarding interleaving, and provide a comprehensive analysis of interleaving using data from two major commercial search engines and a retrieval system for scientific literature. In particular, we analyze the agreement of interleaving with manual relevance judgments and observational implicit feedback measures, estimate the...
A result page of a modern search engine often goes beyond a simple list of "10 blue links." Many spe...
A result page of a modern search engine often goes beyond a simple list of “10 blue links.” Many spe...
Online evaluation methods for information retrieval use implicit signals such as clicks from users t...
Interleaving is an increasingly popular technique for evaluating information retrieval systems based...
Interleaving is an online evaluation method to compare two alternative ranking functions based on th...
Interleaving is an online evaluation method to compare two alternative ranking functions based on th...
The gold standard for online retrieval evaluation is AB testing. Rooted in the idea of a controlled ...
Ranker evaluation is central to the research into search engines, be it to compare rankers or to pro...
Ranker evaluation is central to the research into search engines, be it to compare rankers or to pro...
Interleaving is an online evaluation method that compares two ranking functions by mixing their res...
Ranker evaluation is central to the research into search engines, be it to compare rankers or to pro...
Interleaving is an online evaluation method that compares two ranking functions by mixing their res...
A result page of a modern web search engine is often much more complicated than a simple list of "te...
Evaluation methods for information retrieval systems come in three types: offline evaluation, using ...
Evaluation methods for information retrieval systems come in three types: offline evaluation, using ...
A result page of a modern search engine often goes beyond a simple list of "10 blue links." Many spe...
A result page of a modern search engine often goes beyond a simple list of “10 blue links.” Many spe...
Online evaluation methods for information retrieval use implicit signals such as clicks from users t...
Interleaving is an increasingly popular technique for evaluating information retrieval systems based...
Interleaving is an online evaluation method to compare two alternative ranking functions based on th...
Interleaving is an online evaluation method to compare two alternative ranking functions based on th...
The gold standard for online retrieval evaluation is AB testing. Rooted in the idea of a controlled ...
Ranker evaluation is central to the research into search engines, be it to compare rankers or to pro...
Ranker evaluation is central to the research into search engines, be it to compare rankers or to pro...
Interleaving is an online evaluation method that compares two ranking functions by mixing their res...
Ranker evaluation is central to the research into search engines, be it to compare rankers or to pro...
Interleaving is an online evaluation method that compares two ranking functions by mixing their res...
A result page of a modern web search engine is often much more complicated than a simple list of "te...
Evaluation methods for information retrieval systems come in three types: offline evaluation, using ...
Evaluation methods for information retrieval systems come in three types: offline evaluation, using ...
A result page of a modern search engine often goes beyond a simple list of "10 blue links." Many spe...
A result page of a modern search engine often goes beyond a simple list of “10 blue links.” Many spe...
Online evaluation methods for information retrieval use implicit signals such as clicks from users t...