Ranking a set of retrieval systems according to their retrieval effectiveness without relying on relevance judgments was first explored by Soboroff et al. [13]. Over the years, a number of alternative approaches have been proposed, all of which have been evaluated on early TREC test collections. In this work, we perform a wider analysis of system ranking estimation methods on sixteen TREC data sets which cover more tasks and corpora than previously. Our analysis reveals that the performance of system ranking estimation approaches varies across topics. This observation motivates the hypothesis that the performance of such methods can be improved by selecting the “right” subset of topics from a topic set. We show that using topic subsets improve...
Corpora and topics are readily available for information retrieval research. Relevance judgments, wh...
Batched evaluations in IR experiments are commonly built using relevance judgments formed over a sam...
We compare the performance of two database selection algorithms reported in the literature. Their pe...
Ranking a number of retrieval systems according to their retrieval effectiveness without relying on ...
Some measures, such as mean average precision and recall-level precision, are considered good syste...
For decades, the use of test collections has been a standard approach in information retrieval ev...
Measuring effectiveness of information retrieval (IR) systems is essential for research and developm...
Typical information retrieval system evaluation requires expensive manually-collected relevance judg...
In information retrieval (IR), research aiming to reduce the cost of retrieval system evaluations ha...
Information Retrieval (IR) systems heavily rely on a large number of parameter...
© 2011 Dr. Sri Devi Ravana. Comparative evaluations of information retrieval systems using test collec...
Information retrieval system evaluation is necessary to measure and quantify the effectiveness,...
Most information retrieval effectiveness evaluation metrics assume that systems appending irrelev...