An unanswered question in information retrieval research is whether improvements in system performance demonstrated by batch evaluations confer the same benefit on real users. We used the TREC-8 Interactive Track to investigate this question. After identifying a weighting scheme that gave the maximum improvement over the baseline in batch experiments, we tested it with real users searching on an instance recall task. Our results showed no statistically significant improvement: although the overall average improvement was comparable to the batch results, it was not statistically significant and was due to the effect of just one of the six queries. Further analysis with more queries is necessary to resolve this question.
Traditional batch evaluation metrics assume that user interaction with search results is limited to...
Test collection design eliminates sources of user variability to make statistical comparisons among ...
Evaluation is highly important for designing, developing and maintaining effective inf...
Web search tools are used on a daily basis by billions of people. The commercial providers of these ...
The existence and use of standard test collections in information retrieval experimentation allows r...
We introduce and explore the concept of an individual's relevance threshold as a way of reconci...
Users can judge th...
© 2011 Dr. Sri Devi Ravana. Comparative evaluations of information retrieval systems using test collec...
Traditional batch evaluation metrics assume that user interaction with search results is limited to ...
Recent investigations of search performance have shown that, even when presented with two systems th...
Information Retrieval (IR) research has traditionally focused on serving the best results for a sing...
Research in Information Retrieval has progressed against a background of rapidly increasing corpus s...
Information retrieval systems are often evaluated through the use of effectiveness metrics. In the p...
Previous research has demonstrated that system performance d...
Several recent studies have demonstrated that the type of improvements in information retrieval syst...