Test collections are the primary drivers of progress in infor-mation retrieval. They provide yardsticks for assessing the effectiveness of ranking functions in an automatic, rapid, and repeatable fashion and serve as training data for learning to rank models. However, manual construction of test collec-tions tends to be slow, labor-intensive, and expensive. This paper examines the feasibility of constructing web search test collections in a completely unsupervised manner given only a large web corpus as input. Within our proposed framework, anchor text extracted from the web graph is treated as a pseudo query log from which pseudo queries are sampled. For each pseudo query, a set of relevant and non-relevant documents are selected using a v...
Many machine learning technologies such as Support Vector Machines, Boosting, and Neural Networks ha...
Many online or local data sources provide powerful querying mechanisms but limited ranking capabilit...
A standard approach to estimating online click-based met-rics of a ranking function is to run it in ...
Recent years have witnessed a persistent interest in generating pseudo test collections, both for tr...
Abstract. Pseudo test collections are automatically generated to pro-vide training material for lear...
Learning to rank has become a popular approach to build a ranking model for Web search recently. Bas...
Learning-to-rank algorithms, which can automatically adapt ranking functions in web search, require ...
This data archive accompanies our work, in which we analyze a pseudo-relevance retrieval method that...
Web search engines are increasingly deploying many features, combined using learning to rank techniq...
We consider the problem of efficiently sampling Web search engine query results. In turn, using a sm...
Learning to rank techniques provide mechanisms for combining document feature values into learned mo...
We consider the problem of efficiently sampling Web search engine query results. In turn, using a sm...
Many web databases are only accessible through a proprietary search interface which allows users to ...
This paper explores two classes of model adaptation methods for Web search ranking: Model Interpolat...
We propose a methodology for building a robust query classification system that can identify thou-sa...
Many machine learning technologies such as Support Vector Machines, Boosting, and Neural Networks ha...
Many online or local data sources provide powerful querying mechanisms but limited ranking capabilit...
A standard approach to estimating online click-based met-rics of a ranking function is to run it in ...
Recent years have witnessed a persistent interest in generating pseudo test collections, both for tr...
Abstract. Pseudo test collections are automatically generated to pro-vide training material for lear...
Learning to rank has become a popular approach to build a ranking model for Web search recently. Bas...
Learning-to-rank algorithms, which can automatically adapt ranking functions in web search, require ...
This data archive accompanies our work, in which we analyze a pseudo-relevance retrieval method that...
Web search engines are increasingly deploying many features, combined using learning to rank techniq...
We consider the problem of efficiently sampling Web search engine query results. In turn, using a sm...
Learning to rank techniques provide mechanisms for combining document feature values into learned mo...
We consider the problem of efficiently sampling Web search engine query results. In turn, using a sm...
Many web databases are only accessible through a proprietary search interface which allows users to ...
This paper explores two classes of model adaptation methods for Web search ranking: Model Interpolat...
We propose a methodology for building a robust query classification system that can identify thou-sa...
Many machine learning technologies such as Support Vector Machines, Boosting, and Neural Networks ha...
Many online or local data sources provide powerful querying mechanisms but limited ranking capabilit...
A standard approach to estimating online click-based met-rics of a ranking function is to run it in ...