In this paper the score distributions of a number of text search engines are modeled. It is shown empirically that the score distributions on a per query basis may be modeled using an exponential distribution for the set of non-relevant documents and a normal distribution for the set of relevant documents. Experiments show that this model fits TREC-3 and TREC-4 data for a wide variety of different search engines including INQUERY a probabilistic search engine, SMART a vector space engine, and search engines based on latent semantic indexing and language modeling. The model also works when search engines index other languages like Chinese. It is then shown that given a query for which relevance information is not available, a mixture model c...
Because of the world wide web, information retrieval systems are now used by millions of untrained u...
In web search, latent semantic models have been proposed to bridge the lexical gap between queries a...
. This paper presents a new probabilistic model of information retrieval. The most important modelin...
Meta-search, or the combination of the outputs of different search engines in response to a query, h...
Given the ranked lists of documents returned by multiple search engines in response to a given query...
Modelling the distribution of document scores returned from an information retrieval (IR) system in ...
Score normalization is indispensable in distributed retrieval and fusion or meta-search where mergin...
Score-distribution models are used for various practical pur-poses in search, for example for result...
Score normalization is indispensable in distributed retrieval and fu-sion or meta-search where mergi...
Abstract. We review the history of modeling score distributions, focusing on the mixture of normal-e...
We review the history of modeling score distributions, focusing on the mixture of normal-exponential...
Ranked retrieval has a particular disadvantage in comparison with traditional Boolean retrieval: the...
Ranking search results is an ongoing research topic in information retrieval. The traditional models...
This paper presents a development and an implementation of probabilistic methods of information retr...
Because of the world wide web, information retrieval systems are now used by millions of untrained u...
Because of the world wide web, information retrieval systems are now used by millions of untrained u...
In web search, latent semantic models have been proposed to bridge the lexical gap between queries a...
. This paper presents a new probabilistic model of information retrieval. The most important modelin...
Meta-search, or the combination of the outputs of different search engines in response to a query, h...
Given the ranked lists of documents returned by multiple search engines in response to a given query...
Modelling the distribution of document scores returned from an information retrieval (IR) system in ...
Score normalization is indispensable in distributed retrieval and fusion or meta-search where mergin...
Score-distribution models are used for various practical pur-poses in search, for example for result...
Score normalization is indispensable in distributed retrieval and fu-sion or meta-search where mergi...
Abstract. We review the history of modeling score distributions, focusing on the mixture of normal-e...
We review the history of modeling score distributions, focusing on the mixture of normal-exponential...
Ranked retrieval has a particular disadvantage in comparison with traditional Boolean retrieval: the...
Ranking search results is an ongoing research topic in information retrieval. The traditional models...
This paper presents a development and an implementation of probabilistic methods of information retr...
Because of the world wide web, information retrieval systems are now used by millions of untrained u...
Because of the world wide web, information retrieval systems are now used by millions of untrained u...
In web search, latent semantic models have been proposed to bridge the lexical gap between queries a...
. This paper presents a new probabilistic model of information retrieval. The most important modelin...