There have been a number of prior attempts to theoretically justify the effectiveness of the inverse document frequency (IDF). Those that take as their starting point Robertson and Spärck Jones’s probabilistic model are based on strong or complex assumptions. We show that a more intuitively plausible assumption suffices. Moreover, the new assumption, while conceptually very simple, provides a solution to an estimation problem that had been deemed intractable by Robertson and Walker (1997)
From the issue entitled "Special Issue on the Second International Conference on the Theory of Infor...
Two competing approaches for document retrieval were first identified by Robertson et al (Robertson,...
In this study, we show how Luhn’s claim about the degree of importance of a word in a document can b...
This paper reports on theoretical investigations about the assumptions underlying the inverse docume...
This paper presents a new probabilistic model of information retrieval. The most important modeling ...
Retrieval models are the core components of information retrieval systems, which guide the document ...
Keywords: in information retrieval for decades. We propose a novel term weighting method based on wh...
This paper presents a new probabilistic model of information retrieval. The most important modeling ...
Abstract. This paper presents a new probabilistic model of information retrieval. The most important...
In this study, we show how Luhn‘s claim about the degree of importance of a word in a document can b...
This research evaluates a model for probabilistic text and document retrieval; the model utilizes th...
We introduce and create a framework for deriving probabilistic models of Information Retrieval. The ...
The need for an efficient method to find the furthermost appropriate document corresponding to a par...
In document analysis, an important task is to automatically find keywords which best describe the su...
Based on the Shannon information theory, a measure for term value is introduced. This study is an a...
From the issue entitled "Special Issue on the Second International Conference on the Theory of Infor...
Two competing approaches for document retrieval were first identified by Robertson et al (Robertson,...
In this study, we show how Luhn’s claim about the degree of importance of a word in a document can b...
This paper reports on theoretical investigations about the assumptions underlying the inverse docume...
This paper presents a new probabilistic model of information retrieval. The most important modeling ...
Retrieval models are the core components of information retrieval systems, which guide the document ...
Keywords: in information retrieval for decades. We propose a novel term weighting method based on wh...
This paper presents a new probabilistic model of information retrieval. The most important modeling ...
Abstract. This paper presents a new probabilistic model of information retrieval. The most important...
In this study, we show how Luhn‘s claim about the degree of importance of a word in a document can b...
This research evaluates a model for probabilistic text and document retrieval; the model utilizes th...
We introduce and create a framework for deriving probabilistic models of Information Retrieval. The ...
The need for an efficient method to find the furthermost appropriate document corresponding to a par...
In document analysis, an important task is to automatically find keywords which best describe the su...
Based on the Shannon information theory, a measure for term value is introduced. This study is an a...
From the issue entitled "Special Issue on the Second International Conference on the Theory of Infor...
Two competing approaches for document retrieval were first identified by Robertson et al (Robertson,...
In this study, we show how Luhn’s claim about the degree of importance of a word in a document can b...