Automatic information retrieval systems have to deal with documents of varying lengths in a text collection. Docu-ment length normalization is used to fairly retrieve docu-ments of all lengths. In this study, we ohserve that a nor-malization scheme that retrieves documents of all lengths with similar chances as their likelihood of relevance will outperform another scheme which retrieves documents with chances very different from their likelihood of relevance. We show that the retrievaf probabilities for a particular normal-ization method deviate systematically from the relevance probabilities across different collections. We present pivoted normalization, a technique that can be used to modify any normalization function thereby reducing the...
This paper introduces a new weighting scheme in information retrieval. It also proposes using the do...
Abstract. XML retrieval is a departure from standard document retrieval in which each individual XML...
Document length is widely recognized as an important factor for adjusting retrieval systems. Many mo...
Document length normalization is an important aspect of term weight assignment in an automatic infor...
Automatic information retrieval systems have to deal with documents of varying lengths in a text col...
In information retrieval systems search quality is directly related to the number of relevant retrie...
Term weighting is an essential part of the modern information retrieval systems. Out of the three ma...
In the TREC collection -- a large full-text experimental text collection with widely varying documen...
Today's lecture notes cover pivoted document length normalization by Singhal, Buckley, and Mitr...
Normalizing document length is widely recognized as an important factor for adjusting retrieval syst...
Every information retrieval (IR) model embeds in its scoring function a form of term frequency (TF) ...
Open access funding provided by Austrian Science Fund (FWF). This research was partly supported by t...
Abstract—This paper introduces a new weighting scheme in information retrieval. It also proposes usi...
The advent of large electronic text corpora has generated a range of technologies for their search a...
The full paper appeared as: J. Kamps, M. de Rijke, and B. Sigurbj¨ornsson, “Length Normalization in ...
This paper introduces a new weighting scheme in information retrieval. It also proposes using the do...
Abstract. XML retrieval is a departure from standard document retrieval in which each individual XML...
Document length is widely recognized as an important factor for adjusting retrieval systems. Many mo...
Document length normalization is an important aspect of term weight assignment in an automatic infor...
Automatic information retrieval systems have to deal with documents of varying lengths in a text col...
In information retrieval systems search quality is directly related to the number of relevant retrie...
Term weighting is an essential part of the modern information retrieval systems. Out of the three ma...
In the TREC collection -- a large full-text experimental text collection with widely varying documen...
Today's lecture notes cover pivoted document length normalization by Singhal, Buckley, and Mitr...
Normalizing document length is widely recognized as an important factor for adjusting retrieval syst...
Every information retrieval (IR) model embeds in its scoring function a form of term frequency (TF) ...
Open access funding provided by Austrian Science Fund (FWF). This research was partly supported by t...
Abstract—This paper introduces a new weighting scheme in information retrieval. It also proposes usi...
The advent of large electronic text corpora has generated a range of technologies for their search a...
The full paper appeared as: J. Kamps, M. de Rijke, and B. Sigurbj¨ornsson, “Length Normalization in ...
This paper introduces a new weighting scheme in information retrieval. It also proposes using the do...
Abstract. XML retrieval is a departure from standard document retrieval in which each individual XML...
Document length is widely recognized as an important factor for adjusting retrieval systems. Many mo...