Document length is widely recognized as an important factor for adjusting retrieval systems. Many models tend to favor the retrieval of either short or long documents, so a length-based correction needs to be applied to avoid length bias. In Language Modeling for Information Retrieval, smoothing methods are applied to move probability mass from document terms to unseen words, and the amount moved often depends on document length. In this article, we perform an in-depth study of this behavior, characterized by the document length retrieval trends, of three popular smoothing methods across a number of factors, and its impact on the length of documents retrieved and retrieval performance. First, we theoretically analyze the Jelinek–Merc...
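To make the length dependence concrete, the following is a minimal sketch of two standard smoothing estimators. It is an illustration under stated assumptions, not the article's own code: the abstract names Jelinek–Mercer smoothing, while the choice of Dirichlet prior smoothing as the second method, the function names, and the default values of `lam` and `mu` are assumptions made here.

```python
def jelinek_mercer(tf, doc_len, p_coll, lam=0.5):
    """Jelinek-Mercer: fixed linear interpolation between the document's
    maximum-likelihood estimate and the collection model.
    lam (assumed default) is the weight given to the collection model."""
    return (1.0 - lam) * (tf / doc_len) + lam * p_coll


def dirichlet(tf, doc_len, p_coll, mu=1000.0):
    """Dirichlet prior smoothing: mu (assumed default) acts as a number of
    pseudo-counts drawn from the collection model."""
    return (tf + mu * p_coll) / (doc_len + mu)


def dirichlet_collection_weight(doc_len, mu=1000.0):
    """Share of probability mass the Dirichlet estimate takes from the
    collection model; it shrinks as the document gets longer."""
    return mu / (doc_len + mu)


if __name__ == "__main__":
    p_coll = 0.01  # hypothetical collection probability of an unseen word
    for doc_len in (100, 1000, 10000):
        jm = jelinek_mercer(0, doc_len, p_coll)
        di = dirichlet(0, doc_len, p_coll)
        print(doc_len, jm, di, dirichlet_collection_weight(doc_len))
```

For a word that does not occur in the document (`tf = 0`), the Jelinek–Mercer estimate is `lam * p_coll` whatever the document length, whereas the Dirichlet estimate is `mu * p_coll / (doc_len + mu)` and therefore decreases as documents get longer.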
Because of the world wide web, information retrieval systems are now used by millions of untrained u...
Abstract. Term frequency normalization is a serious issue since document lengths vary. G...
The scope hypothesis in Information Retrieval (IR) states that a relationship exists between documen...
Normalizing document length is widely recognized as an important factor for adjusting retrieval syst...
In this paper we study the problem of language model smoothing and its influence on retrieval performance...
Language models form a class of successful probabilistic models in information retrieval. However, k...
Language models ...
The optimal settings of retrieval parameters often depend on both the document collection and the qu...
Recent research has shown that long documents are unfairly penalised by a number of current retrieva...
Document expansion is the process of augmenting the text of a document with text drawn from one or m...
Abstract. XML retrieval is a departure from standard document retrieval in which each individual XML...
XML retrieval is a departure from standard document retrieval in which each individual XML element, ...
Automatic information retrieval systems have to deal with documents of varying lengths in a text col...