Per-field normalisation has been shown to be effective for Web search tasks, e.g. named-page finding. However, per-field normalisation also suffers from having hyper-parameters to tune on a per-field basis. In this paper, we argue that the purpose of per-field normalisation is to adjust the linear relationship between field length and term frequency. We experiment with standard Web test collections, using three document fields, namely the body of the document, its title, and the anchor text of its incoming links. From our experiments, we find that across different collections, the linear correlation values, given by the optimised hyper-parameter settings, are proportional to the maximum negative linear correlation. Based on this observation...
Note: We thank the reviewers for their corrections to our revised submission. Abstract: When one tri...
Web Search Engines (WSEs) are probably nowadays the most complex information systems since they need...
Automatic information retrieval systems have to deal with documents of varying lengths in a text col...
We participated in the WebCLEF 2005 monolingual task. In this task, a search system aims to retrieve...
The term frequency normalisation parameter tuning is a crucial issue in information retrieval (IR), ...
Every information retrieval (IR) model embeds in its scoring function a form of term frequency (TF) ...
In information retrieval systems search quality is directly related to the number of relevant retrie...
Term weighting is an essential part of the modern information retrieval systems. Out of the three ma...
Document fields, such as the title or the headings of a document, offer a way to consider the struct...
Web search personalization has been studied as a way to tailor Web search results to individual user...
The full paper appeared as: J. Kamps, M. de Rijke, and B. Sigurbj¨ornsson, “Length Normalization in ...
In this paper, we report our experiments in the TREC 2009 Million Query Track. Our first line of stu...
The term relevance weighting method has been shown to produce optimal information retrieval queries...
The fields that compose structured documents such as web pages have been exploited to improve the ef...
Emails are examples of structured documents with various fields. These fields can be exploited to en...
Note: We thank the reviewers for their corrections to our revised submission. Abstract: When one tri...
Web Search Engines (WSEs) are probably nowadays the most complex information systems since they need...
Automatic information retrieval systems have to deal with documents of varying lengths in a text col...
We participated in the WebCLEF 2005 monolingual task. In this task, a search system aims to retrieve...
The term frequency normalisation parameter tuning is a crucial issue in information retrieval (IR), ...
Every information retrieval (IR) model embeds in its scoring function a form of term frequency (TF) ...
In information retrieval systems search quality is directly related to the number of relevant retrie...
Term weighting is an essential part of the modern information retrieval systems. Out of the three ma...
Document fields, such as the title or the headings of a document, offer a way to consider the struct...
Web search personalization has been studied as a way to tailor Web search results to individual user...
The full paper appeared as: J. Kamps, M. de Rijke, and B. Sigurbj¨ornsson, “Length Normalization in ...
In this paper, we report our experiments in the TREC 2009 Million Query Track. Our first line of stu...
The term relevance weighting method has been shown to produce optimal information retrieval queries...
The fields that compose structured documents such as web pages have been exploited to improve the ef...
Emails are examples of structured documents with various fields. These fields can be exploited to en...
Note: We thank the reviewers for their corrections to our revised submission. Abstract: When one tri...
Web Search Engines (WSEs) are probably nowadays the most complex information systems since they need...
Automatic information retrieval systems have to deal with documents of varying lengths in a text col...