XML retrieval departs from standard document retrieval in that each individual XML element, ranging from italicized words or phrases to full-blown articles, is a retrievable unit. The distribution of XML element lengths is unlike what we usually observe in standard document collections, prompting us to revisit the issue of document length normalization. We perform a comparative analysis of arbitrary elements versus relevant elements, and show the importance of element length as a parameter for XML retrieval. Within the language modeling framework, we investigate a range of techniques that deal with length either directly or indirectly. We observe a length bias introduced by the amount of smoothing, and show the importance of extreme...
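The interaction between smoothing and element length can be illustrated with a small sketch. This is not the authors' implementation: it assumes linear interpolation (Jelinek-Mercer) smoothing and a prior proportional to element length as one example of dealing with length directly. The function name `score`, the parameter `lam`, its default value, and the toy data are all hypothetical.

```python
import math
from collections import Counter


def score(query, element, collection, lam=0.5, length_prior=False):
    """Log query likelihood of an element under linear interpolation smoothing.

    lam weights the element model and (1 - lam) the collection model.
    The default of 0.5 is an arbitrary choice for illustration.
    """
    if not element:
        raise ValueError("element must contain at least one token")
    elem_tf = Counter(element)
    coll_tf = Counter(collection)
    log_p = 0.0
    for term in query:
        p_elem = elem_tf[term] / len(element)
        p_coll = coll_tf[term] / len(collection)
        p = lam * p_elem + (1 - lam) * p_coll
        if p == 0.0:
            # The term occurs nowhere in the collection.
            return float("-inf")
        log_p += math.log(p)
    if length_prior:
        # Assumed prior: probability proportional to element length.
        log_p += math.log(len(element))
    return log_p


# Toy data: a one-word element (such as an italicized word) and a
# ten-word element that mentions the query term twice.
short_elem = ["xml"]
long_elem = ["xml", "retrieval", "ranks", "elements", "of",
             "xml", "documents", "by", "estimated", "relevance"]
collection = short_elem + long_elem + ["other"] * 89
query = ["xml"]

without_prior = (score(query, short_elem, collection),
                 score(query, long_elem, collection))
with_prior = (score(query, short_elem, collection, length_prior=True),
              score(query, long_elem, collection, length_prior=True))
```

On this toy data the one-word element outranks the longer one when no prior is used, and the order reverses once the length prior is added. The numbers only show the mechanism and say nothing about the effect sizes reported in the paper.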
Indexing and ranking are two key factors for efficient and effective XML information retrieval. Inap...
Document length normalization is an important aspect of term weight assignment in an automatic infor...
The article presents an analysis of the effect of granularity and order in an XML encoded collection...
The full paper appeared as: J. Kamps, M. de Rijke, and B. Sigurbjörnsson, “Length Normalization in ...
Normalizing document length is widely recognized as an important factor for adjusting retrieval syst...
Current information retrieval systems typically ignore structural aspects of documents, solely focus...
Document length is widely recognized as an important factor for adjusting retrieval systems. Many mo...
Abstract. In focussed XML retrieval, a retrieval unit is an XML element that not only contains infor...
Recent research has shown that long documents are unfairly penalised by a number of current retrieva...
Automatic information retrieval systems have to deal with documents of varying lengths in a text col...
Traditional document retrieval has shown to be a competitive approach in XML element retrieval, whic...
In information retrieval systems search quality is directly related to the number of relevant retrie...
This paper presents an information retrieval model on XML documents based on tree matching. Queries ...