Since long, computer science has distinguished between information retrieval and data retrieval, where information retrieval entails the problem of ranking textual documents on their content (with the goal to identify documents relevant for satisfying a user's information need) while data retrieval involves exact match, that is, checking a data collection for presence or absence of (precisely specified) items. But, now that XML has become a standard document model that allows structure and text content to be represented in a combined way, new generations of information retrieval systems are expected to handle semi-structured documents instead of plain text, with usage scenarios that require the combination of `conventional' ranking with oth...
Structured elements are pervasive in digital libraries, product catalogs, scientific data collection...
Papier egalement presente au Workshop sur XML & Object Technology (XOT'2000) dans le cadre de la 14t...
XML has emerged as a lingua franca of the WWW and is rapidly replacing other formats as the preferre...
Information is the main value of Information Society. The recent developments in computing power and...
Information is the main value of Information Society. The recent developments in computing power and...
Structured document interchange formats such as XML and SGML are ubiquitous, however information ret...
Traditional information retrieval model is based on the statistics, and all the keywords to the mode...
Abstract. Structured document retrieval aims at exploiting the struc-ture together with the content ...
Recently, we have seen a steep increase in the popularity and adoption of XML, in areas such as trad...
XML holds the promise to yield (1) a more precise search by providing additional information in the ...
The Extensible Markup Language (XML) is extremely popular as a generic markup language for text docu...
Document-centric XML is a mixture of text and structure. With the increased availability of document...
1. XML is able to represent a mix of structured and unstructured (text) information. 2. Examples of ...
The widespread adoption of XML necessitates structure-aware systems that can effectively retrieve in...
XML documents represent a middle range between unstructured data such as textual documents and fully...
Structured elements are pervasive in digital libraries, product catalogs, scientific data collection...
Papier egalement presente au Workshop sur XML & Object Technology (XOT'2000) dans le cadre de la 14t...
XML has emerged as a lingua franca of the WWW and is rapidly replacing other formats as the preferre...
Information is the main value of Information Society. The recent developments in computing power and...
Information is the main value of Information Society. The recent developments in computing power and...
Structured document interchange formats such as XML and SGML are ubiquitous, however information ret...
Traditional information retrieval model is based on the statistics, and all the keywords to the mode...
Abstract. Structured document retrieval aims at exploiting the struc-ture together with the content ...
Recently, we have seen a steep increase in the popularity and adoption of XML, in areas such as trad...
XML holds the promise to yield (1) a more precise search by providing additional information in the ...
The Extensible Markup Language (XML) is extremely popular as a generic markup language for text docu...
Document-centric XML is a mixture of text and structure. With the increased availability of document...
1. XML is able to represent a mix of structured and unstructured (text) information. 2. Examples of ...
The widespread adoption of XML necessitates structure-aware systems that can effectively retrieve in...
XML documents represent a middle range between unstructured data such as textual documents and fully...
Structured elements are pervasive in digital libraries, product catalogs, scientific data collection...
Papier egalement presente au Workshop sur XML & Object Technology (XOT'2000) dans le cadre de la 14t...
XML has emerged as a lingua franca of the WWW and is rapidly replacing other formats as the preferre...