We present a query language for searching collections of structured text. Documents within the collection are not required to adhere to a global schema nor are individual documents required to be structured according to any defined schema at all. Nonetheless, queries may directly reference structure across differently formatted documents. We briefly discuss application of the language to multilingual collections, relational databases, text filtering and document analysis. 1 Introduction Figure 1, a facsimile of a page from an edition of Shakespeare's Tragedie of Macbeth, demonstrates the complexity of structured text. This single page includes stage directions, speakers, speeches, the start of the play's first act, its entire fir...
International audienceNoSQL document stores are well-tailored to efficiently load and manage massive...
This paper presents a Information Retrieval (IR) system from collections of structured documents tha...
AbstractThis on the web, most structured document collections consist of documents from different so...
We present a query language for searching collections of structured text. Documents within the colle...
Structured document interchange formats such as XML and SGML are ubiquitous, however information ret...
Abstract. This paper presents a selection of methods for searching in heterogeneous data collections...
Abstract. How to exploit structured information to facilitate document retrieval? There have been qu...
A significant amount of information is expressed as the semi-structured, non-grammatical text found ...
260 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007.This work contributes to info...
Sgrep is a Unix tool for searching the contents of text files. Sgrep implements an algebra of unrest...
In many practical information retrieval situations, it is necessary to process heterogeneous text da...
We present a model for complex documents possibly consisting of a hierarchically structured set of i...
A semi-structured information space consists of multiple collections of textual documents containing...
Abstract: Structured documents are made up of a few logical components, such as title, sections, sub...
Documents often display a structure, e.g., several sections, each with several subsections and so on...
International audienceNoSQL document stores are well-tailored to efficiently load and manage massive...
This paper presents a Information Retrieval (IR) system from collections of structured documents tha...
AbstractThis on the web, most structured document collections consist of documents from different so...
We present a query language for searching collections of structured text. Documents within the colle...
Structured document interchange formats such as XML and SGML are ubiquitous, however information ret...
Abstract. This paper presents a selection of methods for searching in heterogeneous data collections...
Abstract. How to exploit structured information to facilitate document retrieval? There have been qu...
A significant amount of information is expressed as the semi-structured, non-grammatical text found ...
260 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007.This work contributes to info...
Sgrep is a Unix tool for searching the contents of text files. Sgrep implements an algebra of unrest...
In many practical information retrieval situations, it is necessary to process heterogeneous text da...
We present a model for complex documents possibly consisting of a hierarchically structured set of i...
A semi-structured information space consists of multiple collections of textual documents containing...
Abstract: Structured documents are made up of a few logical components, such as title, sections, sub...
Documents often display a structure, e.g., several sections, each with several subsections and so on...
International audienceNoSQL document stores are well-tailored to efficiently load and manage massive...
This paper presents a Information Retrieval (IR) system from collections of structured documents tha...
AbstractThis on the web, most structured document collections consist of documents from different so...