The need to store and query a set of strings – a string dictionary – arises in many kinds of applications. While classically these string dictionaries have accounted for a small share of the total space budget (e.g., in Natural Language Processing or when indexing text collections), recent applications in Web engines, Seman-tic Web (RDF) graphs, Bioinformatics, and many others, handle very large string dictionaries, whose size is a significant fraction of the whole data. In these cases, string dictionary management is a scalability issue by itself. This paper focuses on the problem of managing large static string dictionaries in compressed main memory space. We revisit classical solutions for string dictionaries like hashing, tries, and fro...
The original publication is available at www.springerlink.comThe past few years have witnessed sever...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...
We design two compressed data structures for the full-text indexing problem that support efficient s...
The need to store and query a set of strings { a string dictionary { arises in many kinds of applica...
Artículo de publicación ISIThe need to store and query a set of strings - a string dictionary - aris...
[Abstract] We introduce a new family of compressed data structures to efficiently store and query la...
In this thesis, we will illustrate a two-level approach to compress and index string dictionaries, w...
Tries are popular data structures for storing a set of strings, where common prefixes are represente...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
String dictionaries constitute a large portion of the memory footprint of database applications. Whi...
[[abstract]]The past few years have witnessed several exciting results on compressed representation ...
Let T be a string with n characters over an alphabet of constant size. A recent breakthrough on comp...
The past few years have witnessed several exciting results on compressed representation of a string ...
We show that greedy left-to-right (right-to-left) parsing is optimal w.r.t. a suffix (prefix) dictio...
In this paper we present different solutions for the problem of indexing a dictionary of strings in ...
The original publication is available at www.springerlink.comThe past few years have witnessed sever...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...
We design two compressed data structures for the full-text indexing problem that support efficient s...
The need to store and query a set of strings { a string dictionary { arises in many kinds of applica...
Artículo de publicación ISIThe need to store and query a set of strings - a string dictionary - aris...
[Abstract] We introduce a new family of compressed data structures to efficiently store and query la...
In this thesis, we will illustrate a two-level approach to compress and index string dictionaries, w...
Tries are popular data structures for storing a set of strings, where common prefixes are represente...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
String dictionaries constitute a large portion of the memory footprint of database applications. Whi...
[[abstract]]The past few years have witnessed several exciting results on compressed representation ...
Let T be a string with n characters over an alphabet of constant size. A recent breakthrough on comp...
The past few years have witnessed several exciting results on compressed representation of a string ...
We show that greedy left-to-right (right-to-left) parsing is optimal w.r.t. a suffix (prefix) dictio...
In this paper we present different solutions for the problem of indexing a dictionary of strings in ...
The original publication is available at www.springerlink.comThe past few years have witnessed sever...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...
We design two compressed data structures for the full-text indexing problem that support efficient s...