The need to store and query a set of strings { a string dictionary { arises in many kinds of applications. While classically these string dictionaries have accounted for a small share of the total space budget (e.g., in Natural Language Processing or when indexing text collections), recent applications in Web engines, Seman-tic Web (RDF) graphs, Bioinformatics, and many others, handle very large string dictionaries, whose size is a signicant fraction of the whole data. In these cases, string dictionary management is a scalability issue by itself. This paper focuses on the problem of managing large static string dictionaries in compressed main memory space. We revisit classical solutions for string dictionaries like hashing, tries, and front...
We show that greedy left-to-right (right-to-left) parsing is optimal w.r.t. a suffix (prefix) dictio...
Current data structures for searching large string collec-tions are limited in that they either fail...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...
The need to store and query a set of strings – a string dictionary – arises in many kinds of applica...
Artículo de publicación ISIThe need to store and query a set of strings - a string dictionary - aris...
[Abstract] We introduce a new family of compressed data structures to efficiently store and query la...
In this thesis, we will illustrate a two-level approach to compress and index string dictionaries, w...
Tries are popular data structures for storing a set of strings, where common prefixes are represente...
String dictionaries constitute a large portion of the memory footprint of database applications. Whi...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
[[abstract]]The past few years have witnessed several exciting results on compressed representation ...
The past few years have witnessed several exciting results on compressed representation of a string ...
Let T be a string with n characters over an alphabet of constant size. A recent breakthrough on comp...
The original publication is available at www.springerlink.comThe past few years have witnessed sever...
In this paper we present different solutions for the problem of indexing a dictionary of strings in ...
We show that greedy left-to-right (right-to-left) parsing is optimal w.r.t. a suffix (prefix) dictio...
Current data structures for searching large string collec-tions are limited in that they either fail...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...
The need to store and query a set of strings – a string dictionary – arises in many kinds of applica...
Artículo de publicación ISIThe need to store and query a set of strings - a string dictionary - aris...
[Abstract] We introduce a new family of compressed data structures to efficiently store and query la...
In this thesis, we will illustrate a two-level approach to compress and index string dictionaries, w...
Tries are popular data structures for storing a set of strings, where common prefixes are represente...
String dictionaries constitute a large portion of the memory footprint of database applications. Whi...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
[[abstract]]The past few years have witnessed several exciting results on compressed representation ...
The past few years have witnessed several exciting results on compressed representation of a string ...
Let T be a string with n characters over an alphabet of constant size. A recent breakthrough on comp...
The original publication is available at www.springerlink.comThe past few years have witnessed sever...
In this paper we present different solutions for the problem of indexing a dictionary of strings in ...
We show that greedy left-to-right (right-to-left) parsing is optimal w.r.t. a suffix (prefix) dictio...
Current data structures for searching large string collec-tions are limited in that they either fail...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...