The Elias-Fano representation of monotone sequences has been recently applied to the compression of inverted indexes, showing excellent query performance thanks to its efficient random access and search operations. While its space oc-cupancy is competitive with some state-of-the-art methods such as γ-δ-Golomb codes and PForDelta, it fails to exploit the local clustering that inverted lists usually exhibit, namely the presence of long subsequences of close identifiers. In this paper we describe a new representation based on partitioning the list into chunks and encoding both the chunks and their endpoints with Elias-Fano, hence forming a two-level data structure. This partitioning enables the encoding to better adapt to the local statistics ...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
Indexing highly repetitive collections has become a relevant problem with the emergence of large rep...
In this paper we design two compressed data structures for the full-text indexing problem. These da...
State-of-the-art encoders for inverted indexes compress each posting list individually. Encoding clu...
State-of-the-art encoders for inverted indexes compress each posting list individually. Encoding clu...
A practical and simple self-indexing data structure, Partitioned Elias-Fano (PEF) - Compressed Suffi...
Given a sequence S = s_1 s_2 ... s_n of integers smaller than r = O(polylog(n)), we show how S can b...
The data structure at the core of large-scale search engines is the inverted index, which is essenti...
Compression reduces both the size of indexes and the time needed to evaluate queries. In this paper,...
Inverted indexes are usually represented by dividing posting lists into constant-sized blocks and re...
Inverted indexes are usually represented by dividing posting lists into constant-sized blocks and re...
We design two compressed data structures for the full-text indexing problem that support efficient s...
Dictionary-based compression schemes provide fast decoding operation, typically at the expense of re...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
[[abstract]]A new trend in the field of pattern matching is to design indexing data structures which...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
Indexing highly repetitive collections has become a relevant problem with the emergence of large rep...
In this paper we design two compressed data structures for the full-text indexing problem. These da...
State-of-the-art encoders for inverted indexes compress each posting list individually. Encoding clu...
State-of-the-art encoders for inverted indexes compress each posting list individually. Encoding clu...
A practical and simple self-indexing data structure, Partitioned Elias-Fano (PEF) - Compressed Suffi...
Given a sequence S = s_1 s_2 ... s_n of integers smaller than r = O(polylog(n)), we show how S can b...
The data structure at the core of large-scale search engines is the inverted index, which is essenti...
Compression reduces both the size of indexes and the time needed to evaluate queries. In this paper,...
Inverted indexes are usually represented by dividing posting lists into constant-sized blocks and re...
Inverted indexes are usually represented by dividing posting lists into constant-sized blocks and re...
We design two compressed data structures for the full-text indexing problem that support efficient s...
Dictionary-based compression schemes provide fast decoding operation, typically at the expense of re...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
[[abstract]]A new trend in the field of pattern matching is to design indexing data structures which...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
Indexing highly repetitive collections has become a relevant problem with the emergence of large rep...
In this paper we design two compressed data structures for the full-text indexing problem. These da...