Inverted indexes are usually represented by dividing posting lists into constant-sized blocks and representing them with an encoder for sequences of integers. Different encoders yield a different point in the space-time trade-off curve, with the fastest being several times larger than the most space-efficient. An important design decision for an index is thus the choice of the fastest encoding method such that the index fits in the available memory. However, a better usage of the space budget could be obtained by using faster encoders for frequently accessed blocks, and more space-efficient ones those that are rarely accessed. To perform this choice optimally, we introduce a linear time algorithm that, given a query distribution and a set o...
Efficient access to the inverted index data structure is a key aspect for a search engine to achieve...
Inverted index structures are a core element of current text retrieval systems. They can be construc...
We introduce a new representation of the inverted index that performs faster ranked unions and inter...
Inverted indexes are usually represented by dividing posting lists into constant-sized blocks and re...
Compression reduces both the size of indexes and the time needed to evaluate queries. In this paper,...
The data structure at the core of large-scale search engines is the inverted index, which is essenti...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
Magíster en Ciencias, Mención ComputaciónWeb search has become an important part of day-to-day life....
For free-text search over rapidly evolving corpora, dynamic update of inverted indices is a basic re...
Modern text analytics applications operate on large volumes of temporal text data such as Web archiv...
The explosion of big data poses a serious problem to the efficient retrieval and management of infor...
The issue of reducing the space overhead when indexing large text databases is becoming more and mor...
The Elias-Fano representation of monotone sequences has been recently applied to the compression of ...
For text retrieval systems, the assumption that all data structures reside in main memory is increas...
Efficient access to the inverted index data structure is a key aspect for a search engine to achieve...
Inverted index structures are a core element of current text retrieval systems. They can be construc...
We introduce a new representation of the inverted index that performs faster ranked unions and inter...
Inverted indexes are usually represented by dividing posting lists into constant-sized blocks and re...
Compression reduces both the size of indexes and the time needed to evaluate queries. In this paper,...
The data structure at the core of large-scale search engines is the inverted index, which is essenti...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
Magíster en Ciencias, Mención ComputaciónWeb search has become an important part of day-to-day life....
For free-text search over rapidly evolving corpora, dynamic update of inverted indices is a basic re...
Modern text analytics applications operate on large volumes of temporal text data such as Web archiv...
The explosion of big data poses a serious problem to the efficient retrieval and management of infor...
The issue of reducing the space overhead when indexing large text databases is becoming more and mor...
The Elias-Fano representation of monotone sequences has been recently applied to the compression of ...
For text retrieval systems, the assumption that all data structures reside in main memory is increas...
Efficient access to the inverted index data structure is a key aspect for a search engine to achieve...
Inverted index structures are a core element of current text retrieval systems. They can be construc...
We introduce a new representation of the inverted index that performs faster ranked unions and inter...