Sequence representations supporting not only direct access to their symbols, but also rank/select operations, are a fundamental building block in many compressed data structures. Several recent applications need to represent highly repetitive sequences, and classical statistical compression proves ineffective. We introduce, instead, grammar-based representations for repetitive sequences, which use up to 6% of the space needed by statistically compressed representations, and support direct access and rank/select operations within tens of microseconds. We demonstrate the impact of our structures in text indexing applications.
© 2016 Elsevier B.V. All rights reserved.
European Union, 690941 / FONDECYT, Chile, 1-140796 / CDTI EXP, 000645663/I...
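To make the operations named in the abstract concrete, the following is a minimal sketch of access, rank and select answered directly on a grammar, without decompressing the sequence. It is an illustration only and not the structures introduced in the paper: the toy grammar `RULES`, the function names, and the indexing conventions (0-based positions, 1-based occurrence numbers) are all assumptions made for this example.

```python
from functools import lru_cache

# Hypothetical toy grammar: each nonterminal maps to a pair (left, right);
# any symbol that is not a key of RULES is a terminal character.
RULES = {
    "A": ("a", "b"),   # A -> ab
    "B": ("A", "A"),   # B -> abab
    "C": ("B", "c"),   # C -> ababc
    "S": ("C", "C"),   # S -> ababcababc
}

@lru_cache(maxsize=None)
def length(sym):
    """Length of the string that sym expands to."""
    if sym not in RULES:
        return 1
    left, right = RULES[sym]
    return length(left) + length(right)

@lru_cache(maxsize=None)
def count(sym, c):
    """Number of occurrences of terminal c in the expansion of sym."""
    if sym not in RULES:
        return 1 if sym == c else 0
    left, right = RULES[sym]
    return count(left, c) + count(right, c)

def access(sym, i):
    """Symbol at 0-based position i of the expansion of sym."""
    while sym in RULES:
        left, right = RULES[sym]
        if i < length(left):
            sym = left
        else:
            i -= length(left)
            sym = right
    return sym

def rank(sym, c, i):
    """Occurrences of c among the first i symbols of the expansion of sym."""
    total = 0
    while sym in RULES:
        left, right = RULES[sym]
        if i <= length(left):
            sym = left
        else:
            # The whole left child lies inside the prefix: count it at once.
            total += count(left, c)
            i -= length(left)
            sym = right
    return total + (1 if i >= 1 and sym == c else 0)

def select(sym, c, j):
    """0-based position of the j-th (1-based) occurrence of c, or -1."""
    if j < 1 or j > count(sym, c):
        return -1
    pos = 0
    while sym in RULES:
        left, right = RULES[sym]
        if j <= count(left, c):
            sym = left
        else:
            # Skip the left child entirely and look for the rest on the right.
            j -= count(left, c)
            pos += length(left)
            sym = right
    return pos
```

In this toy grammar `S` expands to `ababcababc`, so `access("S", 4)` returns `"c"`, `rank("S", "a", 6)` returns `3`, and `select("S", "b", 3)` returns `6`. Each query walks one root-to-leaf path of the parse tree, so its cost depends on the grammar height rather than on the length of the sequence. The sketch memoizes lengths and counts per nonterminal for simplicity; a practical structure would have to store such information far more compactly.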
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
Grammar-based compression is a popular and powerful approach to compressing repetitive texts but unt...
Abstract. Sequence representations supporting not only direct access to their symbols, but also rank...
Abstract. Sequence representations supporting not only direct access to their symbols, but also rank...
An early partial version of this paper appeared in Proc. SPIRE 2014: G. Navarro, A. Ordóñez, Grammar...
Abstract. Operations rank and select over a sequence of symbols have many applications to the design o...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
The deep connection between the Burrows-Wheeler transform (BWT) and the so-called rank and select dat...
Abstract. Given a string S of length N on a fixed alphabet of σ symbols, a grammar compressor produc...
This thesis studies problems related to compressed full-text indexes. A full-text index is a data st...
A space-economical self-index on the grammar-based compression is proposed. The algorithm by (Sakam...
Abstract. Operations rank and select over a sequence of symbols have many applications to the design o...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
Grammar-based compression is a popular and powerful approach to compressing repetitive texts but unt...