A grammar compression algorithm, called GCIS, is introduced in this work. GCIS is based on the induced suffix sorting algorithm SAIS, presented by Nong et al. in 2009. The proposed solution builds on the factorization performed by SAIS during suffix sorting. A context-free grammar is used to replace factors by non-terminals. The algorithm is then recursively applied on the shorter sequence of non-terminals. The resulting grammar is encoded by exploiting some redundancies, such as common prefixes between right-hands of rules, sorted according to SAIS. GCIS excels for its low space and time required for compression while obtaining competitive compression ratios. Our experiments on regular and repetitive, moderate and very large texts, show th...
We present a simple linear-time algorithm constructing a~context-free grammar of size O(g \log (N/g)...
This paper describes a technique that develops models of symbol sequences in the form of small, huma...
Abstract. A space-efficient linear-time approximation algorithm for the grammar-based compression pr...
We introduce GCIS, a grammar compression algorithm based on the induced suffix sorting algorithm SAI...
We introduce a compression technique for suffix arrays. It is sensitive to the compressibility of th...
We report on a new experimental analysis of high-order entropy-compressed suffix arrays, which retai...
A space-economical self-index on the grammar-based compression is proposed. The algorithm by (Sakam...
The best general-purpose compression schemes make their gains by estimating a probability distributi...
196 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2000.We then turn to construction ...
Grammar-based compression is a well-studied technique to construct a context-free grammar (CFG) deri...
A grammar transform is a transformation that converts any data sequence to be compressed into a gram...
Sequence representations supporting not only direct access to their symbols, but also rank/select op...
This paper describes Grammar Learning by Partition Search, a general method for automatically constr...
Grammar based compression, where one replaces a long string by a small context-free grammar that gen...
Abstract. This paper gives a survey on recent progress in grammar-based compression for trees. Also ...
We present a simple linear-time algorithm constructing a~context-free grammar of size O(g \log (N/g)...
This paper describes a technique that develops models of symbol sequences in the form of small, huma...
Abstract. A space-efficient linear-time approximation algorithm for the grammar-based compression pr...
We introduce GCIS, a grammar compression algorithm based on the induced suffix sorting algorithm SAI...
We introduce a compression technique for suffix arrays. It is sensitive to the compressibility of th...
We report on a new experimental analysis of high-order entropy-compressed suffix arrays, which retai...
A space-economical self-index on the grammar-based compression is proposed. The algorithm by (Sakam...
The best general-purpose compression schemes make their gains by estimating a probability distributi...
196 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2000.We then turn to construction ...
Grammar-based compression is a well-studied technique to construct a context-free grammar (CFG) deri...
A grammar transform is a transformation that converts any data sequence to be compressed into a gram...
Sequence representations supporting not only direct access to their symbols, but also rank/select op...
This paper describes Grammar Learning by Partition Search, a general method for automatically constr...
Grammar based compression, where one replaces a long string by a small context-free grammar that gen...
Abstract. This paper gives a survey on recent progress in grammar-based compression for trees. Also ...
We present a simple linear-time algorithm constructing a~context-free grammar of size O(g \log (N/g)...
This paper describes a technique that develops models of symbol sequences in the form of small, huma...
Abstract. A space-efficient linear-time approximation algorithm for the grammar-based compression pr...