We examine a previously known sublinear-time algorithm for approximating the length of a string’s optimal (i.e. shortest) Lempel-Ziv parsing (a.k.a. LZ77 factorization). This length is a measure of compressibility under the LZ77 compression algorithm, so the algorithm also estimates a string’s compressibility. The algorithm’s approximation approach is based on a connection between optimal Lempel-Ziv parsing length and the number of distinct substrings of different lengths in a string. Some aspects of the algorithm are described more explicitly than in earlier work, including the constraints on its input and how to distinguish between strings with short vs. long optimal parsings in sublinear time; several proofs (and pseudocode listings) are...
Lyndon factorization and Lempel-Ziv (LZ) factorization are both important tools for analysing the st...
AbstractSheinwald, Lempel, and Ziv (1995,Inform. and Comput.116, 128–133) proved that the power of o...
Given a string S, the compressed indexing problem is to preprocess S into a compressed representatio...
This work raises the question of approximating the compressibility of a string with respect to a fix...
We present an algorithm that computes the Lempel-Ziv decomposition in O(n(log σ + log log n)) time a...
International audienceWe give a space-efficient simple algorithm for computing the Lempel-Ziv factor...
One of the most famous and investigated lossless data-compression schemes is the one introduced by L...
This paper investigates the size in bits of the LZ77 encoding, which is the most popular and efficie...
We present new algorithms for the sliding window Lempel-Ziv (LZ77) problem and the approximate right...
The objective of the shortest common superstring problem is to find a string of minimum length that ...
Lempel-Ziv factorization of a string is a fundamental tool that is used by myriad data compressors. ...
We present a deterministic algorithm that constructs in linear time and space the LZ-End parsing (a ...
We present an algorithm that constructs the LZ-End parsing (a variation of LZ77) of a given string o...
Lempel–Ziv (LZ77 or, briefly, LZ) is one of the most effective and widely-used compressors for repet...
Since 1977, when Lempel and Ziv described a kind of string factorization useful for text compression...
Lyndon factorization and Lempel-Ziv (LZ) factorization are both important tools for analysing the st...
AbstractSheinwald, Lempel, and Ziv (1995,Inform. and Comput.116, 128–133) proved that the power of o...
Given a string S, the compressed indexing problem is to preprocess S into a compressed representatio...
This work raises the question of approximating the compressibility of a string with respect to a fix...
We present an algorithm that computes the Lempel-Ziv decomposition in O(n(log σ + log log n)) time a...
International audienceWe give a space-efficient simple algorithm for computing the Lempel-Ziv factor...
One of the most famous and investigated lossless data-compression schemes is the one introduced by L...
This paper investigates the size in bits of the LZ77 encoding, which is the most popular and efficie...
We present new algorithms for the sliding window Lempel-Ziv (LZ77) problem and the approximate right...
The objective of the shortest common superstring problem is to find a string of minimum length that ...
Lempel-Ziv factorization of a string is a fundamental tool that is used by myriad data compressors. ...
We present a deterministic algorithm that constructs in linear time and space the LZ-End parsing (a ...
We present an algorithm that constructs the LZ-End parsing (a variation of LZ77) of a given string o...
Lempel–Ziv (LZ77 or, briefly, LZ) is one of the most effective and widely-used compressors for repet...
Since 1977, when Lempel and Ziv described a kind of string factorization useful for text compression...
Lyndon factorization and Lempel-Ziv (LZ) factorization are both important tools for analysing the st...
AbstractSheinwald, Lempel, and Ziv (1995,Inform. and Comput.116, 128–133) proved that the power of o...
Given a string S, the compressed indexing problem is to preprocess S into a compressed representatio...