On the Approximation Ratio of Ordered Parsings

Navarro G.
Ochoa C.
Prezza N.

Publication date

January 2020

DOI

10.1109/TIT.2020.3042746

Abstract

Shannon’s entropy is a clear lower bound for statistical compression. The situation is not so well understood for dictionary-based compression. A plausible lower bound is b, the least number of phrases of a general bidirectional parse of a text, where phrases can be copied from anywhere else in the text. Since computing b is NP-complete, a popular gold standard is z, the number of phrases in the Lempel-Ziv parse of the text, which is computed in linear time and yields the least number of phrases when those can be copied only from the left. Almost nothing has been known for decades about the approximation ratio of z with respect to b. In this paper we prove that z = O(b log(n/b)), where n is the text length. We also show that the bound is ti...

Extracted data

We use cookies to provide a better user experience.

Data Protection

On the Approximation Ratio of Ordered Parsings

Abstract

Extracted data

On the Approximation Ratio of Ordered Parsings

Abstract

Extracted data

Related items

Related items