Most modern lossless data compression techniques used today, are based in dictionaries. If some string of data being compressed matches a portion previously seen, then such string is included in the dictionary and its reference is included every time it appears. A possible generalization of this scheme is to consider not only strings made of consecutive symbols, but more general patterns with gaps between its symbols. The main problems with this approach are the complexity of pattern discovery algorithms and the complexity for the selection of a good subset of patterns. In this paper we address the last of these problems. We demonstrate that such problem is NP-complete and we provide some preliminary results about heuristics that points to ...
AbstractA gapped pattern is a sequence consisting of regular alphabet symbols and of joker symbols t...
Abstract. We study the fully compressed pattern matching problem (FCPM problem): Given T and P which...
It has long been known that the compression redundancy of independent and identically distributed (i...
Abstract. Most modern lossless data compression techniques used today, are based in dictionaries. If...
. We survey the complexity issues related to several algorithmic problems for compressed one- and tw...
We consider gapped variants of classical data compression paradigms (Ziv, J. and Lempel, A.,1977, 19...
In this paper we focus on the problem of compressed pattern matching for the text compression using ...
TR-COSC 07/01This paper provides a survey of techniques for pattern matching in compressed text and ...
We introduce a general framework which is suitable to capture an essence of compressed pattern match...
The String-to-Dictionary Matching Problem is defined, in which a string is searched for in all the p...
AbstractWe introduce a general framework which is suitable to capture the essence of compressed patt...
Pattern mining based on data compression has been successfully applied in many data mining tasks. Fo...
Pattern mining is one of the best-known concepts in Data Mining. A big problem in pattern mining is ...
Compression based pattern mining has been successfully applied to many data mining tasks. We propose...
Abstract. We study the fully compressed pattern matching problem (FCPM problem): Given T and P which...
AbstractA gapped pattern is a sequence consisting of regular alphabet symbols and of joker symbols t...
Abstract. We study the fully compressed pattern matching problem (FCPM problem): Given T and P which...
It has long been known that the compression redundancy of independent and identically distributed (i...
Abstract. Most modern lossless data compression techniques used today, are based in dictionaries. If...
. We survey the complexity issues related to several algorithmic problems for compressed one- and tw...
We consider gapped variants of classical data compression paradigms (Ziv, J. and Lempel, A.,1977, 19...
In this paper we focus on the problem of compressed pattern matching for the text compression using ...
TR-COSC 07/01This paper provides a survey of techniques for pattern matching in compressed text and ...
We introduce a general framework which is suitable to capture an essence of compressed pattern match...
The String-to-Dictionary Matching Problem is defined, in which a string is searched for in all the p...
AbstractWe introduce a general framework which is suitable to capture the essence of compressed patt...
Pattern mining based on data compression has been successfully applied in many data mining tasks. Fo...
Pattern mining is one of the best-known concepts in Data Mining. A big problem in pattern mining is ...
Compression based pattern mining has been successfully applied to many data mining tasks. We propose...
Abstract. We study the fully compressed pattern matching problem (FCPM problem): Given T and P which...
AbstractA gapped pattern is a sequence consisting of regular alphabet symbols and of joker symbols t...
Abstract. We study the fully compressed pattern matching problem (FCPM problem): Given T and P which...
It has long been known that the compression redundancy of independent and identically distributed (i...