We divide a string into k segments, each with only one sort of symbols, so as to minimize the total number of exceptions. Motivations come from machine learning and data mining. For binary strings we develop a linear-timealgorithm for any k. Key to efficiency is a special-purpose data structure, called W-tree, which reflects relations between repetition lengths of symbols. For non-binary strings we give a nontrivial dynamic programming algorithm. Our problem is equivalent to finding weighted independent sets with certain size constraints, either in paths (binary case) or special interval graphs (general case). We also show that this problem is FPT in bounded-degree graphs
AbstractA suitably weighted Index Tree such as a B-tree or a Suffix Tree can be easily adapted to st...
AbstractGiven a pattern string P=p1p2⋯pm and K parallel text strings T={Tk=t1k⋯tnk|1⩽k⩽K} over an in...
In 1970, Knuth, Pratt, and Morris [1] showed how to do basic pattern matching in linear time. Relate...
We divide a string into k segments, each with only one sort of symbols, so as to minimize the total ...
We study the design of efficient algorithms for combinatorial pattern matching. More concretely, we ...
Abstract. Binary jumbled pattern matching asks to preprocess a binary string S in order to answer qu...
We study the complexity of the problem of searching for a set of patterns that separate two given se...
A repetition in a string of letters consists of exact concatenations of identical factors of the str...
Abstract. In this paper, we propose the definition of a measure for sets of strings of length not gr...
The prefix table of a string is one of the most fundamental data structures of algorithms on strings...
Strings are generated by sequences of independent draws from a given alphabet. For a given pattern H...
AbstractAn algorithm is presented to detect—within optimal time O(n log n) and space O(n), off-line ...
A subsequence is obtained from a string by deleting any number of characters; thus in contrast to a ...
We study the problem of mining substring patterns from string databases. Patterns are selected using...
Abstract. The shapes of binary trees can be encoded as permutations having a very special property. ...
AbstractA suitably weighted Index Tree such as a B-tree or a Suffix Tree can be easily adapted to st...
AbstractGiven a pattern string P=p1p2⋯pm and K parallel text strings T={Tk=t1k⋯tnk|1⩽k⩽K} over an in...
In 1970, Knuth, Pratt, and Morris [1] showed how to do basic pattern matching in linear time. Relate...
We divide a string into k segments, each with only one sort of symbols, so as to minimize the total ...
We study the design of efficient algorithms for combinatorial pattern matching. More concretely, we ...
Abstract. Binary jumbled pattern matching asks to preprocess a binary string S in order to answer qu...
We study the complexity of the problem of searching for a set of patterns that separate two given se...
A repetition in a string of letters consists of exact concatenations of identical factors of the str...
Abstract. In this paper, we propose the definition of a measure for sets of strings of length not gr...
The prefix table of a string is one of the most fundamental data structures of algorithms on strings...
Strings are generated by sequences of independent draws from a given alphabet. For a given pattern H...
AbstractAn algorithm is presented to detect—within optimal time O(n log n) and space O(n), off-line ...
A subsequence is obtained from a string by deleting any number of characters; thus in contrast to a ...
We study the problem of mining substring patterns from string databases. Patterns are selected using...
Abstract. The shapes of binary trees can be encoded as permutations having a very special property. ...
AbstractA suitably weighted Index Tree such as a B-tree or a Suffix Tree can be easily adapted to st...
AbstractGiven a pattern string P=p1p2⋯pm and K parallel text strings T={Tk=t1k⋯tnk|1⩽k⩽K} over an in...
In 1970, Knuth, Pratt, and Morris [1] showed how to do basic pattern matching in linear time. Relate...