Abstract. We present an efficient algorithm for calculating q-gram frequencies on strings represented in compressed form, namely, as a straight line program (SLP). Given an SLP T of size n that represents string T, the algorithm computes the occurrence frequencies of all q-grams in T, by reducing the problem to the weighted q-gram frequencies problem on a trie-like structure of size m = jT j dup(q; T), where dup(q; T) is a quantity that represents the amount of redundancy that the SLP captures with respect to q-grams. The reduced problem can be solved in linear time. Since m = O(qn), the running time of our algorithm is O(minfjT jdup(q; T); qng), improving our previous O(qn) algorithm when q = Ω(jT j=n).
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
We propose a general approach for frequency based string mining, which has many applications, e.g. i...
We present simple and efficient algorithms for calculating q-gram frequencies on strings represented...
AbstractWe present simple and efficient algorithms for calculating q-gram frequencies on strings rep...
A popular and much studied class of filters for approximate string matching is based on finding comm...
The q-gram filter is a popular filtering method for approximate string matching. It compares substri...
Abstract. Text mining from large scaled data is of great importance in computer sci-ence. In this pa...
This paper considers the issue of frequency consolidation in lists of different length word n-grams ...
This paper considers the issue of frequency consolidation in lists of different length word n-grams ...
AbstractTwo strings parameterize match if there is a bijection defined on the alphabet that transfor...
The paper describes a simple and fast randomized test for equality of grammar-compressed strings. Th...
Abstract. Motivated by the imminent growth of massive, highly redun-dant genomic databases we study ...
In this article, a fully compressed pattern matching problem is studied. The compression is represen...
Given a text T of length n and a pattern P of length m, the string matching problem is a task to fin...
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
We propose a general approach for frequency based string mining, which has many applications, e.g. i...
We present simple and efficient algorithms for calculating q-gram frequencies on strings represented...
AbstractWe present simple and efficient algorithms for calculating q-gram frequencies on strings rep...
A popular and much studied class of filters for approximate string matching is based on finding comm...
The q-gram filter is a popular filtering method for approximate string matching. It compares substri...
Abstract. Text mining from large scaled data is of great importance in computer sci-ence. In this pa...
This paper considers the issue of frequency consolidation in lists of different length word n-grams ...
This paper considers the issue of frequency consolidation in lists of different length word n-grams ...
AbstractTwo strings parameterize match if there is a bijection defined on the alphabet that transfor...
The paper describes a simple and fast randomized test for equality of grammar-compressed strings. Th...
Abstract. Motivated by the imminent growth of massive, highly redun-dant genomic databases we study ...
In this article, a fully compressed pattern matching problem is studied. The compression is represen...
Given a text T of length n and a pattern P of length m, the string matching problem is a task to fin...
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
We propose an indexing technique for approximate text searching, which is practical and powerful, an...
We propose a general approach for frequency based string mining, which has many applications, e.g. i...