Abstract. Motivated by the problem of counting unique visitors to a website, we consider how to preprocess a string s[1..n] such that later, given a substring’s endpoints, we can quickly count how many distinct characters that substring contains. The smallest reasonably fast previous data structure for this problem takes n log σ + O(n log log n) bits and answers queries in O(log n) time. We give a data structure for this problem that takes nH0(s) + O(n) + o(nH0(s)) bits, where H0(s) is the 0th-order empirical entropy of s, and answers queries in O(log ℓ) time, where ℓ is the length of the query substring. As far as we know, this is the first data structure where the query time depends only on ℓ and not on n. We also show how our data structure ca...
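To make the query in the abstract above concrete, here is a minimal sketch of the problem itself, not of the compressed data structure the abstract describes. It relies on a standard observation: a position in the query range contributes a new distinct character exactly when the previous occurrence of its character lies before the range. The function names `build_prev` and `count_distinct` are hypothetical, indices are 0-based (the abstract uses 1-based s[1..n]), and each query scans the range in time linear in its length, with no attempt at the space or time bounds stated above.

```python
def build_prev(s):
    """For each position i, store the index of the previous occurrence
    of s[i], or -1 if s[i] has not appeared before."""
    last_seen = {}
    prev = []
    for i, ch in enumerate(s):
        prev.append(last_seen.get(ch, -1))
        last_seen[ch] = i
    return prev


def count_distinct(prev, lo, hi):
    """Count distinct characters in s[lo..hi] (0-based, inclusive).

    A position i in the range is the first occurrence of its character
    within the range exactly when prev[i] < lo, so counting those
    positions counts the distinct characters.
    """
    return sum(1 for i in range(lo, hi + 1) if prev[i] < lo)


if __name__ == "__main__":
    s = "abracadabra"
    prev = build_prev(s)
    # Characters of the whole string: a, b, r, c, d
    print(count_distinct(prev, 0, len(s) - 1))
    # Characters of "aca": a, c
    print(count_distinct(prev, 3, 5))
```

The preprocessing pass is linear in n. Faster queries would need a range-counting structure over the `prev` array, which is beyond this sketch.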
Abstract. Motivated by the imminent growth of massive, highly redundant genomic databases we study ...
We revisit the problem of indexing a string S[1..n] to support finding all substrings in S that matc...
Abstract. The Karp-Rabin fingerprint of a string is a type of hash value that due to its strong prop...
In this paper we study the problem of estimating the number of occurrences of substrings in textual ...
We consider the problem of storing a string S in dynamic compressed form, while permitting operation...
Abstract. We consider the problem of storing a dynamic string S over an alphabet Σ = {1, ..., σ} in ...
A subsequence is obtained from a string by deleting any number of characters; thus in contrast to a ...
Suppose an oracle knows a string S that is unknown to us and that we want to determine. The oracle c...
Subsequence pattern matching problems on compressed text were first considered by Cegielski et al. (...
We consider the problem of encoding a string of length n from an integer alphabet of size σ so access,...
Abstract. Bille and Gørtz (2011) recently introduced the problem of substring range counting, for wh...
We design two compressed data structures for the full-text indexing problem that support efficient s...
Let T[1, n] be a string of length n and T[i, j] be the substring of T starting at position i and end...
Given a sequence S = s_1 s_2 ... s_n of integers smaller than r = O(polylog(n)), we show how S can b...
Abstract. In this paper we investigate the problem of building a static data structure that represen...