AbstractA suitably weighted Index Tree such as a B-tree or a Suffix Tree can be easily adapted to store, for a given string x and for all substrings w of x, the number of distinct instances of w along x. The storage needed is seen to be linear in the length of x: moreover, the whole statistics can itself be derived in linear time, off-line of a RAM. If the substring w has nontrivial periods, however, the number of distinct instances might differ from that of distinct non-overlapping occurrences along x. It is shown here that O(n log n) storage units—n standing for the length of x—are sufficient to organize this second kind of statistics, in such a way that the maximum number of nonoverlapping instances for arbitrary w along x can be retriev...
We study in a probabilistic framework some topics concerning the way words can overlap. Our probabil...
[[abstract]]Let T be a string with n characters over an alphabet of constant size. A recent breakthr...
Abstract. Bille and Gørtz (2011) recently introduced the problem of substring range counting, for wh...
The string statistics problem consists of preprocessing a string of length n such that given a query...
Given a textSnlng..:: of length It. the MASr (minimal alI.gmerued nJjJix tree) or I is a digital ind...
The identification of strings that are, by some measure, redundant or rare in the context of larger ...
AbstractWe study in a probabilistic framework some topics concerning the way words can overlap. Our ...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...
In highly repetitive strings, like collections of genomes from the same species, distinct measures o...
The suffix array is one of the most prevalent data structures for string indexing; it stores the lex...
AbstractAn algorithm is presented to detect—within optimal time O(n log n) and space O(n), off-line ...
International audienceSuffix trees are highly regarded data structures for text indexing and string ...
Abstract. Designing external memory data structures for string databases is of significant recent in...
We consider the problem of indexing a string t of length n to report the occurrences of a query patt...
The prefix table of a string is one of the most fundamental data structures of algorithms on strings...
We study in a probabilistic framework some topics concerning the way words can overlap. Our probabil...
[[abstract]]Let T be a string with n characters over an alphabet of constant size. A recent breakthr...
Abstract. Bille and Gørtz (2011) recently introduced the problem of substring range counting, for wh...
The string statistics problem consists of preprocessing a string of length n such that given a query...
Given a textSnlng..:: of length It. the MASr (minimal alI.gmerued nJjJix tree) or I is a digital ind...
The identification of strings that are, by some measure, redundant or rare in the context of larger ...
AbstractWe study in a probabilistic framework some topics concerning the way words can overlap. Our ...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...
In highly repetitive strings, like collections of genomes from the same species, distinct measures o...
The suffix array is one of the most prevalent data structures for string indexing; it stores the lex...
AbstractAn algorithm is presented to detect—within optimal time O(n log n) and space O(n), off-line ...
International audienceSuffix trees are highly regarded data structures for text indexing and string ...
Abstract. Designing external memory data structures for string databases is of significant recent in...
We consider the problem of indexing a string t of length n to report the occurrences of a query patt...
The prefix table of a string is one of the most fundamental data structures of algorithms on strings...
We study in a probabilistic framework some topics concerning the way words can overlap. Our probabil...
[[abstract]]Let T be a string with n characters over an alphabet of constant size. A recent breakthr...
Abstract. Bille and Gørtz (2011) recently introduced the problem of substring range counting, for wh...