We introduce a new algorithm for constructing the generalized suffix array of a collection of highly similar strings. As a first step, we construct a compressed representation of the matching statistics of the collection with respect to a reference string. We then use this data structure to distribute suffixes into a partial order, and subsequently to speed up suffix comparisons to complete the generalized suffix array. Our experimental evidence with a prototype implementation (a tool we call sacamats) shows that on string collections with highly similar strings we can construct the suffix array in time competitive with or faster than the fastest available methods. Along the way, we describe a heuristic for fast computation of the matching ...
Suffix trees are one of the most versatile data structures in stringology, with many applications in...
AbstractThe suffix tree is one of the most important data structures in string processing and compar...
Computing the matching statistics of a string S with respect to a string T on an alphabet of size si...
We introduce a new algorithm for constructing the generalized suffix array of a collection of highly...
We present a parallel algorithm for lexicographically sorting the suffixes of a string. Suffix sorti...
AbstractWe propose a fast and memory-efficient algorithm for lexicographically sorting the suffixes ...
The suffix sorting problem is to construct the suffix array for an input sequence. Given a sequence ...
Abstract. The suffix tree of alignment is an index data structure for similar strings. Given an alig...
AbstractWe present a linear time algorithm to sort all the suffixes of a string over a large alphabe...
Suffix sorting is arguably the most fundamental building block in string algorithmics, like regular ...
Finding all longest suffix-prefix matches for a collection of strings is known as the all pairs suff...
Background: The construction of a suffix array for a collection of strings is a fundamental task in ...
We consider an index data structure for similar strings. The generalized suffix tree can be a soluti...
The suffix tree of alignment is an index data structure for similar strings. Given an alignment of s...
This thesis presents an application of a generalized suffix tree extended by the use of frequency of...
Suffix trees are one of the most versatile data structures in stringology, with many applications in...
AbstractThe suffix tree is one of the most important data structures in string processing and compar...
Computing the matching statistics of a string S with respect to a string T on an alphabet of size si...
We introduce a new algorithm for constructing the generalized suffix array of a collection of highly...
We present a parallel algorithm for lexicographically sorting the suffixes of a string. Suffix sorti...
AbstractWe propose a fast and memory-efficient algorithm for lexicographically sorting the suffixes ...
The suffix sorting problem is to construct the suffix array for an input sequence. Given a sequence ...
Abstract. The suffix tree of alignment is an index data structure for similar strings. Given an alig...
AbstractWe present a linear time algorithm to sort all the suffixes of a string over a large alphabe...
Suffix sorting is arguably the most fundamental building block in string algorithmics, like regular ...
Finding all longest suffix-prefix matches for a collection of strings is known as the all pairs suff...
Background: The construction of a suffix array for a collection of strings is a fundamental task in ...
We consider an index data structure for similar strings. The generalized suffix tree can be a soluti...
The suffix tree of alignment is an index data structure for similar strings. Given an alignment of s...
This thesis presents an application of a generalized suffix tree extended by the use of frequency of...
Suffix trees are one of the most versatile data structures in stringology, with many applications in...
AbstractThe suffix tree is one of the most important data structures in string processing and compar...
Computing the matching statistics of a string S with respect to a string T on an alphabet of size si...