The suffix array is the key to efficient solutions for a myriad of string processing problems in application domains such as data compression, data mining, and bioinformatics. With the rapid growth of available data, suffix array construction algorithms have to be adapted to advanced computational models such as external memory and distributed computing. In this article, we present five suffix array construction algorithms that use Thrill, a new algorithmic big data batch processing framework, which enables scalable processing on distributed systems of inputs whose sizes reach orders of magnitude not considered before.
The suffix array is a data structure that finds numerous applications in string processing p...
Text search is a classical problem in Computer Science, with many data-intensive applications. For t...
An algorithm for the parallel construction of suffix arrays generation for any texts with larger alp...
Abstract. Suffix arrays are a simple and powerful data structure for text processing that can be use...
This dissertation focuses on two fundamental sorting problems: string sorting and suffix sorting. Th...
Suffix arrays are a simple and powerful data structure for text processing that can be used for full...
We present a fast space-efficient algorithm for constructing compressed suffix arrays (CSA). The alg...
An algorithm for the distributed computation of suffix arrays for large texts is presented. The para...
Suffix arrays are a simple and powerful data structure for text processing that can be used for full...
The construction of full-text indexes on very large text collections is nowadays a hot problem. Th...
The construction of full-text indexes on very large text collections is nowadays a hot problem. The ...
We present a parallel algorithm for lexicographically sorting the suffixes of a string. Suffix sorti...
We investigate the problem of building full-text substring indexes for inputs significantly larger t...
We present a new variant of the suffix tree called a distributed suffix tree (DST) which all...
String indexes such as the suffix array (SA) and the closely related longest common prefix (LCP) arr...