Abstract. In this paper we present an efficient external memory algorithm to compute the string graph from a collection of reads, which is a fundamental data representation used for sequence assembly. Our algorithm builds upon some recent results on lightweight Burrows-Wheeler Transform (BWT) and Longest Common Prefix (LCP) construction providing, as a by-product, an efficient procedure to extend intervals of the BWT that could be of independent interest. We have implemented our algorithm and compared its efficiency against SGA— the most advanced assembly string graph construction program.
In this paper we describe algorithms for computing the Burrows-Wheeler Transform (bwt) and for build...
The Burrows Wheeler transform has applications in data compression as well as full text indexing. De...
The longest common prefix array is a very advantageous data structure that, combined with the suffix...
Sequencing technologies produce larger and larger collections of biosequences that have to be stored...
Recent progress in the field of \{DNA\} sequencing motivates us to consider the problem of computing...
In this paper we present external memory algorithms for some string problems. External memory algori...
Abstract Background Suffix arrays, augmented by additional data structures, allow solving efficientl...
We present a collection of new techniques for designing and analyzing efficient external-memory algo...
Motivation: New sequencing technologies generate larger amount of short reads data at decreasing cos...
We present a collection of new techniques for designing and analyzing efficient external-memory algo...
this paper we describe algorithms for computing the Burrows-Wheeler Transform (bwt) and for building...
We show that the Longest Common Prefix Array of a text collection of total size n on alphabet [1, σ]...
In this paper we describe algorithms for computing the Burrows-Wheeler Transform (bwt) and for build...
The Burrows Wheeler transform has applications in data compression as well as full text indexing. De...
The longest common prefix array is a very advantageous data structure that, combined with the suffix...
Sequencing technologies produce larger and larger collections of biosequences that have to be stored...
Recent progress in the field of \{DNA\} sequencing motivates us to consider the problem of computing...
In this paper we present external memory algorithms for some string problems. External memory algori...
Abstract Background Suffix arrays, augmented by additional data structures, allow solving efficientl...
We present a collection of new techniques for designing and analyzing efficient external-memory algo...
Motivation: New sequencing technologies generate larger amount of short reads data at decreasing cos...
We present a collection of new techniques for designing and analyzing efficient external-memory algo...
this paper we describe algorithms for computing the Burrows-Wheeler Transform (bwt) and for building...
We show that the Longest Common Prefix Array of a text collection of total size n on alphabet [1, σ]...
In this paper we describe algorithms for computing the Burrows-Wheeler Transform (bwt) and for build...
The Burrows Wheeler transform has applications in data compression as well as full text indexing. De...
The longest common prefix array is a very advantageous data structure that, combined with the suffix...