International audienceComputing suffix-prefix overlaps for a large collection of strings is a fundamental building block for the analysis of genomic next-generation sequencing data. The approximate suffix-prefix overlap problem is to find all pairs of strings from a given set such that a prefix of one string is similar to a suffix of the other. Välimäki et al. (Information and Com-putation, 2012) gave a solution to this problem based on suffix filters. In this work, we propose two improvements to the method of Välimäki et al. that reduce the running time of the computation
We study the string similarity search problem with edit-distance constraints, which, given a set of ...
International audienceApproximate searching using an index is an important application in many field...
We study the string similarity search problem with edit-distance constraints, which, given a set of ...
International audienceComputing suffix-prefix overlaps for a large collection of strings is a fundam...
AbstractFinding approximate overlaps is the first phase of many sequence assembly methods. Given a s...
The next-generation sequencing (NGS) technology outputs a huge number of sequences (reads) that requ...
Finding all longest suffix-prefix matches for a collection of strings is known as the all pairs suff...
The evolution of the next generation sequencing technology increases the demand for efficient soluti...
The next generation sequencing technology creates a huge number of sequences (reads), which constitu...
We introduce a new algorithm for constructing the generalized suffix array of a collection of highly...
International audienceThe inference of common motifs in a set of strings is a well-known problem wit...
All-pairs suffix-prefix matching is an important part of DNA sequence assembly where it is the most ...
We present a novel algorithmic framework for solving approximate sequence matching problems that per...
We show how to parallelize the optimal algorithm proposed by Tustumi et al. [19] to solve the all-pa...
In this paper, we propose Genomic-oriented Rapid Algorithm for String Pattern-match (GRASPm), an alg...
We study the string similarity search problem with edit-distance constraints, which, given a set of ...
International audienceApproximate searching using an index is an important application in many field...
We study the string similarity search problem with edit-distance constraints, which, given a set of ...
International audienceComputing suffix-prefix overlaps for a large collection of strings is a fundam...
AbstractFinding approximate overlaps is the first phase of many sequence assembly methods. Given a s...
The next-generation sequencing (NGS) technology outputs a huge number of sequences (reads) that requ...
Finding all longest suffix-prefix matches for a collection of strings is known as the all pairs suff...
The evolution of the next generation sequencing technology increases the demand for efficient soluti...
The next generation sequencing technology creates a huge number of sequences (reads), which constitu...
We introduce a new algorithm for constructing the generalized suffix array of a collection of highly...
International audienceThe inference of common motifs in a set of strings is a well-known problem wit...
All-pairs suffix-prefix matching is an important part of DNA sequence assembly where it is the most ...
We present a novel algorithmic framework for solving approximate sequence matching problems that per...
We show how to parallelize the optimal algorithm proposed by Tustumi et al. [19] to solve the all-pa...
In this paper, we propose Genomic-oriented Rapid Algorithm for String Pattern-match (GRASPm), an alg...
We study the string similarity search problem with edit-distance constraints, which, given a set of ...
International audienceApproximate searching using an index is an important application in many field...
We study the string similarity search problem with edit-distance constraints, which, given a set of ...