Abstract Background Various indexing techniques have been applied by next generation sequencing read mapping tools. The choice of a particular data structure is a trade-off between memory consumption, mapping throughput, and construction time. Results We present the succinct hash index – a novel data structure for read mapping which is a variant of the classical q-gram index with a particularly small memory footprint occupying between 3.5 and 5.3 GB for a human reference genome for typical parameter settings. The succinct hash index features two novel seed selection algorithms (group seeding and variable-length seeding) and an efficient parallel construction algorithm, which we have implemented to design the FEM (Fast(F) and Efficient(E) re...
The growing volume of generated DNA sequencing data makes the problem of its long-term storage incre...
Abstract Background Seed location filtering is critical in DNA read mapping, a process where billion...
Motivation Mapping-based approaches have become limited in their application to very large sets o...
Abstract With the introduction of next-generation sequencing (NGS) technologies, we are facing an ex...
<p>With the introduction of next-generation sequencing (NGS) technologies, we are facing an exponent...
Motivation: Recently a number of programs have been proposed for mapping short reads to a reference ...
Motivation: Recently, a number of programs have been proposed for mapping short reads to a reference...
The high throughput of modern NGS sequencers coupled with the huge sizes of genomes currently analys...
Fast and robust algorithms and aligners have been developed to help the researchers in the analysis ...
While short read aligners, which predominantly use the FM-index, are able to easily index one or a f...
Motivation: The explosion of next-generation sequencing data has spawned the design of new algorithm...
International audienceAs genomes, transcriptomes and meta-genomes are being sequenced at a faster pa...
2011-11-02The breakthrough of second-generation sequencing has opened the door for many applications...
The analysis of next-generation sequencing (NGS) data is a major topic in bioinfor-matics: short rea...
We present Masai, a read mapper representing the state-of-the-art in terms of speed and accuracy. Ou...
The growing volume of generated DNA sequencing data makes the problem of its long-term storage incre...
Abstract Background Seed location filtering is critical in DNA read mapping, a process where billion...
Motivation Mapping-based approaches have become limited in their application to very large sets o...
Abstract With the introduction of next-generation sequencing (NGS) technologies, we are facing an ex...
<p>With the introduction of next-generation sequencing (NGS) technologies, we are facing an exponent...
Motivation: Recently a number of programs have been proposed for mapping short reads to a reference ...
Motivation: Recently, a number of programs have been proposed for mapping short reads to a reference...
The high throughput of modern NGS sequencers coupled with the huge sizes of genomes currently analys...
Fast and robust algorithms and aligners have been developed to help the researchers in the analysis ...
While short read aligners, which predominantly use the FM-index, are able to easily index one or a f...
Motivation: The explosion of next-generation sequencing data has spawned the design of new algorithm...
International audienceAs genomes, transcriptomes and meta-genomes are being sequenced at a faster pa...
2011-11-02The breakthrough of second-generation sequencing has opened the door for many applications...
The analysis of next-generation sequencing (NGS) data is a major topic in bioinfor-matics: short rea...
We present Masai, a read mapper representing the state-of-the-art in terms of speed and accuracy. Ou...
The growing volume of generated DNA sequencing data makes the problem of its long-term storage incre...
Abstract Background Seed location filtering is critical in DNA read mapping, a process where billion...
Motivation Mapping-based approaches have become limited in their application to very large sets o...