Converting a set of sequencing reads into a lossless compact data structure that encodes all the relevant biological information is a major challenge. The classical approaches are to build the string graph or the de Bruijn graph (dBG) of some order k. Each has advantages over the other depending on the application. Still, the ideal setting would be to have an index of the reads that is easy to build and can be adapted to any type of biological analysis. In this paper we propose rBOSS, a new data structure based on the Burrows-Wheeler Transform (BWT), which gets close to that ideal. Our rBOSS simultaneously encodes all the dBGs of a set of sequencing reads up to some order k, and for any dBG node v, it can compute in O(k) time all the other ...
While long reads produced by third-generation sequencing technology from, e.g, Pacific Biosciences h...
Graph based non-linear reference structures such as variation graphs and colored de Bruijn graphs en...
One of the most computationally intensive tasks in computational biology is de novo genome assembly,...
AbstractDNA sequencing technologies have tremendously increased their throughput, and hence complica...
Assembling genomic sequences from a set of overlapping reads is one of the most fundamental problems...
Assembling DNA fragments based on their overlaps remains the main assembly paradigm with long DNA fr...
International audienceWith the increasing impact of genomics in life sciences, the inference of high...
AbstractDNA sequencing is the process of determining the exact order of the nucleotide bases of an i...
Genome assembly is a fundamental problem in Bioinformatics, where for a given set of overlapping sub...
Background: Assembling genomic sequences from a set of overlapping reads is one of the most fundamen...
The de Bruijn graph has become a standard method in the analysis of sequencing reads in computationa...
Graphs such as de Bruijn graphs and OLC (overlap-layout-consensus) graphs have been widely adopted f...
One of the most computationally intensive tasks in computational biology is de novo genome assembly,...
The evolution of the next generation sequencing technology increases the demand for efficient soluti...
International audienceDNA sequencing technologies have tremendously increased their throughput, and ...
While long reads produced by third-generation sequencing technology from, e.g, Pacific Biosciences h...
Graph based non-linear reference structures such as variation graphs and colored de Bruijn graphs en...
One of the most computationally intensive tasks in computational biology is de novo genome assembly,...
AbstractDNA sequencing technologies have tremendously increased their throughput, and hence complica...
Assembling genomic sequences from a set of overlapping reads is one of the most fundamental problems...
Assembling DNA fragments based on their overlaps remains the main assembly paradigm with long DNA fr...
International audienceWith the increasing impact of genomics in life sciences, the inference of high...
AbstractDNA sequencing is the process of determining the exact order of the nucleotide bases of an i...
Genome assembly is a fundamental problem in Bioinformatics, where for a given set of overlapping sub...
Background: Assembling genomic sequences from a set of overlapping reads is one of the most fundamen...
The de Bruijn graph has become a standard method in the analysis of sequencing reads in computationa...
Graphs such as de Bruijn graphs and OLC (overlap-layout-consensus) graphs have been widely adopted f...
One of the most computationally intensive tasks in computational biology is de novo genome assembly,...
The evolution of the next generation sequencing technology increases the demand for efficient soluti...
International audienceDNA sequencing technologies have tremendously increased their throughput, and ...
While long reads produced by third-generation sequencing technology from, e.g, Pacific Biosciences h...
Graph based non-linear reference structures such as variation graphs and colored de Bruijn graphs en...
One of the most computationally intensive tasks in computational biology is de novo genome assembly,...