Abstract. De Brujin graphs are widely used in bioinformatics for pro-cessing next-generation sequencing (NGS) data. Due to the very large size of NGS datasets, it is essential to represent de Bruijn graphs compactly, and several approaches to this problem have been proposed recently. In this work, we show how to reduce the memory required by the algorithm of Chikhi and Rizk (WABI, 2012) that represents de Brujin graphs using Bloom filters. Our method requires 30 % to 40 % less memory with re-spect to their method, with insignificant impact to construction time. At the same time, our experiments showed a better query time compared to their method. This is, to our knowledge, the best practical representation for de Bruijn graphs.
A colored de Bruijn graph (also called a set of k-mer sets), is a set of k-mers with every k-mer ass...
Motivation: The de Bruijn graph has become a ubiquitous graph model for biological data ever since i...
Abstract. High throughput sequencing technologies have become fast and cheap in the past years. As a...
Abstract. De Brujin graphs are widely used in bioinformatics for pro-cessing next-generation sequenc...
12 pages, submittedInternational audienceDe Brujin graphs are widely used in bioinformatics for proc...
International audienceThe de Bruijn graph data structure is widely used in next-generation sequencin...
The de Bruijn graph has become a standard method in the analysis of sequencing reads in computationa...
International audienceThe de Bruijn graph plays an important role in bioinformatics, especially in t...
Abstract. The de Bruijn graph plays an important role in bioinformatics, especially in the context o...
International audienceAs datasets of DNA reads grow rapidly, it becomes more and more important to r...
Abstract—Ongoing improvements in Computational Biology (CB) research have generated massive amounts ...
Publisher's version (útgefin grein)Memory consumption of de Bruijn graphs is often prohibitive. Most...
Compacted de Bruijn graphs are one of the most fundamental data structures in computational genomics...
A colored de Bruijn graph (also called a set of k-mer sets), is a set of k-mers with every k-mer ass...
Motivation: The de Bruijn graph has become a ubiquitous graph model for biological data ever since i...
Abstract. High throughput sequencing technologies have become fast and cheap in the past years. As a...
Abstract. De Brujin graphs are widely used in bioinformatics for pro-cessing next-generation sequenc...
12 pages, submittedInternational audienceDe Brujin graphs are widely used in bioinformatics for proc...
International audienceThe de Bruijn graph data structure is widely used in next-generation sequencin...
The de Bruijn graph has become a standard method in the analysis of sequencing reads in computationa...
International audienceThe de Bruijn graph plays an important role in bioinformatics, especially in t...
Abstract. The de Bruijn graph plays an important role in bioinformatics, especially in the context o...
International audienceAs datasets of DNA reads grow rapidly, it becomes more and more important to r...
Abstract—Ongoing improvements in Computational Biology (CB) research have generated massive amounts ...
Publisher's version (útgefin grein)Memory consumption of de Bruijn graphs is often prohibitive. Most...
Compacted de Bruijn graphs are one of the most fundamental data structures in computational genomics...
A colored de Bruijn graph (also called a set of k-mer sets), is a set of k-mers with every k-mer ass...
Motivation: The de Bruijn graph has become a ubiquitous graph model for biological data ever since i...
Abstract. High throughput sequencing technologies have become fast and cheap in the past years. As a...