Abstract. De Brujin graphs are widely used in bioinformatics for pro-cessing next-generation sequencing data. Due to a very large size of NGS datasets, it is essential to represent de Bruijn graphs compactly, and several approaches to this problem have been proposed recently. In this work, we show how to reduce the memory required by the algorithm of [3] that represents de Brujin graphs using Bloom filters. Our method requires 30 % to 40 % less memory with respect to the method of [3], with insignificant impact to construction time. At the same time, our experiments showed a better query time compared to [3]. This is, to our knowledge, the best practical representation for de Bruijn graphs.
A colored de Bruijn graph (also called a set of k-mer sets), is a set of k-mers with every k-mer ass...
Motivation: The de Bruijn graph has become a ubiquitous graph model for biological data ever since i...
Abstract. High throughput sequencing technologies have become fast and cheap in the past years. As a...
12 pages, submittedInternational audienceDe Brujin graphs are widely used in bioinformatics for proc...
Abstract. De Brujin graphs are widely used in bioinformatics for pro-cessing next-generation sequenc...
International audienceThe de Bruijn graph data structure is widely used in next-generation sequencin...
International audienceThe de Bruijn graph plays an important role in bioinformatics, especially in t...
The de Bruijn graph has become a standard method in the analysis of sequencing reads in computationa...
Abstract. The de Bruijn graph plays an important role in bioinformatics, especially in the context o...
International audienceAs datasets of DNA reads grow rapidly, it becomes more and more important to r...
Abstract—Ongoing improvements in Computational Biology (CB) research have generated massive amounts ...
Publisher's version (útgefin grein)Memory consumption of de Bruijn graphs is often prohibitive. Most...
Compacted de Bruijn graphs are one of the most fundamental data structures in computational genomics...
A colored de Bruijn graph (also called a set of k-mer sets), is a set of k-mers with every k-mer ass...
Motivation: The de Bruijn graph has become a ubiquitous graph model for biological data ever since i...
Abstract. High throughput sequencing technologies have become fast and cheap in the past years. As a...
12 pages, submittedInternational audienceDe Brujin graphs are widely used in bioinformatics for proc...
Abstract. De Brujin graphs are widely used in bioinformatics for pro-cessing next-generation sequenc...
International audienceThe de Bruijn graph data structure is widely used in next-generation sequencin...
International audienceThe de Bruijn graph plays an important role in bioinformatics, especially in t...
The de Bruijn graph has become a standard method in the analysis of sequencing reads in computationa...
Abstract. The de Bruijn graph plays an important role in bioinformatics, especially in the context o...
International audienceAs datasets of DNA reads grow rapidly, it becomes more and more important to r...
Abstract—Ongoing improvements in Computational Biology (CB) research have generated massive amounts ...
Publisher's version (útgefin grein)Memory consumption of de Bruijn graphs is often prohibitive. Most...
Compacted de Bruijn graphs are one of the most fundamental data structures in computational genomics...
A colored de Bruijn graph (also called a set of k-mer sets), is a set of k-mers with every k-mer ass...
Motivation: The de Bruijn graph has become a ubiquitous graph model for biological data ever since i...
Abstract. High throughput sequencing technologies have become fast and cheap in the past years. As a...