International audienceMotivation: In many bioinformatics pipelines, k-mer counting is often a required step, with existing methods focusing on optimizing time or memory usage. These methods usually produce very large count tables explicitly representing k-mers themselves. Solutions avoiding explicit representation of k-mers include Minimal Perfect Hash Functions (MPHFs) or Count-Min sketches. The former is only applicable to static maps not subject to updates, while the latter suffers from potentially very large point-query errors, making it unsuitable when counters are required to be highly accurate. Results: We introduce Set-Min sketch, a sketching technique inspired by Count-Min sketch, for representing associative maps, more specificall...
Motivation: Counting the frequencies of k-mers in read libraries is often a first step in the analys...
Bioinformatics journal requires that we post only the pre-print, which does not include modification...
Summary: Counting all the k-mers (substrings of length k) in DNA/RNA sequencing reads is the prelimi...
International audiencek-mer counts are important features used by many bioinformatics pipelines. Exi...
International audienceMotivation: k-mer counting is a common task in bioinformatic pipelines, with m...
International audienceMotivation. k-mer counting is a common task in bioinformatic pipelines, with m...
Motivation: A major challenge in next-generation genome seque-ncing (NGS) is to assemble massive ove...
This is a talk given in the context of the BSC Life Sessions Abstract k-mers are used on a daily b...
The exponential growth of genomic data demands progress and research on scalable bioinformatics algo...
Motivation: Building the histogram of occurrences of every k-symbol long substring of nucleotide dat...
Motivation: Building the histogram of occurrences of every k-symbol long substring of nucleotide dat...
Counting the frequencies of k-mers in read libraries is often a first step in the analysis of high-t...
We propose a polynomial algorithm computing a minimum plain-text representation of k-mer sets, as we...
Motivation: The extraction of k-mers is a fundamental component in many complex analyses of large ne...
The exponential increase in publicly available sequencing data and genomic resources necessitates th...
Motivation: Counting the frequencies of k-mers in read libraries is often a first step in the analys...
Bioinformatics journal requires that we post only the pre-print, which does not include modification...
Summary: Counting all the k-mers (substrings of length k) in DNA/RNA sequencing reads is the prelimi...
International audiencek-mer counts are important features used by many bioinformatics pipelines. Exi...
International audienceMotivation: k-mer counting is a common task in bioinformatic pipelines, with m...
International audienceMotivation. k-mer counting is a common task in bioinformatic pipelines, with m...
Motivation: A major challenge in next-generation genome seque-ncing (NGS) is to assemble massive ove...
This is a talk given in the context of the BSC Life Sessions Abstract k-mers are used on a daily b...
The exponential growth of genomic data demands progress and research on scalable bioinformatics algo...
Motivation: Building the histogram of occurrences of every k-symbol long substring of nucleotide dat...
Motivation: Building the histogram of occurrences of every k-symbol long substring of nucleotide dat...
Counting the frequencies of k-mers in read libraries is often a first step in the analysis of high-t...
We propose a polynomial algorithm computing a minimum plain-text representation of k-mer sets, as we...
Motivation: The extraction of k-mers is a fundamental component in many complex analyses of large ne...
The exponential increase in publicly available sequencing data and genomic resources necessitates th...
Motivation: Counting the frequencies of k-mers in read libraries is often a first step in the analys...
Bioinformatics journal requires that we post only the pre-print, which does not include modification...
Summary: Counting all the k-mers (substrings of length k) in DNA/RNA sequencing reads is the prelimi...