Motivation: The growth of next-generation sequencing means that more effective and efficient archiving methods are needed to store the generated data for public dissemination and in anticipation of more mature analytical methods later. This article examines methods for compressing the quality score component of the data to partly address this problem. Results: We compare several compression policies for quality scores, in terms of both compression effectiveness and overall efficiency. The policies employ lossy and lossless transformations with one of several coding schemes. Experiments show that both lossy and lossless transformations are useful, and that simple coding methods, which consume fewer computing resources, are highly competitive,...
MOTIVATION: The storage and transmission of high-throughput sequencing data consumes significant res...
The dropping cost of sequencing human DNA has allowed for fast development of several projects aroun...
Motivation: The storage and transmission of high-throughput sequencing data consumes significant re...
Motivation: Rapid technological progress in DNA sequencing has stimulated interest in compressing th...
Motivation: The past decade has seen the introduction of new technologies that significantly lowered...
To the Editor: Most next-generation sequencing (NGS) quality scores are space intensive, redundant ...
Current NGS techniques are becoming exponentially cheaper. As a result, there is an exponential grow...
Abstract: Recent advancements in sequencing technology have led to a drastic reduction in the cost o...
DNA sequencing is the process of determining the ordered sequence of the four nucleotide bases in a ...
Massive amounts of sequencing data are being generated thanks to advances in sequencing technology a...
We present Quip, a lossless compression algorithm for next-generation sequencing data in the FASTQ a...
The increase in memory and in network traffic used and caused by new sequenced biological data has r...
The intensive research interest in studying genomes has led to a series of advances in DNA sequencin...