The success of high-throughput sequencing has led to an increasing number of projects that sequence large populations of a species. Storing and analyzing the sequence data is a key challenge in these projects because of the sheer size of the datasets. Compression is one simple technology for dealing with this challenge. Referential factorization and compression schemes, which store only the differences between an input sequence and a reference sequence, have attracted considerable interest in this field. Highly similar sequences, e.g., human genomes, can be compressed with a compression ratio of 1,000:1 or more, up to two orders of magnitude better than with standard compression techniques. Recently, it was shown that the compression against multiple referen...
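To make the referential idea described above concrete, the following is a minimal sketch of a greedy factorization against a single reference. It is not the scheme of any paper listed here: the function names (`build_index`, `compress`, `decompress`), the k-mer index, and the parameter `k` are all assumptions chosen for illustration. The target is encoded as a list of `(reference_position, length)` copies plus single literal characters where no sufficiently long match exists.

```python
from collections import defaultdict


def build_index(reference, k):
    """Map every k-mer of the reference to the positions where it starts."""
    index = defaultdict(list)
    for pos in range(len(reference) - k + 1):
        index[reference[pos:pos + k]].append(pos)
    return index


def compress(reference, target, k=4):
    """Greedily factorize `target` against `reference` (illustrative only).

    Each factor is either a (ref_pos, length) tuple, meaning "copy `length`
    characters from the reference starting at `ref_pos`", or a one-character
    string, meaning a literal not covered by a match of at least `k` characters.
    """
    index = build_index(reference, k)
    factors = []
    i = 0
    while i < len(target):
        best_pos, best_len = -1, 0
        # Try every reference position sharing the next k-mer and extend it.
        for pos in index.get(target[i:i + k], ()):
            length = k
            while (pos + length < len(reference)
                   and i + length < len(target)
                   and reference[pos + length] == target[i + length]):
                length += 1
            if length > best_len:
                best_pos, best_len = pos, length
        if best_len >= k:
            factors.append((best_pos, best_len))
            i += best_len
        else:
            factors.append(target[i])  # no usable match: emit a literal
            i += 1
    return factors


def decompress(reference, factors):
    """Rebuild the target from the reference and the factor list."""
    parts = []
    for factor in factors:
        if isinstance(factor, tuple):
            pos, length = factor
            parts.append(reference[pos:pos + length])
        else:
            parts.append(factor)
    return "".join(parts)
```

For two sequences that differ by a single substitution, this sketch yields a copy, one literal, and another copy, which hints at why highly similar genomes compress so well against a reference. A practical tool would additionally entropy-code the factors and use a more memory-efficient index; both are omitted here.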
Background: High-throughput sequencing (HTS) technologies play important roles in the life sciences by...
The growing volume of generated DNA sequencing data makes the problem of its long-term storage incre...
Classical DNA sequence compression algorithms consider only intra-sequence similarity, i.e., similar...
Modern high-throughput sequencing technologies are able to generate DNA sequences at an ev...
The decreasing cost of genome sequencing is creating a demand for scalable storage and processing t...
Motivation: Genomic repositories are rapidly growing, as witnessed by the 1000 Genomes or the UK10K ...
As a result of next generation sequencing technologies, during the last decade many studies have aim...
Relative compression, where a set of similar strings is compressed with respect to a reference stri...
Traditionally, intra-sequence similarity is exploited for compressing a single DNA sequence. Recentl...
© The Author 2017. Published by Oxford University Press. All rights reserved. Motivation: The rapidl...
Motivation: Storing, transferring, and maintaining genomic databases becomes a major challenge beca...
The increasing volume of biological data requires finding new ways to save these data in genetic ban...