The success of high-throughput sequencing has led to an increasing number of projects that sequence large populations of a species. Storing and analyzing the sequence data is a key challenge in these projects because of the sheer size of the datasets. Compression is one simple technology for dealing with this challenge. Referential factorization and compression schemes, which store only the differences between an input sequence and a reference sequence, have gained considerable interest in this field. Highly similar sequences, e.g., human genomes, can be compressed with a compression ratio of 1,000:1 and more, up to two orders of magnitude better than with standard compression techniques. Recently, it was shown that the compression against multiple refere...
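The idea of storing only the differences to a reference can be illustrated with a small sketch. The code below is not the scheme of any paper listed here; it is a minimal greedy factorization written under stated assumptions: the names `factorize`, `reconstruct`, and the parameter `min_match` are hypothetical, matches are found through a simple k-mer index of the reference, and the output is a plain list of copy and literal factors rather than a packed binary format.

```python
def factorize(reference: str, target: str, min_match: int = 4):
    """Greedy referential factorization (illustrative sketch).

    Returns a list of factors, each either
      ("copy", pos, length)  -> copy reference[pos:pos+length]
      ("lit", text)          -> raw characters with no long match
    """
    k = min_match
    # Index every k-mer of the reference by its start positions.
    index = {}
    for i in range(len(reference) - k + 1):
        index.setdefault(reference[i:i + k], []).append(i)

    factors, literal = [], []
    i = 0
    while i < len(target):
        best_pos, best_len = -1, 0
        # Try each reference position that shares the next k-mer.
        for pos in index.get(target[i:i + k], []):
            length = k
            while (pos + length < len(reference)
                   and i + length < len(target)
                   and reference[pos + length] == target[i + length]):
                length += 1
            if length > best_len:
                best_pos, best_len = pos, length
        if best_len >= k:
            if literal:  # flush pending raw characters first
                factors.append(("lit", "".join(literal)))
                literal = []
            factors.append(("copy", best_pos, best_len))
            i += best_len
        else:
            literal.append(target[i])
            i += 1
    if literal:
        factors.append(("lit", "".join(literal)))
    return factors


def reconstruct(reference: str, factors) -> str:
    """Rebuild the target from the reference and its factors."""
    parts = []
    for factor in factors:
        if factor[0] == "copy":
            _, pos, length = factor
            parts.append(reference[pos:pos + length])
        else:
            parts.append(factor[1])
    return "".join(parts)
```

For two highly similar sequences, most of the target is covered by a few long copy factors, which is the source of the large compression ratios reported for referential schemes. A production compressor would additionally entropy-code the factors and use a more compact index, such as a suffix array, instead of the dictionary assumed here.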
The increasing volume of biological data requires finding new ways to save these data in genetic ban...
Genomic data technology has advanced by using many algorithms that not only facilitate a meaningful ...
Classical DNA sequence compression algorithms consider only intra-sequence similarity, i.e., similar...
Modern high-throughput sequencing technologies are able to generate DNA sequences at an ev...
As a result of next generation sequencing technologies, during the last decade many studies have aim...
Motivation: Genomic repositories are rapidly growing, as witnessed by the 1000 Genomes or the UK10K ...
Relative compression, where a set of similar strings is compressed with respect to a reference stri...
The decreasing cost of genome sequencing is creating a demand for scalable storage and processing t...
Motivation: Genomic repositories are rapidly growing, as witnessed by the 1000 Genomes or the UK10K ...
Traditionally, intra-sequence similarity is exploited for compressing a single DNA sequence. Recentl...
xiii, 115 p. : ill. (some col.) ; 30 cm. PolyU Library Call No.: [THS] LG51 .H577M EIE 2009 Wu. Deoxyri...
© The Author 2017. Published by Oxford University Press. All rights reserved. Motivation: The rapidl...
Motivation: Storing, transferring, and maintaining genomic databases become a major challenge beca...
Motivation: New DNA sequencing technologies have enabled the rapid analysis of many thousands of geno...