Classical DNA sequence compression algorithms consider only intra-sequence similarity, i.e., similar subsequences within the DNA sequence are found and encoded together. In this work, in addition to the intra-sequence similarity, we exploit the inter-sequence similarities in that similar subsequences are found within the DNA sequence as well as from other reference sequences. Hence, highly similar sequences from the same population or partially similar chromosome sequences of the same species can be compressed together to reduce the storage space. Experimental results show that the proposed scheme achieves good compressibility for both partially similar chromosome sequences and highly similar population sequences.Department of Electronic an...
DNA compression has been a subject of great interest since the availability of genomic databases. Al...
The exponential growth of high-throughput DNA sequence data has posed great challenges to genomic da...
In bio-sequence repositories and other applications, like for instance in the production of a Cd-rom...
Traditionally, intra-sequence similarity is exploited for compressing a single DNA sequence. Recentl...
xiii, 115 p. : ill. (some col.) ; 30 cm.PolyU Library Call No.: [THS] LG51 .H577M EIE 2009 WuDeoxyri...
2007 International Symposium on Computational Models for Life Sciences, CMLS '07, Gold Coast, QLD, 1...
Current DNA compression algorithms rely on finding repetitions within the DNA sequence so that simil...
DNA Sequence Compression can be achieved through exploiting the intra-sequence and inter-sequence si...
The increasing volume of biological data requires finding new ways to save these data in genetic ban...
Current DNA compression algorithms work by finding similar repeated regions within the DNA sequence ...
Relative compression, where a set of similar strings are compressed with respect to a reference stri...
Data Storage costs have an appreciable proportion of total cost in the creation and analysis of DNA ...
With increasing number of DNA sequences being discovered the problem of storing and using genomic da...
Rapid advancements in research in the field of DNA sequence discovery has led to a vast range of com...
compression algorithms: the case of approximate tandem repeats in DNA sequences E.Rivals14, O.Delgra...
DNA compression has been a subject of great interest since the availability of genomic databases. Al...
The exponential growth of high-throughput DNA sequence data has posed great challenges to genomic da...
In bio-sequence repositories and other applications, like for instance in the production of a Cd-rom...
Traditionally, intra-sequence similarity is exploited for compressing a single DNA sequence. Recentl...
xiii, 115 p. : ill. (some col.) ; 30 cm.PolyU Library Call No.: [THS] LG51 .H577M EIE 2009 WuDeoxyri...
2007 International Symposium on Computational Models for Life Sciences, CMLS '07, Gold Coast, QLD, 1...
Current DNA compression algorithms rely on finding repetitions within the DNA sequence so that simil...
DNA Sequence Compression can be achieved through exploiting the intra-sequence and inter-sequence si...
The increasing volume of biological data requires finding new ways to save these data in genetic ban...
Current DNA compression algorithms work by finding similar repeated regions within the DNA sequence ...
Relative compression, where a set of similar strings are compressed with respect to a reference stri...
Data Storage costs have an appreciable proportion of total cost in the creation and analysis of DNA ...
With increasing number of DNA sequences being discovered the problem of storing and using genomic da...
Rapid advancements in research in the field of DNA sequence discovery has led to a vast range of com...
compression algorithms: the case of approximate tandem repeats in DNA sequences E.Rivals14, O.Delgra...
DNA compression has been a subject of great interest since the availability of genomic databases. Al...
The exponential growth of high-throughput DNA sequence data has posed great challenges to genomic da...
In bio-sequence repositories and other applications, like for instance in the production of a Cd-rom...