The growing volume of generated DNA sequencing data makes the problem of its long-term storage increasingly important. In the first chapter we present ReCoil - an I/O efficient external memory algorithm designed for compression of very large datasets of short readsDNA data. Typically each position of DNA sequence is covered by multiple reads of a short read dataset and our algorithm makes use of resulting redundancy to achieve high compression rate. While compression based on encoding mismatches between the dataset and a similar reference can yield high compression rate, good quality reference sequence may be unavailable. Instead, ReCoil's compression is based on encoding the differences between similar or overlapping reads. As such reads m...
The storage, manipulation, and transfer of the large amounts of data produced by high-throughput seq...
High-throughput sequencing enables basic and translational biology to query the mechanics of both li...
Recent advances in DNA sequencing technology have dramatically increased the scale and scope of DNA ...
The growing volume of generated DNA sequencing data makes the problem of its long-term storage incre...
Abstract The growing volume of generated DNA sequencing data makes the problem of its ...
The high throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce...
Over the past years, high-throughput sequencing (HTS) has become an invaluable method of investigati...
BackgroundHigh-throughput sequencing (HTS) technologies play important roles in the life sciences by...
Motivation: Recently a number of programs have been proposed for mapping short reads to a reference ...
Motivation: Recently, a number of programs have been proposed for mapping short reads to a reference...
2011-11-02The breakthrough of second-generation sequencing has opened the door for many applications...
Next Generation Sequencing machines are generating mil-lions of short DNA sequences (reads) everyday...
Graduation date: 2012Within the past several years the technology of high-throughput sequencing has ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Current common storage media has limited ability to store data with present data explosion trends, w...
The storage, manipulation, and transfer of the large amounts of data produced by high-throughput seq...
High-throughput sequencing enables basic and translational biology to query the mechanics of both li...
Recent advances in DNA sequencing technology have dramatically increased the scale and scope of DNA ...
The growing volume of generated DNA sequencing data makes the problem of its long-term storage incre...
Abstract The growing volume of generated DNA sequencing data makes the problem of its ...
The high throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce...
Over the past years, high-throughput sequencing (HTS) has become an invaluable method of investigati...
BackgroundHigh-throughput sequencing (HTS) technologies play important roles in the life sciences by...
Motivation: Recently a number of programs have been proposed for mapping short reads to a reference ...
Motivation: Recently, a number of programs have been proposed for mapping short reads to a reference...
2011-11-02The breakthrough of second-generation sequencing has opened the door for many applications...
Next Generation Sequencing machines are generating mil-lions of short DNA sequences (reads) everyday...
Graduation date: 2012Within the past several years the technology of high-throughput sequencing has ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Current common storage media has limited ability to store data with present data explosion trends, w...
The storage, manipulation, and transfer of the large amounts of data produced by high-throughput seq...
High-throughput sequencing enables basic and translational biology to query the mechanics of both li...
Recent advances in DNA sequencing technology have dramatically increased the scale and scope of DNA ...