We are rapidly approaching the point where we have sequenced millions of human genomes. There is a pressing need for new data structures to store raw sequencing data and efficient algorithms for population scale analysis. Current reference based data formats do not fully exploit the redundancy in population sequencing nor take advantage of shared genetic variation. In recent years, the Burrows-Wheeler transform (BWT) and FM-index have been widely employed as a full text searchable index for read alignment and de novo assembly. We introduce the concept of a population BWT and use it to store and index the sequencing reads of 2,705 samples from the 1000 Genomes Project. A key feature is that as more genomes are added,...
The human genome reference (HGR) completion marked the genomics era beginning, yet despite its utili...
Motivation: Genomic repositories are rapidly growing, as witnessed by the 1000 Genomes or the UK10K ...
While short read aligners, which predominantly use the FM-index, are able to easily index one or a f...
We are rapidly approaching the point where we have sequenced millions of human genomes. There is a p...
We are rapidly approaching the point where we have sequenced millions of human genomes. There is a p...
We are rapidly approaching the point where we have sequenced millions of human genomes. There is a p...
Cox AJ, Bauer MJ, Jakobi T, Rosone G. Large-scale compression of genomic sequence databases with the...
Motivation: The Burrows–Wheeler transform (BWT) is the foundation of many algorithms for compression...
<p>We explore the use of BWTs to store and compress the<br>raw sequencing reads of 26 human populati...
Motivation: Genomic repositories are rapidly growing, as witnessed by the 1000 Genomes or the UK10K ...
International audienceCompressed full-text indexes are one of the main success stories of bioinforma...
International audienceCompressed full-text indexes are one of the main success stories of bioinforma...
International audienceCompressed full-text indexes are one of the main success stories of bioinforma...
Abstract. With the recent advances in DNA sequencing, it is now possible to have complete genomes of...
Short-read aligners predominantly use the FM-index, which is easily able to index one or a few human...
The human genome reference (HGR) completion marked the genomics era beginning, yet despite its utili...
Motivation: Genomic repositories are rapidly growing, as witnessed by the 1000 Genomes or the UK10K ...
While short read aligners, which predominantly use the FM-index, are able to easily index one or a f...
We are rapidly approaching the point where we have sequenced millions of human genomes. There is a p...
We are rapidly approaching the point where we have sequenced millions of human genomes. There is a p...
We are rapidly approaching the point where we have sequenced millions of human genomes. There is a p...
Cox AJ, Bauer MJ, Jakobi T, Rosone G. Large-scale compression of genomic sequence databases with the...
Motivation: The Burrows–Wheeler transform (BWT) is the foundation of many algorithms for compression...
<p>We explore the use of BWTs to store and compress the<br>raw sequencing reads of 26 human populati...
Motivation: Genomic repositories are rapidly growing, as witnessed by the 1000 Genomes or the UK10K ...
International audienceCompressed full-text indexes are one of the main success stories of bioinforma...
International audienceCompressed full-text indexes are one of the main success stories of bioinforma...
International audienceCompressed full-text indexes are one of the main success stories of bioinforma...
Abstract. With the recent advances in DNA sequencing, it is now possible to have complete genomes of...
Short-read aligners predominantly use the FM-index, which is easily able to index one or a few human...
The human genome reference (HGR) completion marked the genomics era beginning, yet despite its utili...
Motivation: Genomic repositories are rapidly growing, as witnessed by the 1000 Genomes or the UK10K ...
While short read aligners, which predominantly use the FM-index, are able to easily index one or a f...