The recently introduced long-read sequencing technologies (such as Pacific Biosciences or Oxford Nanopore) have substantially improved genome assemblies of many organisms, including the human reference genome. The technologies are, however, facing the challenge of high read errors. In this dissertation, we describe multiple algorithms for assembly and analysis of long-read sequencing data. First, we introduce the ABruijn algorithm for long-read assembly that bypasses the expensive read error-correction step by identifying reliable k-mers in reads. We then describe the Flye package, that combines ABruijn with a new repeat graph approach that accurately resolves the genomic structure. Finally, we extend Flye to the assembly of complex metagen...
Not AvailableCurrent de-novo assemblers are unable to effectively use the long-read sequencing data ...
Like a jigsaw puzzle with large pieces, a genome sequenced with long reads is easier to assemble. Ho...
De novo genome assembly is cornerstone to modern genomics studies. It is also a useful method for st...
Genome assembly is the problem of reconstructing genomes from DNA sequence reads. Even the best asse...
Abstract. Genome assembly using high throughput data with short reads, arguably, remains an unresolv...
The recent breakthroughs in assembling long error-prone reads were based on the overlap-layout-conse...
We study data-efficient and also practical de-novo genome assembly algorithm. Due to the advancement...
In the last decade, sequencing technology has progressed rapidly, leading to much faster and cheaper...
International audienceDNA sequencing data continue to progress toward longer reads with increasingly...
Genome sequence assembly presents a fascinating and frequently-changing challenge. As DNA sequencing...
Abstract While long-read sequencing allows for the complete assembly of bacterial genomes, long-read...
National audienceThird-generation long-read sequencing technologies tackle the repeat problem in gen...
MotivationWith advances in sequencing technology, it has become faster and cheaper to obtain short-r...
Background: Data sets from long-read sequencing platforms (Oxford Nanopore Technologies and Pacific ...
Despite their accuracy, next-generation DNA sequencing technologies have limited utility in analyzin...
Not AvailableCurrent de-novo assemblers are unable to effectively use the long-read sequencing data ...
Like a jigsaw puzzle with large pieces, a genome sequenced with long reads is easier to assemble. Ho...
De novo genome assembly is cornerstone to modern genomics studies. It is also a useful method for st...
Genome assembly is the problem of reconstructing genomes from DNA sequence reads. Even the best asse...
Abstract. Genome assembly using high throughput data with short reads, arguably, remains an unresolv...
The recent breakthroughs in assembling long error-prone reads were based on the overlap-layout-conse...
We study data-efficient and also practical de-novo genome assembly algorithm. Due to the advancement...
In the last decade, sequencing technology has progressed rapidly, leading to much faster and cheaper...
International audienceDNA sequencing data continue to progress toward longer reads with increasingly...
Genome sequence assembly presents a fascinating and frequently-changing challenge. As DNA sequencing...
Abstract While long-read sequencing allows for the complete assembly of bacterial genomes, long-read...
National audienceThird-generation long-read sequencing technologies tackle the repeat problem in gen...
MotivationWith advances in sequencing technology, it has become faster and cheaper to obtain short-r...
Background: Data sets from long-read sequencing platforms (Oxford Nanopore Technologies and Pacific ...
Despite their accuracy, next-generation DNA sequencing technologies have limited utility in analyzin...
Not AvailableCurrent de-novo assemblers are unable to effectively use the long-read sequencing data ...
Like a jigsaw puzzle with large pieces, a genome sequenced with long reads is easier to assemble. Ho...
De novo genome assembly is cornerstone to modern genomics studies. It is also a useful method for st...