Abstract Genome assembly is one of the most important problems in computational genomics. Here, we suggest addressing an issue that arises in homology-based scaffolding, that is, when linking and ordering contigs to obtain larger pseudo-chromosomes by means of a second incomplete assembly of a related species. The idea is to use alignments of binned regions in one contig to find the most homologous contig in the other assembly. We show that ordering the contigs of the other assembly can be expressed by a new string problem, the longest run subsequence problem (LRS). We show that LRS is NP-hard and present reduction rules and two algorithmic approaches that, together, are able to solve large instances of LRS to provable optimality. All data ...
International audienceNext-generation de novo short reads assemblers typically use the following str...
Scaffolding step in the genome assembly aims to determine the order and the orientation of a huge nu...
International audienceScaffolding is the final step in assembling Next Generation Sequencing data, i...
Genome assembly is one of the most important problems in computational genomics. Here, we suggest ad...
Genome assembly is one of the most important problems in computational genomics. Here, we suggest ad...
International audienceThis work focuses simultaneously on both the scaffolding and gap filling phase...
Abstract. Long DNA sequences have to be cut using restriction en-zymes into small fragments whose le...
Genome assembly asks to reconstruct an unknown string from many shorter substrings of it. Even thoug...
Generating a defined set of genetic constructs within a large combinatorial space provides a powerfu...
Abstract—We describe a set of computational problems motivated by certain analysis tasks in genome r...
In this paper we study the problem of sorting unsigned genomes by double-cut-and-join operations, wh...
Motivation: Intimately tied to assembly quality is the complexity of the de Bruijn graph built by th...
Genetic maps order genetic markers along chromosomes. They are, for instance, extensively used in ma...
International audienceWe develop a method for solving genome scaffolding as a problem of finding a l...
Information in various applications is often expressed as character sequences over a finite alphabet...
International audienceNext-generation de novo short reads assemblers typically use the following str...
Scaffolding step in the genome assembly aims to determine the order and the orientation of a huge nu...
International audienceScaffolding is the final step in assembling Next Generation Sequencing data, i...
Genome assembly is one of the most important problems in computational genomics. Here, we suggest ad...
Genome assembly is one of the most important problems in computational genomics. Here, we suggest ad...
International audienceThis work focuses simultaneously on both the scaffolding and gap filling phase...
Abstract. Long DNA sequences have to be cut using restriction en-zymes into small fragments whose le...
Genome assembly asks to reconstruct an unknown string from many shorter substrings of it. Even thoug...
Generating a defined set of genetic constructs within a large combinatorial space provides a powerfu...
Abstract—We describe a set of computational problems motivated by certain analysis tasks in genome r...
In this paper we study the problem of sorting unsigned genomes by double-cut-and-join operations, wh...
Motivation: Intimately tied to assembly quality is the complexity of the de Bruijn graph built by th...
Genetic maps order genetic markers along chromosomes. They are, for instance, extensively used in ma...
International audienceWe develop a method for solving genome scaffolding as a problem of finding a l...
Information in various applications is often expressed as character sequences over a finite alphabet...
International audienceNext-generation de novo short reads assemblers typically use the following str...
Scaffolding step in the genome assembly aims to determine the order and the orientation of a huge nu...
International audienceScaffolding is the final step in assembling Next Generation Sequencing data, i...