Abstract—We describe a set of computational problems motivated by certain analysis tasks in genome resequencing. These are assembly problems for which multiple distinct sequences must be assembled, but where the relative positions of reads to be assembled are already known. This information is obtained from a common reference genome and is characteristic of resequencing experiments. The simplest variant of the problem aims at determining a minimum set of superstrings such that each sequenced read matches at least one superstring. We give an algorithm with time complexity OðNÞ, where N is the sum of the lengths of reads, substantially improving on previous algorithms for solving the same problem. We also examine the problem of finding the sm...
© The Author 2017. Published by Oxford University Press. All rights reserved. The application of adv...
We introduce a data structure called a superword array for finding quickly matches between DNA seque...
This paper studies several combinatorial problems arising from finding the conserved genes of two ge...
In the last decade, sequencing technology has progressed rapidly, leading to much faster and cheaper...
International audienceWith the increasing impact of genomics in life sciences, the inference of high...
Because of its connection to the well-known NP-complete shortest superstring combinatorial optimizat...
Background The development of sequencing technology increases the number of genomes being sequenced....
Despite the prodigious throughput of the sequencing instruments currently on the market, the assembl...
In this thesis we present algorithmic results for computational problems arising in two important ar...
Abstract. Long DNA sequences have to be cut using restriction en-zymes into small fragments whose le...
Abstract Genome assembly is one of the most important problems in computational genomics. Here, we s...
Recent studies in genomics have highlighted the significance of structural variation in deter-mining...
Whole genome shotgun assembly is the process of taking many short sequenced segments (reads) and rec...
One of the most computationally intensive tasks in computational biology is de novo genome assembly,...
One of the most computationally intensive tasks in computational biology is de novo genome assembly,...
© The Author 2017. Published by Oxford University Press. All rights reserved. The application of adv...
We introduce a data structure called a superword array for finding quickly matches between DNA seque...
This paper studies several combinatorial problems arising from finding the conserved genes of two ge...
In the last decade, sequencing technology has progressed rapidly, leading to much faster and cheaper...
International audienceWith the increasing impact of genomics in life sciences, the inference of high...
Because of its connection to the well-known NP-complete shortest superstring combinatorial optimizat...
Background The development of sequencing technology increases the number of genomes being sequenced....
Despite the prodigious throughput of the sequencing instruments currently on the market, the assembl...
In this thesis we present algorithmic results for computational problems arising in two important ar...
Abstract. Long DNA sequences have to be cut using restriction en-zymes into small fragments whose le...
Abstract Genome assembly is one of the most important problems in computational genomics. Here, we s...
Recent studies in genomics have highlighted the significance of structural variation in deter-mining...
Whole genome shotgun assembly is the process of taking many short sequenced segments (reads) and rec...
One of the most computationally intensive tasks in computational biology is de novo genome assembly,...
One of the most computationally intensive tasks in computational biology is de novo genome assembly,...
© The Author 2017. Published by Oxford University Press. All rights reserved. The application of adv...
We introduce a data structure called a superword array for finding quickly matches between DNA seque...
This paper studies several combinatorial problems arising from finding the conserved genes of two ge...