Abstract. Algorithms for finding similar, or highly conserved, regions in a group of sequences are at the core of many molecular biology problems. Assume that we are given n DNA sequences s1, …, sn. The Consensus Patterns problem, which has been widely studied in bioinformatics research, in its simplest form asks for a region of length L in each si and a median string s of length L such that the total Hamming distance from s to these regions is minimized. We show that the problem is NP-hard and give a polynomial time approximation scheme (PTAS) for it. We then present an efficient approximation algorithm for the consensus pattern problem under the original relative entropy measure. As an interesting application of our analysis, we further obt...
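To make the objective concrete, here is a minimal Python sketch of the simplest form of the problem described above. It is not the PTAS from the abstract: it is an exhaustive search that is only feasible for tiny inputs, and the function names (`hamming`, `column_majority`, `brute_force_consensus`) are hypothetical names chosen for illustration. It relies on one standard observation: once the regions are fixed, taking the most frequent letter in each column gives a median string that minimizes the total Hamming distance.

```python
from collections import Counter
from itertools import product
from typing import Sequence


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def column_majority(regions: Sequence[str]) -> str:
    """Median string for fixed regions: the most frequent letter per column."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*regions))


def total_cost(median: str, regions: Sequence[str]) -> int:
    """Total Hamming distance from the median string to all chosen regions."""
    return sum(hamming(median, r) for r in regions)


def brute_force_consensus(seqs: Sequence[str], L: int):
    """Try every choice of one length-L window per sequence (exponential time).

    Returns (cost, median, chosen_regions) for the best choice found.
    Assumes every sequence has length at least L.
    """
    windows = [[s[i:i + L] for i in range(len(s) - L + 1)] for s in seqs]
    best = None
    for choice in product(*windows):
        median = column_majority(choice)
        cost = total_cost(median, choice)
        if best is None or cost < best[0]:
            best = (cost, median, choice)
    return best


if __name__ == "__main__":
    # Toy input (made up for illustration): "ACGT" occurs in all three sequences.
    print(brute_force_consensus(["ACGTAC", "TTACGT", "GACGTT"], 4))
```

The search visits every combination of windows, so its running time grows exponentially with the number of sequences. That blow-up is consistent with the NP-hardness result stated in the abstract and is the reason an approximation scheme is of interest.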
The problem of discovering frequent arrangements of regions of high occurrence of one or more items ...
Abstract. Given a set of sequences, S, and degeneracy parameter, d, the Consensus Sequence problem a...
The LONGEST COMMON SUBSEQUENCE problem is examined from the point of view of parameterized computati...
Algorithms for finding similar, or highly conserved, regions in a group of sequences are at the core...
Abstract. Algorithms for finding similar, or highly conserved, regions in a group of sequences are at ...
Motivation: A consensus sequence for a family of related sequences is, as the name suggests, a seque...
In bioinformatics, finding conserved regions in genomic sequences remains a challenge not just...
We address the issue of structured motif inference. This problem is stated as ...
Abstract. With the exploding size of genome databases, it is becoming increasingly important to devi...
The paper describes an algorithm to compute a consensus sequence from a set of DNA sequences of appr...
The problem of discovering frequent poly-regions (i.e. regions of high occurrence of a set of items ...
A growing number of measures of sequence similarity is being based on some underlying notion of rela...
We address the difficult issue of structured motif inference. This problem is ...
Background: The discovery of surprisingly frequent patterns is of paramount interest in bioinformati...