Several biological problems require the identification of regions in a sequence where some feature occurs within a target density range: examples include the location of GC-rich regions, identification of CpG islands, and sequence matching. Mathematically, this corresponds to searching a string of 0's and 1's for a substring whose relative proportion of 1's lies between given lower and upper bounds. We consider the algorithmic problem of locating the longest such substring, as well as other related problems (such as finding the shortest substring or a maximal set of disjoint substrings). For locating the longest such substring, we develop an algorithm that runs in O(n) time, improving upon the previous best-known O(n log n) result. For th...
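To make the problem statement concrete, the following is a minimal brute-force sketch of the longest-substring variant. It is not the O(n) algorithm of the abstract: it simply tries every window, longest first, using prefix sums, and so takes O(n^2) time. The function name `longest_density_substring` and its parameters `bits`, `lo`, and `hi` are hypothetical names chosen for illustration, and the bounds are assumed to be inclusive.

```python
from fractions import Fraction
from itertools import accumulate


def longest_density_substring(bits, lo, hi):
    """Return (start, end) such that bits[start:end] is a longest substring
    whose proportion of 1's lies in [lo, hi], or None if no such substring exists.

    Brute-force reference only (O(n^2)); the bounds are treated as inclusive,
    which is an assumption made for this sketch.
    """
    n = len(bits)
    # prefix[i] holds the number of 1's in bits[:i].
    prefix = [0] + list(accumulate(int(c) for c in bits))
    # Exact rational arithmetic avoids floating-point rounding at the bounds.
    lo, hi = Fraction(lo), Fraction(hi)
    # Try window lengths from longest to shortest; the first hit is a longest one.
    for length in range(n, 0, -1):
        for start in range(n - length + 1):
            ones = prefix[start + length] - prefix[start]
            # Compare counts rather than ratios: lo <= ones / length <= hi.
            if lo * length <= ones <= hi * length:
                return (start, start + length)
    return None


if __name__ == "__main__":
    # Longest window of "0001110" in which at least half the symbols are 1's.
    print(longest_density_substring("0001110", Fraction(1, 2), 1))
```

Scanning lengths in decreasing order means the leftmost longest window is returned; a quadratic checker like this is mainly useful for validating a faster implementation on small inputs.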
Algorithms for finding similar, or highly conserved, regions in a group of sequences are at ...
Background: The discovery of surprisingly frequent patterns is of paramount interest in bioinformati...
We study the problem of computing optimal spaced seeds for identifying homologous coding DNA sequen...
We study two fundamental problems concerning the search for interesting regions in sequences: (i) give...
We study an abstract optimization problem arising from biomolecular sequence analysis. For a...
We study the problem of finding a specific density subsequence of a sequence arising from ...
We consider the constrained longest common subsequence problem with an arbitrary set of input string...
Algorithms for finding similar, or highly conserved, regions in a group of sequences are at the core...
Given a sequence of pairs of numbers (a_i, l_i), i = 1, 2, ..., n, with l_i > 0, and another ...
Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law corre...