Given a text string x of n symbols and an integer constant d, we consider the problem of finding, for any pair (y,z) of subwords of x, the tandem index associated with the pair, which is defined as the number of times that y and z occur in tandem (i.e., with no intermediate occurrence of either one of them) within a distance of d symbols of x. Although in principle there might be O(n^4) distinct subword pairs in x, it is seen that it suffices to consider a family of only O(n^2) such pairs, with the property that for any neglected pair (y^',z^') there exists a corresponding pair (y,z) contained in our family such that: (i) y^' is a prefix of y and z^' is a prefix of z; and (ii) the tandem index of (y^',z^') equals that of (y,z). The main con...
Background: The discovery of surprisingly frequent patterns is of paramount interest in bioinformati...
Gusfield D, Stoye J. Linear time algorithms for finding and representing all the tandem repeats in a...
We consider the problem of testing whether (a large part of) a given string X of length n over some ...
Given a text string x of n symbols and an integer constant d, we consider the problem of finding, fo...
AbstractGiven a text string x of n symbols and an integer constant d, we consider the problem of fin...
Given a textstring x of n symbols aJld an illtcgcr constant. d, we consider the problem of finding, ...
Given a textstring x of n symbols and an integer constant d, we consider the problem of finding, for...
We address the problem of extracting pairs of subwords (m1,m2) from a text string s of length n, suc...
Eliminating the possible redundancy from a set of candidate motifs occurring in an input string is f...
A tandem repeat is an occurrence of two adjacent identical substrings. In this paper, we introduce t...
AbstractWe study the problem of detecting all occurrences of (primitive) tandem repeats and tandem a...
Stoye J, Gusfield D. Simple and flexible detection of contiguous repeats using a suffix tree. Theore...
Colloque avec actes et comité de lecture. internationale.International audienceWe propose an algorit...
The computation of statistical indexes such as frequency counts, expected probabilities, and over/un...
We consider the problem of identifying tandem scattered subsequences within a string. Our algorithm ...
Background: The discovery of surprisingly frequent patterns is of paramount interest in bioinformati...
Gusfield D, Stoye J. Linear time algorithms for finding and representing all the tandem repeats in a...
We consider the problem of testing whether (a large part of) a given string X of length n over some ...
Given a text string x of n symbols and an integer constant d, we consider the problem of finding, fo...
AbstractGiven a text string x of n symbols and an integer constant d, we consider the problem of fin...
Given a textstring x of n symbols aJld an illtcgcr constant. d, we consider the problem of finding, ...
Given a textstring x of n symbols and an integer constant d, we consider the problem of finding, for...
We address the problem of extracting pairs of subwords (m1,m2) from a text string s of length n, suc...
Eliminating the possible redundancy from a set of candidate motifs occurring in an input string is f...
A tandem repeat is an occurrence of two adjacent identical substrings. In this paper, we introduce t...
AbstractWe study the problem of detecting all occurrences of (primitive) tandem repeats and tandem a...
Stoye J, Gusfield D. Simple and flexible detection of contiguous repeats using a suffix tree. Theore...
Colloque avec actes et comité de lecture. internationale.International audienceWe propose an algorit...
The computation of statistical indexes such as frequency counts, expected probabilities, and over/un...
We consider the problem of identifying tandem scattered subsequences within a string. Our algorithm ...
Background: The discovery of surprisingly frequent patterns is of paramount interest in bioinformati...
Gusfield D, Stoye J. Linear time algorithms for finding and representing all the tandem repeats in a...
We consider the problem of testing whether (a large part of) a given string X of length n over some ...