Abstract: We present a complete analysis of the statistics of the number of occurrences of a regular expression pattern in a random text. This covers the “motifs” widely used in computational biology. Our approach is based on: (i) classical constructive results in automata and formal language theory; (ii) analytic combinatorics, which is used to derive asymptotic properties from generating functions; (iii) computer algebra, used to determine generating functions explicitly, analyse them, and extract coefficients efficiently. We provide constructions for overlapping or non-overlapping matches of a regular expression. A companion implementation produces: multivariate generating functions for the statistics under study; a fast computa...
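As an illustration of the automaton-based idea in step (i), the following is a minimal sketch, not the companion implementation mentioned in the abstract. It assumes the simplest special case: the pattern is a single word, the text is a memoryless (Bernoulli) source, and overlapping matches are counted. The function names `kmp_automaton` and `occurrence_distribution` are hypothetical, and a direct dynamic program over (automaton state, match count) stands in for generating-function extraction.

```python
from fractions import Fraction


def kmp_automaton(word, alphabet):
    """Build a DFA whose state is the length of the longest prefix of
    `word` that is a suffix of the text read so far."""
    m = len(word)
    delta = [dict() for _ in range(m + 1)]
    for state in range(m + 1):
        for ch in alphabet:
            # Text suffix that may still match a prefix of `word`.
            candidate = word[:state] + ch
            k = min(m, len(candidate))
            while k > 0 and candidate[-k:] != word[:k]:
                k -= 1
            delta[state][ch] = k
    return delta


def occurrence_distribution(word, probs, n):
    """Exact distribution of the number of (overlapping) occurrences of
    `word` in a random text of length n, where letters are drawn
    independently with probabilities `probs` (assumed Bernoulli source)."""
    m = len(word)
    delta = kmp_automaton(word, list(probs))
    # dist maps (automaton state, occurrences so far) to a probability.
    dist = {(0, 0): Fraction(1)}
    for _ in range(n):
        nxt = {}
        for (state, count), p in dist.items():
            for ch, q in probs.items():
                s = delta[state][ch]
                # Reaching state m means one more occurrence just ended.
                c = count + (1 if s == m else 0)
                nxt[(s, c)] = nxt.get((s, c), Fraction(0)) + p * q
        dist = nxt
    # Forget the automaton state and keep only the match count.
    result = {}
    for (_, c), p in dist.items():
        result[c] = result.get(c, Fraction(0)) + p
    return result


if __name__ == "__main__":
    uniform = {"a": Fraction(1, 2), "b": Fraction(1, 2)}
    print(occurrence_distribution("aa", uniform, 3))
```

Exact rational arithmetic (`Fraction`) is used so that the probabilities sum to exactly 1. A general regular expression would require a DFA built from the expression rather than this single-word automaton.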
International peer-reviewed conference with proceedings; international audience. We study the followin...
This monograph synthesizes several studies in average-case analysis of algorithms and in computation...
Consider a given pattern H and a random text T generated by a Markovian source. We study the frequen...
We present algorithms for the exact computation of the prob...
Abstract: Evaluation of the expected frequency of occurrences of a given set of patterns in a DNA sequ...
Consider a given pattern H and a random text T generated by a Markovian source of any order. We stud...
This work investigates frequency distributions of strings within a text. The mathematical derivation...
In previous work [10], we considered algorithms related to the statistics of matches with words and...
Since the early stages of bioinformatics, substrings have played a crucial role in the search and discovery o...
International audience. It is well known that, under some aperiodicity and irreducibility conditions, ...
International audience. The analysis of pattern occurrences has numerous applications, in particular i...
In pattern matching algorithms, two characteristic parameters play an important role: the number o...
The motif discovery problem consists of uncovering exceptional patterns (called motifs) in sets of ...
Abstract: We study the following problem: given a biological sequence S, a multiset M of motifs and an...