Abstract. A “semi-probabilistic ” alignment algorithm which combines ideas from Smith-Waterman and probabilistic alignment is proposed and studied in detail. It is predicted that the score statistics of this “hybrid ” algorithm is of the universal Gumbel form, with the key Gumbel parameter λ taking on a fixed asymptotic value for a wide variety of scoring parameters. We have also characterized the “extremal ensemble”, i.e., the collection of sequence pairs exhibiting similarities that a given scoring system is most sensitive to. Based on this extremal ensemble, a simple recipe for the computation of the “relative entropy”, and from it the correction to λ due to finite sequence length is also given. This allows us to assign p-values to the a...
Motivation: Although pairwise sequence alignment is essential in comparative genomic sequence analys...
The Smith–Waterman algorithm yields a single alignment, which, albeit optimal, can be strongly affec...
A heuristic approximation to the score distribution of gapped alignments in the logarithmic domain i...
We looked at various alignment algorithms with different scoring schemes. We argued that the score o...
The search for similarity between two biological sequences lies at the core of many applications in ...
International audienceAlignment algorithms usually rely on simplified models of gaps for computation...
The statistical significance of gapped local align-ments is characterized by analyzing the extremal ...
The statistical properties of local alignment algorithms with gaps are analyzed theoretically for uu...
International audienceWe propose a new method to approximate the signi cativity of gapped local sequ...
A simple general approximation for the distribution of gapped local alignment scores is presented, s...
In order to assess the significance of sequence alignments it is crucial to know the distribution of...
We study the problem of similarity detection by sequence alignment with gaps, using a recently estab...
A heuristic approximation to the score distribution of gapped alignments in the logarithmic domain i...
A simple general approximation for the distribution of gapped local alignment scores is presented, s...
Classic alignment algorithms utilize scoring functions which maximize similarity or minimize edit di...
Motivation: Although pairwise sequence alignment is essential in comparative genomic sequence analys...
The Smith–Waterman algorithm yields a single alignment, which, albeit optimal, can be strongly affec...
A heuristic approximation to the score distribution of gapped alignments in the logarithmic domain i...
We looked at various alignment algorithms with different scoring schemes. We argued that the score o...
The search for similarity between two biological sequences lies at the core of many applications in ...
International audienceAlignment algorithms usually rely on simplified models of gaps for computation...
The statistical significance of gapped local align-ments is characterized by analyzing the extremal ...
The statistical properties of local alignment algorithms with gaps are analyzed theoretically for uu...
International audienceWe propose a new method to approximate the signi cativity of gapped local sequ...
A simple general approximation for the distribution of gapped local alignment scores is presented, s...
In order to assess the significance of sequence alignments it is crucial to know the distribution of...
We study the problem of similarity detection by sequence alignment with gaps, using a recently estab...
A heuristic approximation to the score distribution of gapped alignments in the logarithmic domain i...
A simple general approximation for the distribution of gapped local alignment scores is presented, s...
Classic alignment algorithms utilize scoring functions which maximize similarity or minimize edit di...
Motivation: Although pairwise sequence alignment is essential in comparative genomic sequence analys...
The Smith–Waterman algorithm yields a single alignment, which, albeit optimal, can be strongly affec...
A heuristic approximation to the score distribution of gapped alignments in the logarithmic domain i...