We propose an approximate distribution for the gapped local score of a two-sequence comparison. Our method combines an adapted scoring scheme that accounts for gaps with an approximate distribution of the ungapped local score of two independent sequences of i.i.d. random variables. The new scoring scheme is defined on h-tuples of the sequences, using the gapped global score. The influence of h and the accuracy of the p-value are studied numerically and compared with the p-values obtained by BLAST. The numerical experiments indicate that our approximate p-values outperform those of BLAST, particularly for short sequences, both simulated and real.
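The scoring scheme described in the abstract can be illustrated with a minimal sketch. It is one possible reading, not the authors' implementation: it assumes non-overlapping h-tuples, a linear gap penalty, and simple match/mismatch scores, and the function names (`global_score`, `tuples`, `tuple_local_score`) are hypothetical. It computes only the score, not the approximate p-value.

```python
def global_score(u, v, match=1, mismatch=-1, gap=-2):
    """Gapped global alignment score (Needleman-Wunsch, linear gap penalty)."""
    prev = [j * gap for j in range(len(v) + 1)]
    for i in range(1, len(u) + 1):
        cur = [i * gap]
        for j in range(1, len(v) + 1):
            sub = match if u[i - 1] == v[j - 1] else mismatch
            cur.append(max(prev[j - 1] + sub, prev[j] + gap, cur[j - 1] + gap))
        prev = cur
    return prev[-1]


def tuples(seq, h):
    """Cut seq into non-overlapping h-tuples (assumption); a short tail is dropped."""
    return [seq[i:i + h] for i in range(0, len(seq) - h + 1, h)]


def tuple_local_score(a, b, h, **scoring):
    """Ungapped local score over h-tuples, where each pair of tuples is
    scored by its gapped global alignment score."""
    ta, tb = tuples(a, h), tuples(b, h)
    best = 0
    prev = [0] * (len(tb) + 1)
    for x in ta:
        cur = [0]
        for j, y in enumerate(tb, 1):
            # Extend the diagonal segment, or restart at 0 (no gaps between tuples).
            cur.append(max(0, prev[j - 1] + global_score(x, y, **scoring)))
            best = max(best, cur[-1])
        prev = cur
    return best
```

With h = 1 this sketch reduces to the usual ungapped local score; larger h lets gaps occur inside each tuple while the alignment of tuples themselves stays ungapped.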
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
We proposed a simple formula to assess the statistical significance of homologous segments found in ...
A scoring scheme is presented to measure the similarity score between two biological sequences, wher...
A simple general approximation for the distribution of gapped local alignment scores is presented, s...
We propose a new method to approximate the significance of gapped local sequ...
A heuristic approximation to the score distribution of gapped alignments in the logarithmic domain i...
The local score of a DNA sequence, also called Smith and Waterman score \citea...
The search for similarity between two biological sequences lies at the core of many applications in ...
Assume that two sequences from a finite alphabet are optimally aligned according to a scoring system...
Using random walk theory, we first establish explicitly the exact distribution of the maximal partia...
The statistical significance of gapped local alignments is characterized by analyzing the extremal ...