Word match counts have traditionally been proposed as an alignment-free measure of similarity for biological sequences. The D2 statistic, which simply counts the number of exact word matches between two sequences, is a useful test bed for developing rigorous mathematical results, which can then be extended to more biologically useful measures. The distributional properties of the D2 statistic under the null hypothesis of identically and independently distributed letters have been studied extensively, but no comprehensive study of the D2 distribution for biologically more realistic higher-order Markovian sequences exists. Here we derive exact formulas for the mean and variance of the D2 statistic for Markovian sequences of any order, and de...
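As a rough illustration of the word-match count described above, the following minimal sketch computes D2 as the number of position pairs (i, j) at which the two sequences share the same word. The function names `kmer_counts` and `d2` and the word-length parameter `k` are assumptions made for this example, not taken from the paper. The sketch counts overlapping words and exact matches only.

```python
from collections import Counter


def kmer_counts(seq: str, k: int) -> Counter:
    """Count every overlapping word of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def d2(seq_a: str, seq_b: str, k: int) -> int:
    """Number of exact k-word matches between seq_a and seq_b.

    Each pair of positions (i, j) with seq_a[i:i+k] == seq_b[j:j+k]
    contributes one match, so the total is the sum over words w of
    count_a(w) * count_b(w).
    """
    counts_a = kmer_counts(seq_a, k)
    counts_b = kmer_counts(seq_b, k)
    # A Counter returns 0 for words that are absent from seq_b.
    return sum(n * counts_b[word] for word, n in counts_a.items())


if __name__ == "__main__":
    print(d2("ACGT", "ACGT", 2))  # AC, CG and GT each match once: 3
```

Counting words first and then multiplying the counts avoids comparing every pair of positions directly, which matters once the sequences are long.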
BACKGROUND: In bioinformatics it is common to search for a pattern of interest...
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
BACKGROUND: In bioinformatics it is common to search for a pattern of interest...
The D2 statistic, which counts the number of word matches between two given sequences, has long been...
Word match counts have traditionally been proposed as an alignment-free measure of similarity for bi...
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
Word matches are often used in sequence comparison methods, either as a measure of sequence similari...
Word matches are often used in sequence comparison methods, either as a measure of sequence similari...
Given two sequences over a finite alphabet L, the D₂ statistic is the number of m-letter word match...
In this paper, we give an overview about the different results existing on the...
The D2 statistic, defined as the number of matches of words of some pre-specified length k, is a com...
In the following, an overview is given on statistical and probabilistic propert...
Given two sequences of length n over a finite alphabet A of size |A| = d, the D2 statistic is the nu...
BACKGROUND: In bioinformatics it is common to search for a pattern of interest...
BACKGROUND: In bioinformatics it is common to search for a pattern of interest...
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
BACKGROUND: In bioinformatics it is common to search for a pattern of interest...