The D2 statistic is defined as the number of word matches of prespecified length k, with up to t mismatches, shared between two given sequences. This statistic finds its application in alignment-free comparisons of biological sequences. It has two main advantages over alignment-based methods for nucleotide and amino-acid sequence comparisons, such as BLAST (basic local alignment search tool). These are (i) D2 does not assume that homologous segments are contiguous, and (ii) the algorithm is computationally extremely fast, the runtime being proportional to the size of the sequences in the case of exact matches. This review article summarises results to date on determining the distributional properties of the D2 statistic for a range of biolo...
Motivation: Recently, a range of new statistics have become available for the alignment-free compari...
Given two sequences of length n over a finite alphabet A of size \A\ = d, the D2 statistic is the nu...
BACKGROUND: The number of k-words shared between two sequences is a simple and effcient alignment-f...
The D2 statistic is defined as the number of word matches of prespecified length k, with up to t mis...
k-word matches, the number of words of length k shared between two sequences, also known as the D2 s...
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
The D2 statistic, defined as the number of matches of words of some pre-specified length k, is a com...
The D2 statistic, defined as the number of matches of words of some pre-specified length k, is a com...
Alignment-free sequence comparison is widely used for comparing gene regulatory regions and for iden...
Word matches are often used in sequence comparison methods, either as a measure of sequence similari...
Word matches are often used in sequence comparison methods, either as a measure of sequence similari...
Next-generation sequencing (NGS) technologies have generated enormous amounts of shotgun read data, ...
When it comes to the analysis of biological sequences, alignment based methods have long been in the...
Motivation: Recently, a range of new statistics have become available for the alignment-free compari...
Given two sequences of length n over a finite alphabet A of size \A\ = d, the D2 statistic is the nu...
BACKGROUND: The number of k-words shared between two sequences is a simple and effcient alignment-f...
The D2 statistic is defined as the number of word matches of prespecified length k, with up to t mis...
k-word matches, the number of words of length k shared between two sequences, also known as the D2 s...
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
This study focuses on an alignment-free sequence comparison method: the number of words of length k ...
The D2 statistic, defined as the number of matches of words of some pre-specified length k, is a com...
The D2 statistic, defined as the number of matches of words of some pre-specified length k, is a com...
Alignment-free sequence comparison is widely used for comparing gene regulatory regions and for iden...
Word matches are often used in sequence comparison methods, either as a measure of sequence similari...
Word matches are often used in sequence comparison methods, either as a measure of sequence similari...
Next-generation sequencing (NGS) technologies have generated enormous amounts of shotgun read data, ...
When it comes to the analysis of biological sequences, alignment based methods have long been in the...
Motivation: Recently, a range of new statistics have become available for the alignment-free compari...
Given two sequences of length n over a finite alphabet A of size \A\ = d, the D2 statistic is the nu...
BACKGROUND: The number of k-words shared between two sequences is a simple and effcient alignment-f...