Alignment-free distance measures are generally less accurate but more efficient than traditional alignment-based metrics. In the context of genome sequence analysis, the efficiency gain is often so substantial that it outweighs the loss in accuracy. However, a further disadvantage of alignment-free distances is that their relationship to evolutionary events such as substitutions is generally unknown. We have therefore derived an estimator of the number of substitutions per site between two unaligned DNA sequences, K-r. Simulations show that this estimator works well with "ideal" data. We compare K-r to two alternative alignment-free distances: a k-tuple distance and a measure of relative entropy based on average common substring length. A...
This repository contains 24,000 pairs of nucleotide sequences (and associated parameters) that have ...
BACKGROUND: Existing sequence alignment algorithms use heuristic scoring schemes based on biological...
Motivation: A standard approach to classifying sets of genomes is to calculate their pairwise distan...
Alignment-free distance measures are generally less accurate but more efficient than traditional ali...
Inferring evolutionary relationships based on comparative analysis of genomic data remains a fundame...
Motivation: Genome comparison is central to contemporary genomics and typically relies on sequence a...
We study the number Nk of length-k word matches between pairs of evolutionarily related DNA sequence...
Phylogenetics and population genetics are central disciplines in evolutionary biology. Both are base...
Alignment-free methods are increasingly used to calculate evolutionary distances between DNA and pro...
We have recently developed a distance metric for efficiently estimating the number of substitutions ...
Alignment-free methods are increasingly used to estimate distances between DNA...
based on it has shown promising results. alignments. Our main result uses algorithmic (Kolmogorov) ...
2011-10-16 Phylogenetic tree reconstruction is important for the understanding of the evolutionary hi...
The genetic distance between biological sequences is a fundamental quantity in molecular evolution. ...