Sequence alignment underpins all of comparative genomics, yet it remains an incompletely solved problem. In particular, the statistical uncertainty within inferred alignments is often disregarded, while parametric or phylogenetic inferences are considered meaningless without confidence estimates. Here, we report on a theoretical and simulation study of pairwise alignments of genomic DNA at human–mouse divergence. We find that >15% of aligned bases are incorrect in existing whole-genome alignments, and we identify three types of alignment error, each leading to systematic biases in all algorithms considered. Careful modeling of the evolutionary process improves alignment quality; however, these improvements are modest compared with the remai...
Background: Inference of sequence homology is inherently an evolutionary question, dependent upon ev...
Computational biology is replete with high-dimensional (high-D) discrete prediction and inference pr...
We develop techniques to estimate the statistical significance of gap-free alignments between two ge...
Background: While most multiple sequence alignment programs expect that all or most of their...
Molecular evolutionary biology allows us to look into the past by analyzing sequences of amino acids...
Multiple sequence alignment (MSA) is the heart of comparative sequence analysis. Recent studies demo...
A method is described for performing global alignment of noncoding DNA sequences based on an evoluti...
Evolutionary studies usually use a two-step process to investigate sequence data. Step one estimates...
Phylogenetic reconstructions are essential in genomics data analyses and depend on accurate multiple...
Background: Molecular evolutionary studies of noncoding sequences rely on multiple alignments. Yet ...
Since the identification of DNA/RNA as genetic material, deciphering the code of life has been a maj...
New DNA sequencing technologies have achieved breakthroughs in throughput, at the expense of higher ...
For several decades, sequence alignment has been a widely used tool in bioinformatics. For instance,...
Background: Multiple Sequence Alignments (MSAs) are the starting point of molecular evolutio...