Sequence alignment underpins all of comparative genomics, yet it remains an incompletely solved problem. In particular, the statistical uncertainty within inferred alignments is often disregarded, while parametric or phylogenetic inferences are considered meaningless without confidence estimates. Here, we report on a theoretical and simulation study of pairwise alignments of genomic DNA at human-mouse divergence. We find that >15% of aligned bases are incorrect in existing whole-genome alignments, and we identify three types of alignment error, each leading to systematic biases in all algorithms considered. Careful modeling of the evolutionary process improves alignment quality; however, these improvements are modest compared with the remai...
Computational biology is replete with high-dimensional (high-D) discrete prediction and inference pr...
We explore several computational approaches to analyzing interspecies genomic sequence alignments, a...
For several decades, sequence alignment has been a widely used tool in bioinformatics. For instance,...
Sequence alignment underpins all of comparative genomics, yet it remains an incompletely solved prob...
Abstract Background While most multiple sequence alignment programs expect that all or most of their...
Molecular evolutionary biology allows us to look into the past by analyzing sequences of amino acids...
Multiple sequence alignment (MSA) is the heart of comparative sequence analysis. Recent studies demo...
We develop techniques to estimate the statistical significance of gap-free alignments between two ge...
A method is described for performing global alignment of noncoding DNA sequences based on an evoluti...
Abstract Background The flood of genomic data to help build and date the tree of life requires autom...
Phylogenetic reconstructions are essential in genomics data analyses and depend on accurate multiple...
Evolutionary studies usually use a two-step process to investigate sequence data. Step one estimates...
Background: Molecular evolutionary studies of noncoding sequences rely on multiple alignments. Yet ...
Since the identification of DNA/RNA as genetic material, deciphering the code of life has been a maj...
Abstract Background Multiple Sequence Alignments (MSAs) are the starting point of molecular evolutio...
Computational biology is replete with high-dimensional (high-D) discrete prediction and inference pr...
We explore several computational approaches to analyzing interspecies genomic sequence alignments, a...
For several decades, sequence alignment has been a widely used tool in bioinformatics. For instance,...
Sequence alignment underpins all of comparative genomics, yet it remains an incompletely solved prob...
Abstract Background While most multiple sequence alignment programs expect that all or most of their...
Molecular evolutionary biology allows us to look into the past by analyzing sequences of amino acids...
Multiple sequence alignment (MSA) is the heart of comparative sequence analysis. Recent studies demo...
We develop techniques to estimate the statistical significance of gap-free alignments between two ge...
A method is described for performing global alignment of noncoding DNA sequences based on an evoluti...
Abstract Background The flood of genomic data to help build and date the tree of life requires autom...
Phylogenetic reconstructions are essential in genomics data analyses and depend on accurate multiple...
Evolutionary studies usually use a two-step process to investigate sequence data. Step one estimates...
Background: Molecular evolutionary studies of noncoding sequences rely on multiple alignments. Yet ...
Since the identification of DNA/RNA as genetic material, deciphering the code of life has been a maj...
Abstract Background Multiple Sequence Alignments (MSAs) are the starting point of molecular evolutio...
Computational biology is replete with high-dimensional (high-D) discrete prediction and inference pr...
We explore several computational approaches to analyzing interspecies genomic sequence alignments, a...
For several decades, sequence alignment has been a widely used tool in bioinformatics. For instance,...