Sequence alignments are the foundations of life science research, but most innovation so far focuses on optimal alignments, while information derived from suboptimal solutions is ignored. We argue that one optimal alignment per pairwise sequence comparison is a reasonable approximation when dealing with very similar sequences but is insufficient when exploring the biodiversity of the protein universe at tree-of-life scale. To overcome this limitation, we introduce pairwise alignment-safety to uncover the amino acid positions robustly shared across all suboptimal solutions. We implement EMERALD, a software library for alignment-safety inference, and apply it to 400k sequences from the SwissProt database.

Peer reviewed
For as long as biologists have been computing alignments of sequences, the question of what values t...
Most sequence alignment tools can successfully align protein sequences with higher levels of sequenc...
The estimation of multiple sequence alignments of protein sequences is a basic step in many bioinfor...
Publisher Copyright: © 2023, The Author(s).
Abstract Background While the pairwise alignments produced by sequence similarity searches are a pow...
Motivation: Alignments are correspondences between sequences. How reliable are alignments of amino a...
Background: Guide-trees are used as part of an essential heuristic to enable the calculation of mult...
Abstract Background There have been many algorithms and software programs implemented for the infere...
Motivation: Protein sequence alignment plays a critical role in computational biology as it is an in...
We present a method for attributing a measure of reliability to a residue pair in an optimal alignme...
A major computational challenge in the genomic era is annotating structure/function to the vast quan...
Proteins are macromolecules that play a pivotal role in biological processes in living organisms. St...
Sequence alignment has become one of the essential bioinformatics tools in biomedical research. Exis...
Motivation: Within bioinformatics, the textual alignment of amino acid sequences has long dominated ...
Abstract Background Protein sequence alignment analyses have become a crucial step for many bioinfor...