Abstract. For as long as biologists have been computing alignments of sequences, the question of what values to use for scoring substitutions and gaps has persisted. While some choices for substitution scores are now common, largely due to convention, there is no standard for choosing gap penalties. An objective way to resolve this question is to learn the appropriate values by solving the Inverse String Alignment Problem: given examples of correct alignments, find parameter values that make the examples optimal-scoring alignments of their strings. We present a new polynomial-time algorithm for Inverse String Alignment that is simple to implement, fast in practice, and for the first time can learn hundreds of parameters simultaneously. ...
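To make the problem statement concrete, the following minimal Python sketch checks the condition that Inverse String Alignment asks for: whether a given example alignment is optimal-scoring under a candidate parameter choice. It is an illustration under simplifying assumptions, not the algorithm of the paper: it uses only two hypothetical parameters (`mismatch` and a per-column `gap` cost), treats alignment as cost minimization, and all function names are invented for this example.

```python
def alignment_cost(row_a: str, row_b: str, mismatch: float, gap: float) -> float:
    """Cost of a given alignment, written as two equal-length rows with '-' for gaps."""
    if len(row_a) != len(row_b):
        raise ValueError("alignment rows must have equal length")
    cost = 0.0
    for x, y in zip(row_a, row_b):
        if x == "-" or y == "-":
            cost += gap          # column containing a gap
        elif x != y:
            cost += mismatch     # substitution column
    return cost


def optimal_cost(a: str, b: str, mismatch: float, gap: float) -> float:
    """Minimum alignment cost of strings a and b (standard dynamic program)."""
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        cur = [i * gap] + [0.0] * len(b)
        for j in range(1, len(b) + 1):
            sub = prev[j - 1] + (0.0 if a[i - 1] == b[j - 1] else mismatch)
            cur[j] = min(sub, prev[j] + gap, cur[j - 1] + gap)
        prev = cur
    return prev[len(b)]


def example_is_optimal(row_a: str, row_b: str, mismatch: float, gap: float,
                       tol: float = 1e-9) -> bool:
    """True if the example alignment costs no more than an optimal alignment
    of its own strings under the candidate parameters (assumed cost model)."""
    a = row_a.replace("-", "")
    b = row_b.replace("-", "")
    example = alignment_cost(row_a, row_b, mismatch, gap)
    return example <= optimal_cost(a, b, mismatch, gap) + tol


# The example alignment AC- / A-G uses two gap columns instead of one substitution.
print(example_is_optimal("AC-", "A-G", mismatch=1.0, gap=1.0))
print(example_is_optimal("AC-", "A-G", mismatch=3.0, gap=1.0))
```

Under this toy cost model, the same example alignment is suboptimal when a mismatch is cheap relative to two gaps and becomes optimal once the mismatch cost is raised. An inverse alignment method searches for parameter values that satisfy this check for every example at once.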
We present a new machine learning approach to the inverse parametric sequence alignment problem: giv...
The accuracy of an alignment between two protein sequences can be improved by including other detect...
Assume that two sequences from a finite alphabet are optimally aligned according to a scoring system...
Abstract. When aligning biological sequences, the choice of parameter values for the alignment scori...
We consider the inverse parametric sequence alignment problem, where a sequence alignment is given a...
An essential tool in biology is the alignment of multiple sequences. Biologists use multiple sequenc...
Abstract Background While the pairwise alignments produced by sequence similarity searches are a pow...
Accurate alignments of sequences are needed for many types of analyses. Aligned sequences might be t...
The level of conservation between two homologous sequences often varies among sequence regions; func...
Motivation: Protein sequence alignment plays a critical role in computational biology as it is an in...
Abstract Background While substitution matrices can readily be computed from reference alignments, i...
The problem of aligning multiple protein sequences is essential to many biological analyses, but mos...
The computation of protein sequence alignments is one of the most fundamental tasks in computational...
While the multiple sequence alignment output by an aligner strongly depends on the parameter values ...