We present an algorithm for the optimal alignment of sequences to genome graphs. It works by phrasing the edit distance minimization task as finding a shortest path on an implicit alignment graph. To find a shortest path, we instantiate the A* paradigm with a novel domain-specific heuristic function that accounts for the upcoming subsequence in the query to be aligned, resulting in a provably optimal alignment algorithm called AStarix. Experimental evaluation of AStarix shows that it is 1–2 orders of magnitude faster than state-of-the-art optimal algorithms on the task of aligning Illumina reads to reference genome graphs. Implementations and evaluations are available at https://github.com/eth-sri/astarix
Graph based non-linear reference structures such as variation graphs and colored de Bruijn graphs en...
Abstract Background Aligning short reads to a reference genome is an important task in many genome a...
[[abstract]]This paper presents a novel approach algorithm for bimolecular sequences alignment. Sequ...
We present an algorithm for the optimal alignment of sequences to genome graphs. It works by phrasin...
Motivation: Graphs are commonly used to represent sets of sequences. Either edges or nodes can be la...
Sequence alignment by exact or approximate string matching is one of the fundamental problems in bio...
Motivation: Many multiple sequence alignment tools have been developed in the past, progressing eith...
Background Recent advances in rapid, low-cost sequencing have opened up the opportunity to study ...
Sequence alignment is an important tool for describing relationships between sequences. Many sequenc...
BACKGROUND : Aligning short reads to a reference genome is an important task in many genome analysis...
Sequence alignment has become a routine procedure in evolutionary biology in looking for evolutionar...
With more and more biological sequences available, sequence analyses have become very important in b...
AbstractThe multiple alignment of the sequences of DNA and proteins is applicable to various importa...
The technologies for sequencing genetic materials have improved vastly during the last fifteen years...
An essential tool in biology is the alignment of multiple sequences. Biologists use multiple sequenc...
Graph based non-linear reference structures such as variation graphs and colored de Bruijn graphs en...
Abstract Background Aligning short reads to a reference genome is an important task in many genome a...
[[abstract]]This paper presents a novel approach algorithm for bimolecular sequences alignment. Sequ...
We present an algorithm for the optimal alignment of sequences to genome graphs. It works by phrasin...
Motivation: Graphs are commonly used to represent sets of sequences. Either edges or nodes can be la...
Sequence alignment by exact or approximate string matching is one of the fundamental problems in bio...
Motivation: Many multiple sequence alignment tools have been developed in the past, progressing eith...
Background Recent advances in rapid, low-cost sequencing have opened up the opportunity to study ...
Sequence alignment is an important tool for describing relationships between sequences. Many sequenc...
BACKGROUND : Aligning short reads to a reference genome is an important task in many genome analysis...
Sequence alignment has become a routine procedure in evolutionary biology in looking for evolutionar...
With more and more biological sequences available, sequence analyses have become very important in b...
AbstractThe multiple alignment of the sequences of DNA and proteins is applicable to various importa...
The technologies for sequencing genetic materials have improved vastly during the last fifteen years...
An essential tool in biology is the alignment of multiple sequences. Biologists use multiple sequenc...
Graph based non-linear reference structures such as variation graphs and colored de Bruijn graphs en...
Abstract Background Aligning short reads to a reference genome is an important task in many genome a...
[[abstract]]This paper presents a novel approach algorithm for bimolecular sequences alignment. Sequ...