Pairwise local sequence alignment methods have been the prevailing technique to identify homologous nucleotides between related species. However, existing methods that identify and align all homologous nucleotides in one or more genomes have suffered from poor scalability and limited accuracy. We propose a novel method that couples a gapped extension heuristic with an efficient filtration method for identifying interspersed repeats in genome sequences. During gapped extension, we use the MUSCLE implementation of progressive global multiple alignment with iterative refinement. The resulting gapped extensions potentially contain alignments of unrelated sequence. We detect and remove such undesirable alignments using a hidden Markov model (HMM...
Motivation: Homologous sequences are sometimes similar over some regions but different over other re...
Abstract Background Non-coding DNA sequences comprise a very large proportion of the total genomic c...
AbstractWe present improved techniques for finding homologous regions in DNA and protein sequences. ...
This paper presents new optimizations designed to improve an algorithm at the state-of-the-art for f...
AbstractSequence alignment is a central tool in molecular biology. A Multiple sequence alignment (MS...
A major challenge in computational biology is the identification of evolutionarily related macromole...
<p>(a) Tester constructs comprise pairs of closely-positioned DNA repeats: the base-pair sequence of...
Conventional phylogenetic tree estimation methods assume that all sites in a DNA multiple alignment...
Motivation: Progressive algorithms are widely used heuristics for the production of alignments among...
We present an improved phylogenetic factorial hidden Markov model (FHMM) for detecting two types of ...
CConventional phylogenetic tree estimation methods assume that all sites in a DNA multiple alignment...
Molecular evolutionary biology allows us to look into the past by analyzing sequences of amino acids...
Multiple Sequence Alignment (MSA) is one of the basic tool for interpreting the information obtained...
this paper, we focus on repetitive sequences in which short words are repeated many times, referred ...
<br>Motivation: A recently proposed method for detecting recombination in DNA sequence alignments is...
Motivation: Homologous sequences are sometimes similar over some regions but different over other re...
Abstract Background Non-coding DNA sequences comprise a very large proportion of the total genomic c...
AbstractWe present improved techniques for finding homologous regions in DNA and protein sequences. ...
This paper presents new optimizations designed to improve an algorithm at the state-of-the-art for f...
AbstractSequence alignment is a central tool in molecular biology. A Multiple sequence alignment (MS...
A major challenge in computational biology is the identification of evolutionarily related macromole...
<p>(a) Tester constructs comprise pairs of closely-positioned DNA repeats: the base-pair sequence of...
Conventional phylogenetic tree estimation methods assume that all sites in a DNA multiple alignment...
Motivation: Progressive algorithms are widely used heuristics for the production of alignments among...
We present an improved phylogenetic factorial hidden Markov model (FHMM) for detecting two types of ...
CConventional phylogenetic tree estimation methods assume that all sites in a DNA multiple alignment...
Molecular evolutionary biology allows us to look into the past by analyzing sequences of amino acids...
Multiple Sequence Alignment (MSA) is one of the basic tool for interpreting the information obtained...
this paper, we focus on repetitive sequences in which short words are repeated many times, referred ...
<br>Motivation: A recently proposed method for detecting recombination in DNA sequence alignments is...
Motivation: Homologous sequences are sometimes similar over some regions but different over other re...
Abstract Background Non-coding DNA sequences comprise a very large proportion of the total genomic c...
AbstractWe present improved techniques for finding homologous regions in DNA and protein sequences. ...