One way to represent a DNA sequence is to break it down into substrings of length L, called L-tuples, and count the occurence of each L-tuple in the sequence. This representation defines a mapping of a sequence into a numerical space by a numerical feature vector of fixed length, that allows to measure sequence similarity in an alignment free way simply using disssimilarity functions between vectors. This work presents a benchmark study of 4 alignment free disssimilarity functions between sequences, computed on their L-tuples representation, for the purpose of sequence classification. In our experiments, we have tested the classes of geometric-based, correlation-based and information-based dissimilarities, incorporating them into a nearest ...
Motivation: Alignment-free (AF) distance/similarity functions are a key tool for sequence analysis. ...
In this paper, we have proposed a novel alignment-free method for comparing the similarity of protei...
Computing similarity between 2 nucleotide sequences is one of the fundamental problems in bioinforma...
One way to represent a DNA sequence is to break it down into substrings of length L, called L-tuples...
Epigenetic mechanisms such as nucleosome positioning, histone modications and DNA methylation play a...
Background: DNA sequence analysis is an important research topic in bioinformatics. Evaluating the s...
Motivation: Alignment-free sequence comparison methods are still in the early stages of development ...
Biological sequences are constantly evolving so there are mutations, deletions and inserts. Because ...
A growing number of measures of sequence similarity is being based on some underlying notion of rela...
AbstractGraphical representation of DNA sequences is one of the most popular techniques for alignmen...
Motivation: Several measures of DNA sequence dissimilarity have beendeveloped.Thepurposeof this pape...
Motivation: Alignment-free sequence comparison methods can compute the pairwise similarity between a...
Motivation: Alignment-free (AF) distance/similarity functions are a key tool for sequence analysis. ...
Sequence comparison is a fundamental task in computational biology, traditionally dominated by align...
We study the problem of similarity detection by sequence alignment with gaps, using a recently estab...
Motivation: Alignment-free (AF) distance/similarity functions are a key tool for sequence analysis. ...
In this paper, we have proposed a novel alignment-free method for comparing the similarity of protei...
Computing similarity between 2 nucleotide sequences is one of the fundamental problems in bioinforma...
One way to represent a DNA sequence is to break it down into substrings of length L, called L-tuples...
Epigenetic mechanisms such as nucleosome positioning, histone modications and DNA methylation play a...
Background: DNA sequence analysis is an important research topic in bioinformatics. Evaluating the s...
Motivation: Alignment-free sequence comparison methods are still in the early stages of development ...
Biological sequences are constantly evolving so there are mutations, deletions and inserts. Because ...
A growing number of measures of sequence similarity is being based on some underlying notion of rela...
AbstractGraphical representation of DNA sequences is one of the most popular techniques for alignmen...
Motivation: Several measures of DNA sequence dissimilarity have beendeveloped.Thepurposeof this pape...
Motivation: Alignment-free sequence comparison methods can compute the pairwise similarity between a...
Motivation: Alignment-free (AF) distance/similarity functions are a key tool for sequence analysis. ...
Sequence comparison is a fundamental task in computational biology, traditionally dominated by align...
We study the problem of similarity detection by sequence alignment with gaps, using a recently estab...
Motivation: Alignment-free (AF) distance/similarity functions are a key tool for sequence analysis. ...
In this paper, we have proposed a novel alignment-free method for comparing the similarity of protei...
Computing similarity between 2 nucleotide sequences is one of the fundamental problems in bioinforma...