The minimal-length encoding approach is applied to define concept of sequence similarity. A sequence is defined to be similar to another sequence or to a set of keywords if it can be encoded in a small number of bits by taking advantage of common subwords. Minimal-length encoding of a sequence is computed in linear time, using a data compression algorithm that is based on a dynamic programming strategy and the directed acyclic word graph data structure. No assumptions about common word ("k-tuple") length are made in advance, and common words of any length are considered. The newly proposed algorithmic significance method provides an exact upper bound on the probability that sequence sim-ilarity has occurred by chance, thus elimina...
This paper is concerned with a novel way for measuring similarity between sequences of alphanumeric ...
Abslract- DNA matching is an important key to understanding genomes, evolution, relationships betwee...
Matching a biological sequence against a probabilistic pattern (or profile) is a common task in comp...
DNA Sequence Compression can be achieved through exploiting the intra-sequence and inter-sequence si...
Motivation: Several measures of DNA sequence dissimilarity have beendeveloped.Thepurposeof this pape...
Background: DNA sequence analysis is an important research topic in bioinformatics. Evaluating the s...
International audienceMotivation: Analysis of genetic sequences is usually based on finding similar ...
Rapid advancements in research in the field of DNA sequence discovery has led to a vast range of com...
Traditionally, intra-sequence similarity is exploited for compressing a single DNA sequence. Recentl...
This article investigates aspects of similarity between complete sequences of mitochondrial DNA by d...
Finding the sequence similarity between two genetic codes is an important problem in computational b...
LNCS v. 6124 is Proceedings of the 6th International Conference, AAIM 2010To study the genetic varia...
Classical DNA sequence compression algorithms consider only intra-sequence similarity, i.e., similar...
This paper suggests a novel way for measuring the similarity between sequences of symbols from alpha...
Efficient and expressive comparison of sequences is an essential procedure for learning with se-quen...
This paper is concerned with a novel way for measuring similarity between sequences of alphanumeric ...
Abslract- DNA matching is an important key to understanding genomes, evolution, relationships betwee...
Matching a biological sequence against a probabilistic pattern (or profile) is a common task in comp...
DNA Sequence Compression can be achieved through exploiting the intra-sequence and inter-sequence si...
Motivation: Several measures of DNA sequence dissimilarity have beendeveloped.Thepurposeof this pape...
Background: DNA sequence analysis is an important research topic in bioinformatics. Evaluating the s...
International audienceMotivation: Analysis of genetic sequences is usually based on finding similar ...
Rapid advancements in research in the field of DNA sequence discovery has led to a vast range of com...
Traditionally, intra-sequence similarity is exploited for compressing a single DNA sequence. Recentl...
This article investigates aspects of similarity between complete sequences of mitochondrial DNA by d...
Finding the sequence similarity between two genetic codes is an important problem in computational b...
LNCS v. 6124 is Proceedings of the 6th International Conference, AAIM 2010To study the genetic varia...
Classical DNA sequence compression algorithms consider only intra-sequence similarity, i.e., similar...
This paper suggests a novel way for measuring the similarity between sequences of symbols from alpha...
Efficient and expressive comparison of sequences is an essential procedure for learning with se-quen...
This paper is concerned with a novel way for measuring similarity between sequences of alphanumeric ...
Abslract- DNA matching is an important key to understanding genomes, evolution, relationships betwee...
Matching a biological sequence against a probabilistic pattern (or profile) is a common task in comp...