Matching a biological sequence against a probabilistic pattern (or profile) is a common task in computational biology. A probabilistic profile, represented as a scoring matrix, is better suited than a deterministic pattern to capturing the peculiarities of a given segment of a family of biological sequences. Brute-force algorithms take O(NP) time to match a sequence of N characters against a profile of length P ≪ N. In this work, we exploit string compression techniques to speed up brute-force profile matching. We present two algorithms, based on run-length encoding (RLE) and LZ78, that reduce the computational complexity by the compression factor of the encoding. While RLE provides little improvement over the brute-force algorithm (the number of...
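As a rough illustration of the O(NP) baseline mentioned in the abstract above, the following Python sketch scores every window of a sequence against a profile. The profile representation (a list of per-position score dictionaries), the zero score for unlisted characters, and the names `brute_force_scores` and `run_length_encode` are assumptions made for this example, not the authors' code. The run-length helper only shows the encoding that the RLE-based algorithm starts from; it does not reproduce the accelerated matching itself.

```python
from typing import Dict, List, Tuple

# Assumed representation: one dict of per-character scores per profile position.
Profile = List[Dict[str, float]]


def brute_force_scores(seq: str, profile: Profile) -> List[float]:
    """Score every length-P window of seq against the profile in O(N * P) time."""
    n, p = len(seq), len(profile)
    scores = []
    for i in range(n - p + 1):          # each alignment position in the sequence
        total = 0.0
        for j in range(p):              # each column of the profile
            # Characters missing from a column score 0.0 (an assumption).
            total += profile[j].get(seq[i + j], 0.0)
        scores.append(total)
    return scores


def run_length_encode(seq: str) -> List[Tuple[str, int]]:
    """Collapse consecutive repeats into (character, run length) pairs."""
    runs: List[Tuple[str, int]] = []
    for ch in seq:
        if runs and runs[-1][0] == ch:
            runs[-1] = (ch, runs[-1][1] + 1)   # extend the current run
        else:
            runs.append((ch, 1))               # start a new run
    return runs
```

A sequence shorter than the profile yields an empty score list, since no full window fits.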
the sequence. It gives us all text positions of each duplicated pattern. • The program chooses ea...
Motivation: Comparison of nucleic acid and protein sequences is a fundamental tool of modern bioinfo...
The increasing volume of biological data requires finding new ways to save these data in genetic ban...
Motivation: Matching a biological sequence against a probabilistic pattern (or profile) is a common...
Rapid advancements in research in the field of DNA sequence discovery have led to a vast range of com...
This paper introduces a novel algorithm for biological sequence compression that makes use of both s...
In bio-sequence repositories and other applications, such as the production of a CD-ROM...
DNA sequencing is the process of determining the ordered sequence of the four nucleotide bases in a ...
The brute force algorithm for string matching has a w...
Abstract. Search for sequence similarity in large-scale databases of DNA and protein sequences is on...
The minimal-length encoding approach is applied to define the concept of sequence similarity. A sequence...
Graduation date: 1993. As the volume of genetic sequence data increases due to improved sequencing ...
As the usage of technology increases rapidly today, the amount of data created also increases expone...
Position-specific scoring matrices are a popular choice for modelling signals or motifs in biologica...