The problem of discovering frequent poly-regions (i.e. regions of high occurrence of a set of items or patterns of a given alphabet) in a sequence is studied, and three efficient approaches are proposed to solve it. The first one is entropy-based and applies a recursive segmentation technique that produces a set of candidate segments which may potentially lead to a poly-region. The key idea of the second approach is the use of a set of sliding windows over the sequence. Each sliding window covers a sequence segment and keeps a set of statistics that mainly include the number of occurrences of each item or pattern in that segment. Combining these statistics efficiently yields the complete set of poly-regions in the given sequence. The third ...
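The sliding-window idea described above can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a single fixed-width window, a plain string as the sequence, and a simple count threshold, whereas the abstract describes a set of windows whose statistics are combined. The names `dense_windows`, `width`, and `min_count` are hypothetical and chosen only for illustration.

```python
from typing import List, Set, Tuple


def dense_windows(seq: str, items: Set[str], width: int,
                  min_count: int) -> List[Tuple[int, int]]:
    """Return merged half-open [start, end) spans covered by windows of
    length `width` that contain at least `min_count` symbols from `items`.

    Illustrative assumption: one fixed window width and a raw count
    threshold stand in for the per-window statistics in the abstract.
    """
    if width <= 0 or width > len(seq):
        return []
    # Number of target symbols in the first window.
    count = sum(1 for ch in seq[:width] if ch in items)
    spans: List[Tuple[int, int]] = []
    start = 0
    while True:
        if count >= min_count:
            if spans and start <= spans[-1][1]:
                # Overlaps or touches the previous span: extend it.
                spans[-1] = (spans[-1][0], start + width)
            else:
                spans.append((start, start + width))
        if start + width >= len(seq):
            break
        # Slide by one position: drop the leftmost symbol, add the next.
        count -= seq[start] in items
        count += seq[start + width] in items
        start += 1
    return spans


if __name__ == "__main__":
    # Regions of a toy DNA string where every symbol in a 4-wide window is 'A'.
    print(dense_windows("ACGTAAAAAACGTC", {"A"}, width=4, min_count=4))
```

Updating the count incrementally as the window slides keeps the scan linear in the sequence length, which is the kind of per-window bookkeeping the abstract alludes to.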
In this paper, an approach for efficiently extracting the repeating patterns in a biological sequenc...
A new method for the search of local repeats in long DNA sequences, such as complete genomes, is pre...
The emergence of automated high-throughput sequencing technologies has resulted in a huge increase o...
The problem of discovering frequent poly-regions (i.e. regions of high occurrence of a set of items ...
The problem of discovering frequent arrangements of regions of high occurrence of one or more items ...
We study the problem of mining poly-regions in DNA. A poly-region is defined as a bursty DNA area, i...
Algorithms for finding similar, or highly conserved, regions in a group of sequences are at the core...
We study a problem of mining frequently occurring periodic patterns with a gap requirement from sequ...
We study two fundamental problems concerning the search for interesting regions in sequences...
The genomes of many species are dominated by short sequences repeated consecutively...
Bio-data analysis deals with the most vital discovering problem of similarity search and finding rel...
Unusual patterns in nucleic acid or protein sequences are often suspected for their biological relev...
Algorithms for finding similar, or highly conserved, regions in a group of sequences are at ...
Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law corre...
Genomics, with the high amount of heterogeneous data that it is generating, is opening many interest...