Suffix-trees are popular indexing structures for various sequence processing problems in biological data management. We investigate here the possibility of enhancing the search efficiency of disk-resident suffix-trees through customized layouts of tree-nodes to disk-pages. Specifically, we propose a new layout strategy, called Stellar, that provides significantly improved search performance on a representative set of real genomic sequences. Further, Stellar supports both the standard root-to-leaf lookup queries and sophisticated sequence search algorithms that exploit the suffix-links of suffix-trees. Our results are encouraging with regard to the ultimate objective of seamlessly integrating sequence processing in database e...
Abstract. Suffix trees have been established as one of the most versatile index structures for unstr...
In recent years, bioinformatics has become an important research field because there are more and more ...
Online persistent suffix tree construction has been considered impractical due to its excessive I/O...
Suffix-trees are popular indexing structures for various sequence processing problems in biological ...
Abstract. Suffix-trees are popular indexing structures for various sequence processing problems in b...
The suffix tree is a well known and popular indexing structure for various sequence processing probl...
Over the last decade, biological sequence repositories have been growing at an exponential rate. Sop...
We propose a new method to build persistent suffix trees for indexing the genomic data. Our algorith...
A suffix tree is a fundamental data structure for string searching algorithms. Unfortunately, when ...
Mammalian genomes are typically 3 Gbp (gigabase pairs) in size. The largest public database NCBI (Na...
The amount of available biological sequences, represented as strings over the DNA and protein alphab...
Sequence data is one of the rapidly growing types of data. New efficient and scalable techniques are...
The suffix tree (or equivalently, the enhanced suffix array) provides efficient solutions to many pr...
Sequence datasets are ubiquitous in modern life-science applications, and querying sequences is a co...
Abstract. Designing external memory data structures for string databases is of significant recent in...