Over the last decade, biological sequence repositories have been growing at an exponential rate. Sophisticated indexing techniques are required to facilitate efficient searching through these humongous genetic repositories. A particularly attractive index structure for such sequence processing is the classical suffix-tree, a vertically compressed trie structure built over the set of all suffixes of a sequence. Its attractiveness stems from its linearity properties -- suffix-tree construction times are linear in the size of the indexed sequences, while search times are linear in the size of the query strings. In practice, however, the promise of suffix-trees is not realized for extremely long sequences, such as the human genome, that run int...
Sequence data is one of the rapidly growing types of data. New efficient and scalable techniques are...
The amount of available biological sequences, represented as strings over the DNA and protein alphab...
With advances in sequencing technology and through aggressive sequencing efforts, DNA sequence data...
The suffix tree is a well known and popular indexing structure for various sequence processing probl...
Abstract. Suffix-trees are popular indexing structures for various sequence processing problems in b...
Mammalian genomes are typically 3 Gbp (gigabase pairs) in size. The largest public database NCBI (Na...
Schürmann K-B, Stoye J. Suffix Tree Construction and Storage with Limited Main Memory. Forschungsber...
Sequence datasets are ubiquitous in modern life-science applications, and querying sequences is a co...
The suffix tree (or equivalently, the enhanced suffix array) provides efficient solutions to many pr...
A suffix tree is a fundamental data structure for string searching algorithms. Unfortunately, when ...
We propose a new method to build persistent suffix trees for indexing genomic data. Our algorith...
Online persistent suffix tree construction has been considered impractical due to its excessive I/O ...
Abstract. Designing external memory data structures for string databases is of significant recent in...