Computing Reviews (1998) Categories and Subject Descriptors: E.4 Coding and Information Theory — data compaction and compression F.2.2 Analysis of Algorithms and Problem Complexity: Nonnumerical Algorithms and Problems — pattern matching, sorting and searchingA repetitive sequence collection is one where portions of a base sequence of length n are repeated many times with small variations, forming a collection of total length N. Examples of such collections are version control data and genome sequences of individuals, where the differences can be expressed by lists of basic edit operations. Flexible and efficient data analysis on a such typically huge collection is plausible using suffix trees. However, suffix tree occupies O(N logN...
Abstract. We introduce the first self-index based on the Lempel-Ziv 1977 compression format (LZ77). ...
Given a sequence S = s_1 s_2 ... s_n of integers smaller than r = O(polylog(n)), we show how S can b...
Highly repetitive strings are increasingly being amassed by genome sequencing experiments, and by ve...
A repetitive sequence collection is one where portions of a base sequence of length n are repeated m...
Volume: 5541A repetitive sequence collection is one where portions of a base sequence of length n ar...
A repetitive sequence collection is one where portions of a emph{base sequence} of length $n$ are re...
Computing Reviews (1998) Categories and Subject Descriptors: E.4 Coding and Information Theory — da...
We propose algorithms that, given the input string of length n over integer alphabet of size σ, cons...
Indexing highly repetitive texts—such as genomic databases, software repositories and versioned text...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
We consider document listing on string collections, that is, finding in which strings a given patter...
This thesis studies problems related to compressed full-text indexes. A full-text index is a data st...
This thesis deals with space-efficient algorithms to compress and index texts. The aim of compressio...
The final publication is available via http://dx.doi.org/10.1016/j.is.2016.04.002[Abstract] Indexing...
Abstract. We introduce the first self-index based on the Lempel-Ziv 1977 compression format (LZ77). ...
Given a sequence S = s_1 s_2 ... s_n of integers smaller than r = O(polylog(n)), we show how S can b...
Highly repetitive strings are increasingly being amassed by genome sequencing experiments, and by ve...
A repetitive sequence collection is one where portions of a base sequence of length n are repeated m...
Volume: 5541A repetitive sequence collection is one where portions of a base sequence of length n ar...
A repetitive sequence collection is one where portions of a emph{base sequence} of length $n$ are re...
Computing Reviews (1998) Categories and Subject Descriptors: E.4 Coding and Information Theory — da...
We propose algorithms that, given the input string of length n over integer alphabet of size σ, cons...
Indexing highly repetitive texts—such as genomic databases, software repositories and versioned text...
The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes ba...
We consider document listing on string collections, that is, finding in which strings a given patter...
This thesis studies problems related to compressed full-text indexes. A full-text index is a data st...
This thesis deals with space-efficient algorithms to compress and index texts. The aim of compressio...
The final publication is available via http://dx.doi.org/10.1016/j.is.2016.04.002[Abstract] Indexing...
Abstract. We introduce the first self-index based on the Lempel-Ziv 1977 compression format (LZ77). ...
Given a sequence S = s_1 s_2 ... s_n of integers smaller than r = O(polylog(n)), we show how S can b...
Highly repetitive strings are increasingly being amassed by genome sequencing experiments, and by ve...