Run-Length-Encoding (RLE) is a data compression technique that is used in various applications, e.g., time series, biological sequences, and multimedia databases. One of the main challenges is how to operate on (e.g., index, search, and retrieve) compressed data without decompressing it. In this paper, we introduce the String B-tree for Compressed sequences, termed the SBC-tree, for indexing and searching RLE-compressed sequences of arbitrary length. The SBC-tree is a two-level index structure based on the well-known String B-tree and a 3-sided range query structure [7]. The SBC-tree supports pattern matching queries such as substring matching, prefix matching, and range search operations over RLE-compressed sequences. The SBC-t...
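As a small illustration of what operating on RLE-compressed data without decompressing it can look like, the sketch below encodes a sequence into runs and then answers a random-access query directly on the runs. It is not the SBC-tree and does not come from the paper; the function names `rle_encode` and `char_at` are hypothetical and chosen only for this example.

```python
from bisect import bisect_right
from itertools import accumulate, groupby


def rle_encode(s):
    """Collapse each maximal run of equal characters into a (char, length) pair."""
    return [(ch, len(list(group))) for ch, group in groupby(s)]


def char_at(runs, i):
    """Return the character at position i of the original sequence.

    Works on the run list only: a binary search over cumulative run
    lengths finds the run that covers position i, so the sequence is
    never expanded back to its uncompressed form.
    """
    ends = list(accumulate(length for _, length in runs))
    total = ends[-1] if ends else 0
    if not 0 <= i < total:
        raise IndexError(i)
    return runs[bisect_right(ends, i)][0]


# Hypothetical input: a short DNA-like sequence.
runs = rle_encode("AAAACCGGGT")
print(runs)              # [('A', 4), ('C', 2), ('G', 3), ('T', 1)]
print(char_at(runs, 4))  # C
```

A real index would precompute the cumulative lengths once rather than on every query; the sketch recomputes them to stay short.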
Indexing highly repetitive texts—such as genomic databases, software repositories and versioned text...
This thesis deals with space-efficient algorithms to compress and index texts. The aim of compressio...
The collection indexing problem is defined as follows: Given a collection of highly similar strings,...
Run-Length-Encoding (RLE) is a data compression technique that is used in various applications, e.g....
Computing Reviews (1998) Categories and Subject Descriptors: E.4 Coding and Information Theory — da...
An indexed sequence of strings is a data structure for storing a string sequence that supports rando...
We design two compressed data structures for the full-text indexing problem that support efficient s...
Similarity search over long sequence datasets is becoming increasingly popular in many emerging applicati...
The proliferation of online text, such as on the World Wide Web and in databases, motivates the need...
Let T be a string with n characters over an alphabet of constant size. A recent breakthr...
In this paper we design two compressed data structures for the full-text indexing problem. These da...
A repetitive sequence collection is one where portions of a base sequence of length n are repeated m...
This doctoral dissertation presents a range of results concerning efficient algorithms and data stru...