Full-text indexing aims at building a data structure over a given text capable of efficiently finding arbitrary text patterns, and possibly requiring little space. We propose two suffix array inspired full-text indexes. One, called SA-hash, augments the suffix array with a hash table to speed up pattern searches due to significantly narrowed search interval before the binary search phase. The other, called FBCSA, is a compact data structure, similar to Mäkinen’s compact suffix array (MakCSA), but working on fixed size blocks. Experiments on the widely used Pizza & Chili datasets show that SA-hash is about 2–3 times faster in pattern searches (counts) than the standard suffix array, for the price of requiring 0.2n–1.1n bytes of extra space, ...
The suffix tree (or equivalently, the enhanced suffix array) provides efficient solutions to many pr...
Suffix arrays are a simple and powerful data structure for text processing that can be used for full...
The construction of full-text indexes on very large text collections is nowadays a hot problem. The ...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
The suffix array is an efficient data structure for in-memory pattern search. Suffix arrays can also...
The suffix array is a space-efficient data structure that provides fast access to all occurrences of...
The proliferation of online text, such as on the World Wide Web and in databases, motivates the need...
This thesis studies efficient algorithms for constructing the most fundamen-tal data structures used...
The proliferation of online text, such as found on the World Wide Web and in online databases, motiv...
The suffix array is a classic full-text index, combining effectiveness with simplicity. We discuss t...
The construction of full-text indexes on very large text collections is nowadays a hot problem. Th...
This is the published version. Copyright 2005 Society for Industrial and Applied MathematicsThe pro...
The construction of full-text indexes on very large text collections is nowadays a hot problem. The ...
[[abstract]]Recent research in compressing suffix arrays has resulted in two breakthrough indexing d...
We report on a new and improved version of high-order entropy-compressed suffix arrays, which has th...
The suffix tree (or equivalently, the enhanced suffix array) provides efficient solutions to many pr...
Suffix arrays are a simple and powerful data structure for text processing that can be used for full...
The construction of full-text indexes on very large text collections is nowadays a hot problem. The ...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
The suffix array is an efficient data structure for in-memory pattern search. Suffix arrays can also...
The suffix array is a space-efficient data structure that provides fast access to all occurrences of...
The proliferation of online text, such as on the World Wide Web and in databases, motivates the need...
This thesis studies efficient algorithms for constructing the most fundamen-tal data structures used...
The proliferation of online text, such as found on the World Wide Web and in online databases, motiv...
The suffix array is a classic full-text index, combining effectiveness with simplicity. We discuss t...
The construction of full-text indexes on very large text collections is nowadays a hot problem. Th...
This is the published version. Copyright 2005 Society for Industrial and Applied MathematicsThe pro...
The construction of full-text indexes on very large text collections is nowadays a hot problem. The ...
[[abstract]]Recent research in compressing suffix arrays has resulted in two breakthrough indexing d...
We report on a new and improved version of high-order entropy-compressed suffix arrays, which has th...
The suffix tree (or equivalently, the enhanced suffix array) provides efficient solutions to many pr...
Suffix arrays are a simple and powerful data structure for text processing that can be used for full...
The construction of full-text indexes on very large text collections is nowadays a hot problem. The ...