Compact and hash based variants of the suffix array

Grabowski, S.
Raniszewski, M.

Open link

Publication date

January 2017

DOI

10.1515/bpasts-2017-0046

Publisher

Polska Akademia Nauk. Czasopisma i Monografie PAN

Abstract

Full-text indexing aims at building a data structure over a given text capable of efficiently finding arbitrary text patterns, and possibly requiring little space. We propose two suffix array inspired full-text indexes. One, called SA-hash, augments the suffix array with a hash table to speed up pattern searches due to significantly narrowed search interval before the binary search phase. The other, called FBCSA, is a compact data structure, similar to Mäkinen’s compact suffix array (MakCSA), but working on fixed size blocks. Experiments on the widely used Pizza & Chili datasets show that SA-hash is about 2–3 times faster in pattern searches (counts) than the standard suffix array, for the price of requiring 0.2n–1.1n bytes of extra space, ...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Compact and hash based variants of the suffix array

Abstract

Extracted data

Compact and hash based variants of the suffix array

Abstract

Extracted data

Related items

Related items