AbstractWe give new space/time tradeoffs for compressed indexes that answer document retrieval queries on general sequences. On a collection of D documents of total length n, current approaches require at least |CSA|+O(nlgDlglgD) or 2|CSA|+o(n) bits of space, where CSA is a full-text index. Using monotone minimal perfect hash functions (mmphfs), we give new algorithms for document listing with frequencies and top-k document retrieval using just |CSA|+O(nlglglgD) bits. We also improve current solutions that use 2|CSA|+o(n) bits, and consider other problems such as colored range listing, top-k most important documents, and computing arbitrary frequencies. We give proof-of-concept experimental results that show that using mmphfs may provide re...
In this paper we design two compressed data structures for the full-text indexing problem. These da...
The size of electronic data is currently growing at a faster rate than computer memory and disk stor...
Abstract: Full-text database systems require an in-dex to allow fast access to documents based on th...
AbstractWe give new space/time tradeoffs for compressed indexes that answer document retrieval queri...
Abstract. Given a collection of strings (called documents), the top-k document retrieval problem is ...
Abstract: An optimal index solving top-k document retrieval [Navarro and Nekrich, SODA’12] takes O(m...
This thesis studies problems related to compressed full-text indexes. A full-text index is a data st...
Given a sequence S = s_1 s_2 ... s_n of integers smaller than r = O(polylog(n)), we show how S can b...
We design two compressed data structures for the full-text indexing problem that support efficient s...
We address the problem of indexing a collectionD = {T1,T2,...TD} of D string documents of total leng...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
This chapter has demonstrated the feasibility of full-text indexing of large information bases. The ...
Compression reduces both the size of indexes and the time needed to evaluate queries. In this paper,...
Let D = {d1, d2, d3,..., dD} be a given set of D (string) docu-ments of total length n. The top-k do...
Full-text indexes provide fast substring search over large text collections. A serious problem of th...
In this paper we design two compressed data structures for the full-text indexing problem. These da...
The size of electronic data is currently growing at a faster rate than computer memory and disk stor...
Abstract: Full-text database systems require an in-dex to allow fast access to documents based on th...
AbstractWe give new space/time tradeoffs for compressed indexes that answer document retrieval queri...
Abstract. Given a collection of strings (called documents), the top-k document retrieval problem is ...
Abstract: An optimal index solving top-k document retrieval [Navarro and Nekrich, SODA’12] takes O(m...
This thesis studies problems related to compressed full-text indexes. A full-text index is a data st...
Given a sequence S = s_1 s_2 ... s_n of integers smaller than r = O(polylog(n)), we show how S can b...
We design two compressed data structures for the full-text indexing problem that support efficient s...
We address the problem of indexing a collectionD = {T1,T2,...TD} of D string documents of total leng...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
This chapter has demonstrated the feasibility of full-text indexing of large information bases. The ...
Compression reduces both the size of indexes and the time needed to evaluate queries. In this paper,...
Let D = {d1, d2, d3,..., dD} be a given set of D (string) docu-ments of total length n. The top-k do...
Full-text indexes provide fast substring search over large text collections. A serious problem of th...
In this paper we design two compressed data structures for the full-text indexing problem. These da...
The size of electronic data is currently growing at a faster rate than computer memory and disk stor...
Abstract: Full-text database systems require an in-dex to allow fast access to documents based on th...