Abstract We present the AS-Index, a new index structure for exact string search in disk resident databases. AS-index relies on a classical inverted file structure, its main innovation being a probabilistic search based on the properties of algebraic signatures used both for n-grams hashing and pattern search. Specifically, the properties of our signatures allow to carry out a search by inspecting only two of the posting lists. The algorithm thus enjoys the unique feature of requiring a constant number of disk accesses, independently from both the pattern size and the database size. We conduct extensive experiments on large datasets to evaluate our index behavior. They confirm that it steadily provides a search performance proportional to th...
We design two compressed data structures for the full-text indexing problem that support efficient s...
The collection indexing problem is defined as follows: Given a collection of highly similar strings,...
The proliferation of online text, such as found on the World Wide Web and in online databases, motiv...
AS-Index is a new index structure for exact string search in disk resident databases. It uses hashin...
AS-Index is a new index structure for exact string search in disk resident databases. It uses hashin...
We propose a novel string search algorithm for data stored once and read many times. Our search meth...
We propose a novel string (pattern) matching algorithm called n-gram search. We intend it for the re...
We propose a novel string (pattern) matching algorithm called n-gram search. We intend it for the re...
We propose a novel string search algorithm for data stored once and read many times. Our search met...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
Abstract—The suffix array is an efficient data structure for in-memory pattern search. Suffix arrays...
A compressed full-text self-index for a text T is a data structure requiring reduced space and able ...
Within the field of machine learning for systems, learning-based methods have brought new perspectiv...
The proliferation of online text, such as on the World Wide Web and in databases, motivates the need...
The aim of this work is to show the novel index structure based suffix array and ternary search tree...
We design two compressed data structures for the full-text indexing problem that support efficient s...
The collection indexing problem is defined as follows: Given a collection of highly similar strings,...
The proliferation of online text, such as found on the World Wide Web and in online databases, motiv...
AS-Index is a new index structure for exact string search in disk resident databases. It uses hashin...
AS-Index is a new index structure for exact string search in disk resident databases. It uses hashin...
We propose a novel string search algorithm for data stored once and read many times. Our search meth...
We propose a novel string (pattern) matching algorithm called n-gram search. We intend it for the re...
We propose a novel string (pattern) matching algorithm called n-gram search. We intend it for the re...
We propose a novel string search algorithm for data stored once and read many times. Our search met...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
Abstract—The suffix array is an efficient data structure for in-memory pattern search. Suffix arrays...
A compressed full-text self-index for a text T is a data structure requiring reduced space and able ...
Within the field of machine learning for systems, learning-based methods have brought new perspectiv...
The proliferation of online text, such as on the World Wide Web and in databases, motivates the need...
The aim of this work is to show the novel index structure based suffix array and ternary search tree...
We design two compressed data structures for the full-text indexing problem that support efficient s...
The collection indexing problem is defined as follows: Given a collection of highly similar strings,...
The proliferation of online text, such as found on the World Wide Web and in online databases, motiv...