Within the field of machine learning for systems, learning-based methods have brought new perspective to indexing by reframing it as a cumulative distribution function (CDF) modeling problem. The burgeoning field, despite its nascence, has brought with it many opportunities and efficiencies. However, most work in this area has focused on efficiently indexing numerical keys, as the additional challenges posed by indexing strings have prevented the effective application of these techniques to string domains. We hypothesize that the machine learning approaches which have, in recent years, made significant strides in scalar indexing applications can also be effectively adapted to string applications. First, we introduce the RadixStringSpline (R...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
Machine Learning Techniques, properly combined with Data Structures, have resulted in Learned Static...
We revisit the problem of indexing a string S[1..n] to support finding all substrings in S that matc...
Lookup of strings into in-memory database indexes is a problem with different considerations from th...
© 2020, VLDB Endowment. All rights reserved. Recent advancements in learned index structures propose...
© 2020 ACM. Recent research has shown that learned models can outperform state-of-the-art index stru...
© 2020 Association for Computing Machinery. Scanning and filtering over multi-dimensional tables are...
Abstract We present the AS-Index, a new index structure for exact string search in disk resident dat...
© 2020 Association for Computing Machinery. Recent work on "learned indexes" has changed the way we ...
We propose a new algorithmic framework that solves frequency-related data mining queries on database...
We consider the problem of substring searching in large databases. Typical applications of this prob...
Inverted indexes are vital in providing fast key-word-based search. For every term in the document c...
AS-Index is a new index structure for exact string search in disk resident databases. It uses hashin...
The minimizers sampling mechanism is a popular mechanism for string sampling. However, minimizers sa...
© 2018 Association for Computing Machinery. Indexes are models: a B-Tree-Index can be seen as a mode...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
Machine Learning Techniques, properly combined with Data Structures, have resulted in Learned Static...
We revisit the problem of indexing a string S[1..n] to support finding all substrings in S that matc...
Lookup of strings into in-memory database indexes is a problem with different considerations from th...
© 2020, VLDB Endowment. All rights reserved. Recent advancements in learned index structures propose...
© 2020 ACM. Recent research has shown that learned models can outperform state-of-the-art index stru...
© 2020 Association for Computing Machinery. Scanning and filtering over multi-dimensional tables are...
Abstract We present the AS-Index, a new index structure for exact string search in disk resident dat...
© 2020 Association for Computing Machinery. Recent work on "learned indexes" has changed the way we ...
We propose a new algorithmic framework that solves frequency-related data mining queries on database...
We consider the problem of substring searching in large databases. Typical applications of this prob...
Inverted indexes are vital in providing fast key-word-based search. For every term in the document c...
AS-Index is a new index structure for exact string search in disk resident databases. It uses hashin...
The minimizers sampling mechanism is a popular mechanism for string sampling. However, minimizers sa...
© 2018 Association for Computing Machinery. Indexes are models: a B-Tree-Index can be seen as a mode...
Abstract This paper is about compressed full-text indexes. That is, our goal is to represent full-te...
Machine Learning Techniques, properly combined with Data Structures, have resulted in Learned Static...
We revisit the problem of indexing a string S[1..n] to support finding all substrings in S that matc...