Modern query engines rely heavily on hash tables for query processing. Overall query performance and memory footprint are often determined by how hash tables and the tuples within them are represented. In this work, we propose three complementary techniques to improve this representation: Domain-Guided Prefix Suppression bit-packs keys and values tightly to reduce hash table record width. Optimistic Splitting decomposes values (and operations on them) into (operations on) frequently- and infrequently-accessed value slices. By removing the infrequently-accessed value slices from the hash table record, it improves cache locality. The Unique Strings Self-aligned Region (USSR) accelerates handling frequently occurring strings, which are widespre...
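To make the bit-packing idea concrete, the following is a minimal sketch of packing several integer columns into one narrow record. It assumes that each column's value range (a minimum and maximum) is known in advance, and that a value is stored as its offset from the column minimum in just enough bits to cover the range. This is an illustrative reading of "bit-packs keys and values tightly", not the implementation from the abstract above; the names `bits_needed`, `pack`, and `unpack` are hypothetical.

```python
from typing import List, Sequence, Tuple

Domain = Tuple[int, int]  # (min, max) of a column; assumed known up front


def bits_needed(lo: int, hi: int) -> int:
    """Bits required to store any offset (value - lo) for values in [lo, hi]."""
    return max(1, (hi - lo).bit_length())


def pack(values: Sequence[int], domains: Sequence[Domain]) -> int:
    """Pack one record into a single integer, first column in the lowest bits."""
    word = 0
    shift = 0
    for value, (lo, hi) in zip(values, domains):
        if not lo <= value <= hi:
            raise ValueError(f"{value} is outside the domain [{lo}, {hi}]")
        word |= (value - lo) << shift  # store only the offset from the minimum
        shift += bits_needed(lo, hi)
    return word


def unpack(word: int, domains: Sequence[Domain]) -> List[int]:
    """Recover the original column values from a packed record."""
    values = []
    for lo, hi in domains:
        width = bits_needed(lo, hi)
        values.append((word & ((1 << width) - 1)) + lo)
        word >>= width
    return values


# Hypothetical columns: a year, a zero-based month, and a small signed delta.
domains = [(1990, 2025), (0, 11), (-5, 5)]
record = [2024, 7, -3]
packed = pack(record, domains)
restored = unpack(packed, domains)
total_bits = sum(bits_needed(lo, hi) for lo, hi in domains)
```

In this example the three columns need 6, 4, and 4 bits, so the record fits in 14 bits rather than three full-width integers. A real engine would also have to handle values that fall outside the assumed domain, which this sketch simply rejects.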
Hash tables are a ubiquitous class of dictionary data structures. However, standard hash table imple...
In this article, we examine a realization of an open addressing hash table in the chained allocated ...
The need to store and query a set of strings – a string dictionary – arises in many kinds of applica...
Hashing is one of the fundamental techniques used to implement query processing operators such as gr...
There is a common problem of operating on hash values of elements of some database. In this paper th...
Modern in-memory databases are typically used for high-performance workloads; therefore, they have to...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Tries are popular data structures for storing a set of strings, where common prefixes are represente...
A key decision when developing in-memory computing applications is the choice of a mechanism to store an...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk ...
Extracting valuable information from the rapidly growing field of Big Data faces serious performance...