Modern query engines rely heavily on hash tables for query processing. Overall query performance and memory footprint are often determined by how hash tables and the tuples within them are represented. In this work, we propose three complementary techniques to improve this representation. Domain-Guided Prefix Suppression bit-packs keys and values tightly to reduce hash table record width. Optimistic Splitting decomposes values (and operations on them) into frequently and infrequently accessed value slices (and operations on those slices). By removing the infrequently accessed value slices from the hash table record, it improves cache locality. The Unique Strings Self-aligned Region (USSR) accelerates the handling of frequently occurring strings, which are widespre...
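The bit-packing idea behind Domain-Guided Prefix Suppression can be illustrated with a minimal sketch. It assumes each column's value domain is known as a (min, max) pair, subtracts the minimum, and stores only the bits needed for the remaining range. The function names `bit_width`, `pack`, and `unpack` and the example domains are hypothetical and chosen for illustration; this is not the authors' implementation.

```python
from typing import List, Sequence, Tuple

Domain = Tuple[int, int]  # assumed (min, max) bounds of a column, inclusive


def bit_width(lo: int, hi: int) -> int:
    """Bits needed for any value in [lo, hi] once lo is subtracted."""
    return max(1, (hi - lo).bit_length())


def pack(row: Sequence[int], domains: Sequence[Domain]) -> int:
    """Pack one row into a single integer, first column in the lowest bits."""
    packed = 0
    shift = 0
    for value, (lo, hi) in zip(row, domains):
        if not lo <= value <= hi:
            raise ValueError(f"{value} is outside the domain [{lo}, {hi}]")
        packed |= (value - lo) << shift  # drop the shared prefix by rebasing on lo
        shift += bit_width(lo, hi)
    return packed


def unpack(packed: int, domains: Sequence[Domain]) -> List[int]:
    """Invert pack(): peel off each column's bits and add its minimum back."""
    row = []
    for lo, hi in domains:
        width = bit_width(lo, hi)
        row.append((packed & ((1 << width) - 1)) + lo)
        packed >>= width
    return row


# Hypothetical columns: year, month index, and a small signed offset.
domains = [(1990, 2020), (0, 11), (-5, 5)]
total_bits = sum(bit_width(lo, hi) for lo, hi in domains)
record = pack((2005, 7, -3), domains)
```

With these assumed domains, the three columns need 5 + 4 + 4 = 13 bits, so the packed record fits in two bytes instead of three full-width integers. A narrower record lets more entries share a cache line.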
Bitmap indices have been widely and successfully used in scientific and commercial databases. Compre...
Hashing has yet to be widely accepted as a component of hard real-time systems and hardware implemen...
We present new hash tables for joins, and a hash join based on them, that consumes far less memory a...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Hashing is one of the fundamental techniques used to implement query processing operators such as gr...
A key decision when developing in-memory computing applications is the choice of a mechanism to store an...
There is a common problem of operating on hash values of elements of some database. In this paper th...
Tries are popular data structures for storing a set of strings, where common prefixes are represente...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
In this article, we examine a realization of an open addressing hash table in the chained allocated ...
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, f...
Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk ...