Modern query engines rely heavily on hash tables for query processing. Overall query performance and memory footprint is often determined by how hash tables and the tuples within them are represented. In this work, we propose three complementary techniques to improve this representation: Domain-Guided Prefix Suppression bit-packs keys and values tightly to reduce hash table record width. Optimistic Splitting decomposes values (and operations on them) into (operations on) frequently-accessed and infrequently-accessed value slices. By removing the infrequently-accessed value slices from the hash table record, it improves cache locality. The Unique Strings Self-aligned Region (USSR) accelerates handling frequently-occurring strings, which are ...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk ...
Bloom filters are widely used in genome assembly, IoT applications and several network applications ...
Modern query engines rely heavily on hash tables for query processing. Overall query performance and...
Modern query engines rely heavily on hash tables for query processing. Overall query performance and...
Hashing is one of the fundamental techniques used to implement query processing operators such as gr...
There is a common problem of operating on hash values of elements of some database. In this paper th...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Modern in-memory databases are typically used for high-performance workloads, therefore they have to...
In this article, we examine a realization of an open addressing hash table in the chained allocated ...
Extracting valuable information from the rapidly growing field of Big Data faces serious performance...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Tries are popular data structures for storing a set of strings, where common prefixes are represente...
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, f...
A key decision when developing in-memory computing applications is choice of a mechanism to store an...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk ...
Bloom filters are widely used in genome assembly, IoT applications and several network applications ...
Modern query engines rely heavily on hash tables for query processing. Overall query performance and...
Modern query engines rely heavily on hash tables for query processing. Overall query performance and...
Hashing is one of the fundamental techniques used to implement query processing operators such as gr...
There is a common problem of operating on hash values of elements of some database. In this paper th...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Modern in-memory databases are typically used for high-performance workloads, therefore they have to...
In this article, we examine a realization of an open addressing hash table in the chained allocated ...
Extracting valuable information from the rapidly growing field of Big Data faces serious performance...
We consider the dictionary problem in external memory and improve the update time of the well-known ...
Tries are popular data structures for storing a set of strings, where common prefixes are represente...
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, f...
A key decision when developing in-memory computing applications is choice of a mechanism to store an...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk ...
Bloom filters are widely used in genome assembly, IoT applications and several network applications ...