In this paper we present HATCH, a novel hash join engine. We follow a new design point that lets us cache hash table entries in fast BRAM resources while still supporting collision resolution in hardware. HATCH offers the best of both worlds: (i) it uses the full capacity of the DDR memory to store complete hash tables, and (ii) by employing a cache, it exploits the high access speed of BRAMs. We demonstrate the usefulness of our approach by running hash join operations from 5 TPC-H benchmark queries and report speedups of up to 2.8x over a pipeline-optimized baseline.
The research leading to these results has received funding from the European Union's Seventh Framework Programme (FP7/2007-2013), for Advanced Analyt...
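As a rough software analogy of the design point in the abstract, the sketch below keeps the complete hash table in a large backing store (standing in for DDR) and places a small direct-mapped cache in front of it (standing in for BRAM). The class and function names, the chaining-based collision handling, and the cache policy are illustrative assumptions, not details taken from HATCH, and the code does not model hardware behaviour or timing.

```python
from collections import defaultdict


class CachedHashTable:
    """Hypothetical model: full table in a backing store, small cache in front."""

    def __init__(self, cache_slots=4):
        # Backing store: key -> list of payloads (chaining is an assumption).
        self.table = defaultdict(list)
        self.cache_slots = cache_slots
        # Direct-mapped cache: each slot holds (key, payloads) or None.
        self.cache = [None] * cache_slots
        self.hits = 0
        self.misses = 0

    def insert(self, key, payload):
        self.table[key].append(payload)
        # Invalidate a cached copy of this key so probes never see stale data.
        slot = hash(key) % self.cache_slots
        if self.cache[slot] is not None and self.cache[slot][0] == key:
            self.cache[slot] = None

    def probe(self, key):
        slot = hash(key) % self.cache_slots
        entry = self.cache[slot]
        if entry is not None and entry[0] == key:
            self.hits += 1            # served from the fast cache
            return entry[1]
        self.misses += 1              # fall back to the full table
        payloads = self.table.get(key, [])
        self.cache[slot] = (key, payloads)
        return payloads


def hash_join(build_rows, probe_rows, cache_slots=4):
    """Equi-join two lists of (key, payload) rows; returns (rows, hits, misses)."""
    ht = CachedHashTable(cache_slots)
    for key, payload in build_rows:
        ht.insert(key, payload)
    out = []
    for key, payload in probe_rows:
        for match in ht.probe(key):
            out.append((key, match, payload))
    return out, ht.hits, ht.misses


build = [(1, "a"), (2, "b"), (1, "c")]
probe = [(1, "x"), (1, "y"), (3, "z"), (2, "w")]
rows, hits, misses = hash_join(build, probe)
```

In this toy run, the second probe for key 1 is served from the cache, while the other three probes go to the backing store. A skewed probe side with many repeated keys is the case where such a cache would be expected to help most.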
Common web infrastructure relies on distributed main memory key-value stores to reduce access load o...
The architectural changes introduced with multi-core CPUs have triggered a redesign of main...
In the past decade, the exponential growth in commodity CPUs speed has far outpaced advances in memo...
Extracting valuable information from the rapidly growing field of Big Data faces serious performance...
The multi-way hash join is one of the commonly used and time-consuming database operations. Many alg...
We present new hash tables for joins, and a hash join based on them, that consumes far less memory a...
FPGA-based data processing is becoming increasingly relevant in data centers, as the transformation ...
The hash join algorithm family is one of the leading techniques for equi-join performance evaluation...
Join is an important database operation. As computer architectures evolve, the best join algorithm m...
Hashing is one of the fundamental techniques used to implement query processing operators such as gr...
Until recently, the use of graphics processing units (GPUs) for query processing was limited by the ...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Fast concurrent hash tables are an increasingly important building block as we scale systems to grea...
The architectural changes introduced with multicore CPUs have triggered a redesign of main-memory jo...