This paper proposes a complementary novel idea, called MiniTasking to further reduce the number of cache misses by improving the data temporal locality for multiple concurrent queries. Our idea is based on the observation that, in many workloads such as decision support systems (DSS), there is usually significant amount of data sharing among different concurrent queries. MiniTasking exploits such data sharing to improve data temporal locality by scheduling query execution at three levels: query level batching, operator level grouping and mini-task level scheduling. The experimental results with various types of concurrent TPC-H query workloads show that, with the traditional N-ary Storage Model (NSM) layout, MiniTasking significantly reduce...
Leveraging data in distributed caches for large scale query processing applications is becoming more...
In distributed query processing systems where caching infrastructure is distributed and scales with ...
Many techniques developed for parallel database systems were focused on large-scale, often prototypi...
This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses by improvi...
Abstract. This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses ...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
Queries with common sequences of disk accesses can make maximal use of a buffer pool. We developed a...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
Database systems frequently have to execute a batch of related queries. Multi-query optimization exp...
Database systems access memory either sequentially or randomly. Contrary to sequential access and de...
In distributed query processing systems, load balancing plays an important role in maximizing system...
One of the important characteristics of emerging multicores/manycores is the existence of 'shared on...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SM...
Abstract. Leveraging data in distributed caches for large scale query process-ing applications is be...
Abstract: In-memory database management systems have the potential to reduce the execution time of c...
Leveraging data in distributed caches for large scale query processing applications is becoming more...
In distributed query processing systems where caching infrastructure is distributed and scales with ...
Many techniques developed for parallel database systems were focused on large-scale, often prototypi...
This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses by improvi...
Abstract. This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses ...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
Queries with common sequences of disk accesses can make maximal use of a buffer pool. We developed a...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
Database systems frequently have to execute a batch of related queries. Multi-query optimization exp...
Database systems access memory either sequentially or randomly. Contrary to sequential access and de...
In distributed query processing systems, load balancing plays an important role in maximizing system...
One of the important characteristics of emerging multicores/manycores is the existence of 'shared on...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SM...
Abstract. Leveraging data in distributed caches for large scale query process-ing applications is be...
Abstract: In-memory database management systems have the potential to reduce the execution time of c...
Leveraging data in distributed caches for large scale query processing applications is becoming more...
In distributed query processing systems where caching infrastructure is distributed and scales with ...
Many techniques developed for parallel database systems were focused on large-scale, often prototypi...