This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses by improving the data temporal locality for multiple concurrent queries. Our idea is based on the observation that, in many workloads such as decision support systems (DSS), there is usually significant amount of data sharing among different concurrent queries. MiniTasking exploits such data sharing characteristics to improve data temporal locality by scheduling query execution at three levels: (1) It batches queries based on their data sharing characteristics and the cache configuration. (2) It groups operators that share certain data. (3) It schedules mini-tasks which are small pieces of computation in operator groups according to their data locality...
One of the important characteristics of emerging multicores/manycores is the existence of 'shared on...
As a process executes on a CPU, it builds up state in that CPU's cache. In multiprogrammed work...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SM...
This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses by improvi...
Abstract. This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses ...
This paper proposes a complementary novel idea, called MiniTasking to further reduce the number of c...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
Queries with common sequences of disk accesses can make maximal use of a buffer pool. We developed a...
Database systems access memory either sequentially or randomly. Contrary to sequential access and de...
In distributed query processing systems, load balancing plays an important role in maximizing system...
Leveraging data in distributed caches for large scale query processing applications is becoming more...
Abstract. Leveraging data in distributed caches for large scale query process-ing applications is be...
This article proposes a cache pattern with multi-queries and describes the multi-query optimization ...
In distributed query processing systems where caching infrastructure is distributed and scales with ...
One of the important characteristics of emerging multicores/manycores is the existence of 'shared on...
As a process executes on a CPU, it builds up state in that CPU's cache. In multiprogrammed work...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SM...
This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses by improvi...
Abstract. This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses ...
This paper proposes a complementary novel idea, called MiniTasking to further reduce the number of c...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
Queries with common sequences of disk accesses can make maximal use of a buffer pool. We developed a...
Database systems access memory either sequentially or randomly. Contrary to sequential access and de...
In distributed query processing systems, load balancing plays an important role in maximizing system...
Leveraging data in distributed caches for large scale query processing applications is becoming more...
Abstract. Leveraging data in distributed caches for large scale query process-ing applications is be...
This article proposes a cache pattern with multi-queries and describes the multi-query optimization ...
In distributed query processing systems where caching infrastructure is distributed and scales with ...
One of the important characteristics of emerging multicores/manycores is the existence of 'shared on...
As a process executes on a CPU, it builds up state in that CPU's cache. In multiprogrammed work...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SM...