Abstract. This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses by improving the data temporal locality for multiple concurrent queries. Our idea is based on the observation that, in many workloads such as decision support systems (DSS), there is usually significant amount of data sharing among different concurrent queries. MiniTasking exploits such data sharing characteristics to improve data temporal locality by scheduling query execution at three levels: (1) It batches queries based on their data sharing characteristics and the cache configuration. (2) It groups operators that share certain data. (3) It schedules mini-tasks which are small pieces of computation in operator groups according to their dat...
Leveraging data in distributed caches for large scale query processing applications is becoming more...
Database systems frequently have to execute a batch of related queries. Multi-query optimization exp...
As a process executes on a CPU, it builds up state in that CPU's cache. In multiprogrammed work...
This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses by improvi...
This paper proposes a complementary novel idea, called MiniTasking to further reduce the number of c...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
Queries with common sequences of disk accesses can make maximal use of a buffer pool. We developed a...
In distributed query processing systems, load balancing plays an important role in maximizing system...
This article proposes a cache pattern with multi-queries and describes the multi-query optimization ...
In distributed query processing systems where caching infrastructure is distributed and scales with ...
Database systems access memory either sequentially or randomly. Contrary to sequential access and de...
One of the important characteristics of emerging multicores/manycores is the existence of 'shared on...
Abstract. Leveraging data in distributed caches for large scale query process-ing applications is be...
When data analysis applicationsare employed in a multi-client environment, a data server must servic...
Leveraging data in distributed caches for large scale query processing applications is becoming more...
Database systems frequently have to execute a batch of related queries. Multi-query optimization exp...
As a process executes on a CPU, it builds up state in that CPU's cache. In multiprogrammed work...
This paper proposes a novel idea, called MiniTasking to reduce the number of cache misses by improvi...
This paper proposes a complementary novel idea, called MiniTasking to further reduce the number of c...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
Queries with common sequences of disk accesses can make maximal use of a buffer pool. We developed a...
In distributed query processing systems, load balancing plays an important role in maximizing system...
This article proposes a cache pattern with multi-queries and describes the multi-query optimization ...
In distributed query processing systems where caching infrastructure is distributed and scales with ...
Database systems access memory either sequentially or randomly. Contrary to sequential access and de...
One of the important characteristics of emerging multicores/manycores is the existence of 'shared on...
Abstract. Leveraging data in distributed caches for large scale query process-ing applications is be...
When data analysis applicationsare employed in a multi-client environment, a data server must servic...
Leveraging data in distributed caches for large scale query processing applications is becoming more...
Database systems frequently have to execute a batch of related queries. Multi-query optimization exp...
As a process executes on a CPU, it builds up state in that CPU's cache. In multiprogrammed work...