In modern large-scale distributed systems, analytics jobs submitted by different users often share similar work, for example scanning and processing the same subset of data. Optimizing jobs independently may result in redundant and wasteful processing; multi-query optimization techniques can instead be employed to save a considerable amount of cluster resources. In this work, we introduce a novel method that combines in-memory cache primitives with multi-query optimization to improve the efficiency of data-intensive, scalable computing frameworks. By carefully selecting and exploiting common (sub)expressions while satisfying memory constraints, our method transforms a batch of queries into a new, more efficient one which avoids unne...
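As a rough illustration of choosing which common (sub)expressions to cache under a memory constraint, the sketch below treats the choice as a 0/1 knapsack problem. This formulation is an assumption made for illustration only and is not taken from the abstract; the `Candidate` type, the `select_for_cache` function, and the size and saving estimates are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Candidate:
    """A hypothetical shared (sub)expression that could be cached."""
    name: str
    size: int    # estimated memory footprint, in arbitrary units (assumed known)
    saving: int  # estimated work saved if cached (assumed known)


def select_for_cache(candidates: List[Candidate], budget: int) -> Tuple[int, List[str]]:
    """Pick candidates maximizing total saving with total size <= budget.

    Classic 0/1 knapsack by dynamic programming over the memory budget.
    Sizes are assumed to be non-negative integers.
    """
    # best[b] holds (total saving, chosen names) for a capacity of b units.
    best: List[Tuple[int, Tuple[str, ...]]] = [(0, ())] * (budget + 1)
    for c in candidates:
        # Iterate capacities downwards so each candidate is used at most once.
        for b in range(budget, c.size - 1, -1):
            saving = best[b - c.size][0] + c.saving
            if saving > best[b][0]:
                best[b] = (saving, best[b - c.size][1] + (c.name,))
    total, names = best[budget]
    return total, list(names)
```

For example, with hypothetical candidates `scan_A` (size 4, saving 10), `join_AB` (size 3, saving 7) and `filter_C` (size 2, saving 4) and a budget of 5 units, the function prefers caching `join_AB` and `filter_C` together (saving 11) over `scan_A` alone (saving 10).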
The primary goal of the database system is to provide the user with convenient and efficient access to ...
This work investigates the leverage that can be obtained from compiler optimization techniques for ...
Queries with common sequences of disk accesses can make maximal use of a buffer pool. We developed a...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
MQO is a distributed multiple query processing middleware that can optimize query processing for da...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SM...
One of the important characteristics of emerging multicores/manycores is the existence of 'shared on...
This article proposes a cache pattern with multi-queries and describes the multi-query optimization ...
MQO is a distributed multiple query processing middleware that can use resources available on the Gr...
Database systems frequently have to execute a batch of related queries. Multi-query optimization exp...
In distributed query processing systems, load balancing plays an important role in maximizing system...
In distributed query processing systems where caching infrastructure is distributed and scales with ...
This paper proposes a novel idea, called MiniTasking, to reduce the number of cache misses ...
With the emergence of new data management systems (DMS) in the context of big data and cloud computing, ...
This paper proposes a novel idea, called MiniTasking, to reduce the number of cache misses by improvi...