Today’s data deluge enables organizations to collect massive data, and analyze it with an ever-increasing number of concurrent queries. Traditional data warehouses (DW) face a challenging problem in executing this task, due to their query-centric model: each query is optimized and executed independently. This model results in high contention for resources. Thus, modern DW depart from the query-centric model to execution models involving sharing of common data and work. Our goal is to show when and how a DW should employ sharing. We evaluate experimentally two sharing methodologies, based on their original prototype systems, that exploit work sharing opportunities among concurrent queries at run-time: Simultaneous Pipelining (SP), which sh...
Relational DBMS typically execute concurrent queries independently by invoking a set of operator ins...
MapReduce is a popular model of executing time-consuming analytical queries as a batch of tasks on l...
Today an ever-increasing amount of data is collected and analyzed by researchers, businesses, and sc...
Intuitively, aggressive work sharing among concurrent queries in a database system should always imp...
Conventional data warehouses employ the query-at-a-time model, which maps each query to a distinct p...
Current systems for data-parallel, incremental processing and view maintenance over high-rate stream...
Today, an ever-increasing number of researchers, businesses, and data scientists collect and analyze...
Database systems frequently have to execute a set of related queries, which share several common sub...
An emerging challenge in modern distributed querying is to efficiently process mult...
Optimization of join queries based on average selectivities is suboptimal in highly correlated datab...
This paper proposes a novel idea, called MiniTasking, to reduce the number of cache misses by improvi...
In this paper, we present data threaded execution, a new strategy to exploit both pipelining and in...
This paper analyzes the performance of concurrent (index) scan operations in both record (NSM/PAX) a...