Big data analytics often involves complex join queries over two or more tables. Such join processing is expensive in a distributed setting both because large amounts of data must be read from disk, and because of data shuffling across the network. Many techniques based on data partitioning have been proposed to reduce the amount of data that must be accessed, often focusing on finding the best partitioning scheme for a particular workload, rather than adapting to changes in the workload over time. In this paper, we present AdaptDB, an adaptive storage manager for analytical database workloads in a distributed setting. It works by partitioning datasets across a cluster and incrementally refining data partitioning as queries are run. Ada...
The performance of parallel data analytics systems becomes increasingly important with the rise of B...
The performance of parallel data analytics systems becomes increasingly important with the rise of B...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
© 2017 Association for Computing Machinery. Data partitioning is crucial to improving query performa...
Data partitioning significantly improves the query performance in distributed database systems. A la...
Distributed database systems are widely used to provide scalable storage, update and query facilitie...
Analytical workloads in data warehouses often include heavy joins where queries involve multiple fac...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
The standard way to get linear scaling in a distributed OLTP DBMS is to horizontally partition data ...
We present Schism, a novel workload-aware approach for database partitioning and replication designe...
The publication of machine-readable information has been significantly increasing both in the magnit...
Conventional data warehouses employ the query-at-a-time model, which maps each query to a distinct p...
The performance of parallel data analytics systems becomes increasingly important with the rise of B...
The performance of parallel data analytics systems becomes increasingly important with the rise of B...
The performance of parallel data analytics systems becomes increasingly important with the rise of B...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
© 2017 Association for Computing Machinery. Data partitioning is crucial to improving query performa...
Data partitioning significantly improves the query performance in distributed database systems. A la...
Distributed database systems are widely used to provide scalable storage, update and query facilitie...
Analytical workloads in data warehouses often include heavy joins where queries involve multiple fac...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
The standard way to get linear scaling in a distributed OLTP DBMS is to horizontally partition data ...
We present Schism, a novel workload-aware approach for database partitioning and replication designe...
The publication of machine-readable information has been significantly increasing both in the magnit...
Conventional data warehouses employ the query-at-a-time model, which maps each query to a distinct p...
The performance of parallel data analytics systems becomes increasingly important with the rise of B...
The performance of parallel data analytics systems becomes increasingly important with the rise of B...
The performance of parallel data analytics systems becomes increasingly important with the rise of B...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...