© 2017 Association for Computing Machinery. Data partitioning is crucial to improving query performance and severalworkload-based partitioning techniques have been proposed in database literature. However, many modern analytic applications involve ad-hoc or exploratory analysis where users do not have a representative query workload a priori. Static workload-based data partitioning techniques are therefore not suitable for such settings. In this paper, we propose Amoeba, a distributed storage system that uses adaptive multi-attribute data partitioning to efficiently support ad-hoc as well as recurring queries. Amoeba requires zero set-up and tuning effort, allowing analysts to get the benefits of partitioning without requiring an upfront qu...
Abstract. Vertical and Horizontal partitions allow database adminis-trators (DBAs) to considerably i...
Physical database design is important for query performance in a shared-nothing parallel database sy...
The performance of the execution of an analytical workload critically impacts the speed at which com...
Data partitioning significantly improves the query performance in distributed database systems. A la...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Big data analytics often involves complex join queries over two or more tables. Such join process...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
International audienceApplications with very large databases, where data items are continuously appe...
International audienceApplications with very large databases, where data items are continuously appe...
We propose to demonstrate a fine-grained partitioning frame-work that reorganizes the data tuples in...
For a storage system to keep pace with increasing amounts of data, a natural solution is to deploy m...
International audienceOLAP queries are typically heavy-weight and ad-hoc thus requiring high storage...
International audienceOLAP queries are typically heavy-weight and ad-hoc thus requiring high storage...
In an increasing number of use cases, databases face the challenge of managing irregularly structure...
Abstract. Vertical and Horizontal partitions allow database adminis-trators (DBAs) to considerably i...
Physical database design is important for query performance in a shared-nothing parallel database sy...
The performance of the execution of an analytical workload critically impacts the speed at which com...
Data partitioning significantly improves the query performance in distributed database systems. A la...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Big data analytics often involves complex join queries over two or more tables. Such join process...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
International audienceApplications with very large databases, where data items are continuously appe...
International audienceApplications with very large databases, where data items are continuously appe...
We propose to demonstrate a fine-grained partitioning frame-work that reorganizes the data tuples in...
For a storage system to keep pace with increasing amounts of data, a natural solution is to deploy m...
International audienceOLAP queries are typically heavy-weight and ad-hoc thus requiring high storage...
International audienceOLAP queries are typically heavy-weight and ad-hoc thus requiring high storage...
In an increasing number of use cases, databases face the challenge of managing irregularly structure...
Abstract. Vertical and Horizontal partitions allow database adminis-trators (DBAs) to considerably i...
Physical database design is important for query performance in a shared-nothing parallel database sy...
The performance of the execution of an analytical workload critically impacts the speed at which com...