Data-intensive analytical applications need to support both efficient reads and writes. However, what is usually a good data layout for an update-heavy workload, is not well-suited for a read-mostly one and vice versa. Modern analytical data systems rely on columnar layouts and employ delta stores to inject new data and updates. We show that for hybrid workloads we can achieve close to one order of magnitude better performance by tailoring the column layout design to the data and query workload. Our approach navigates the possible design space of the physical layout: it organizes each column’s data by determining the number of partitions, their corresponding sizes and ranges, and the amount of buffer space and how it is allocated. We frame ...
In the last two decades, relational databases for analytics have been specialized to address the nee...
Disk drives are the bottleneck in the processing of large amounts of data used in almost all common ...
Optimal configuration is vital for a DataBase Management System (DBMS) to achieve high performance. ...
Data-intensive analytical applications need to support both efficient reads and writes. However, wh...
Ad-hoc analysis implies processing data in near real-time. Thus, raw data (i.e., neither normalized ...
Modern storage systems are complex. Simple direct-attached storage devices are giving way to storag...
Modern state-of-the-art database systems are designed around a single data storage layout. This is a...
With the SAP HANA database, SAP offers a high-perfor-mance in-memory hybrid-store database. Hybrid-s...
We investigate techniques that marry the high readonly analytical query performance of compressed, r...
Today’s database management systems offer numerous tuning knobs that allow an adaptation of database...
In recent years, big data applications often involve dealing with diverse datasets in terms of struc...
<p>Modern industrial, government, and academic organizations are collecting massive amounts of data ...
htmlabstractMany applications with manually implemented data management exhibit a data storage patte...
Nowadays, database optimizers take advantage of indexes and materialized views (MVs) to produce quer...
Distributed processing frameworks process data in parallel by dividing it into multiple partitions a...
In the last two decades, relational databases for analytics have been specialized to address the nee...
Disk drives are the bottleneck in the processing of large amounts of data used in almost all common ...
Optimal configuration is vital for a DataBase Management System (DBMS) to achieve high performance. ...
Data-intensive analytical applications need to support both efficient reads and writes. However, wh...
Ad-hoc analysis implies processing data in near real-time. Thus, raw data (i.e., neither normalized ...
Modern storage systems are complex. Simple direct-attached storage devices are giving way to storag...
Modern state-of-the-art database systems are designed around a single data storage layout. This is a...
With the SAP HANA database, SAP offers a high-perfor-mance in-memory hybrid-store database. Hybrid-s...
We investigate techniques that marry the high readonly analytical query performance of compressed, r...
Today’s database management systems offer numerous tuning knobs that allow an adaptation of database...
In recent years, big data applications often involve dealing with diverse datasets in terms of struc...
<p>Modern industrial, government, and academic organizations are collecting massive amounts of data ...
htmlabstractMany applications with manually implemented data management exhibit a data storage patte...
Nowadays, database optimizers take advantage of indexes and materialized views (MVs) to produce quer...
Distributed processing frameworks process data in parallel by dividing it into multiple partitions a...
In the last two decades, relational databases for analytics have been specialized to address the nee...
Disk drives are the bottleneck in the processing of large amounts of data used in almost all common ...
Optimal configuration is vital for a DataBase Management System (DBMS) to achieve high performance. ...