Data compression techniques, both lossless and lossy, are widely used in big data analytics applications in domains such as healthcare, transportation, and finance. Their main benefit is reduced storage cost. However, running analytic queries on compressed data poses two major challenges, one of performance and one of accuracy: (i) decompressing the data may degrade query performance, and (ii) if lossy compression is used, the returned answers are not exact. In this dissertation, we study how to accelerate analytic queries over compressed data (and provide tight error guarantees for approximate answers if lossy data compression methods ...
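The two challenges named in the abstract above can be illustrated with a minimal sketch. It is not the dissertation's method: run-length encoding and uniform quantization are stand-in techniques chosen for brevity, and every function name here (`rle_encode`, `rle_sum`, `quantize`, `approx_sum`) is hypothetical. The first half evaluates a SUM directly on a losslessly compressed column without decompressing it; the second half answers a SUM over lossily compressed values and reports a worst-case error bound.

```python
from itertools import groupby


def rle_encode(values):
    """Lossless run-length encoding: a list of (value, run_length) pairs."""
    return [(v, sum(1 for _ in run)) for v, run in groupby(values)]


def rle_sum(runs):
    """SUM evaluated on the runs themselves, without expanding them."""
    return sum(v * n for v, n in runs)


def quantize(values, step):
    """Lossy compression (illustrative): store each value as the index
    of the nearest multiple of `step`."""
    return [round(v / step) for v in values]


def approx_sum(codes, step):
    """Approximate SUM over quantized codes plus a worst-case error bound.

    Each reconstructed value differs from the original by at most
    step / 2, so the sum differs by at most len(codes) * step / 2.
    """
    estimate = step * sum(codes)
    bound = len(codes) * step / 2
    return estimate, bound


if __name__ == "__main__":
    column = [5, 5, 5, 2, 2, 9]
    runs = rle_encode(column)
    print(runs, rle_sum(runs))

    readings = [1.2, 3.7, 4.1]
    estimate, bound = approx_sum(quantize(readings, 0.5), 0.5)
    print(estimate, "+/-", bound)
```

The lossless query touches one entry per run rather than one per row, which is where the speedup would come from on highly repetitive columns. The bound in the lossy case is deliberately loose; it depends only on the row count and the quantization step, not on the data.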
A fast response is critical in many data-intensive applications, including knowledge discovery analy...
Modern columnar databases heavily use compression to reduce memory footprint and boost query executi...
Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk ...
Decision-support applications in emerging environments require that SQL query results or intermediat...
Decision-support applications in emerging environments require that entire SQL query results be ship...
Columnar databases have dominated the data analysis market for their superior performance in query p...
Evolving customer requirements and increasing competition force business organizations to store incr...
The last few years have seen an exponential increase, driven by many disparate fields such as big da...
Today, we collect a large amount of data, and the volume of the data we collect is projected to grow...
The Data Cube is the central abstraction behind the power of On-Line Analytical Processing (OLAP) sy...
Efficient query processing in statistical databases is constrained by the I/O bottleneck problem bec...
Low-latency, high-throughput systems for serving interactive queries are crucial to today's web serv...
Data compression is one way to gain better performance from a database. Compression is typically ach...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
In the era of big data, computing exact answers to analytical queries becomes prohibitively expensiv...