Decision-support applications in emerging environments require that SQL query results or intermediate results are shipped to clients for further analysis and presentation. These clients may use low bandwidth connections or have severe memory restrictions. Consequently, there is a need to compress the results of a query for efficient transfer and client-side access. This paper explores a variety of techniques that address this issue. We present a framework to represent "compression plans" formed by composing primitive compression operators. We also present optimization algorithms that enumerate valid compression plans and choose an optimal plan. The factors that influence this choice include statistical and semantic information on ...
While a variety of lossy compression schemes have been developed for certain forms of digital data (...
Abstract. Evaluating a query can involve manipulation of large vol-umes of temporary data. When the ...
Abstract—Bitmap indices are widely used for large read-only repositories in data warehouses and scie...
Decision-support applications in emerging environments require that entire SQL query results be ship...
M.S. University of Hawaii at Manoa 2012.Includes bibliographical references.The proliferation of lig...
Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk ...
Data compression is one way to gain better performance from a database. Compression is typically ach...
Modern columnar databases heavily use compression to reduce memory footprint and boost query executi...
Data compression techniques (both lossless and lossy compression methods) are widely utilized in big...
Columnar databases have dominated the data analysis market for their superior performance in query p...
In this demo, we present MorphStore, an in-memory column store with a novel compression-aware query ...
Efficient query processing in statistical databases is constrained by the I/O bottleneck problem bec...
Column-oriented database system architectures invite a reevaluation of how and when data in database...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
Abstract Bit-vectors are widely used for indexing and summarizing data due to their efficient proces...
While a variety of lossy compression schemes have been developed for certain forms of digital data (...
Abstract. Evaluating a query can involve manipulation of large vol-umes of temporary data. When the ...
Abstract—Bitmap indices are widely used for large read-only repositories in data warehouses and scie...
Decision-support applications in emerging environments require that entire SQL query results be ship...
M.S. University of Hawaii at Manoa 2012.Includes bibliographical references.The proliferation of lig...
Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk ...
Data compression is one way to gain better performance from a database. Compression is typically ach...
Modern columnar databases heavily use compression to reduce memory footprint and boost query executi...
Data compression techniques (both lossless and lossy compression methods) are widely utilized in big...
Columnar databases have dominated the data analysis market for their superior performance in query p...
In this demo, we present MorphStore, an in-memory column store with a novel compression-aware query ...
Efficient query processing in statistical databases is constrained by the I/O bottleneck problem bec...
Column-oriented database system architectures invite a reevaluation of how and when data in database...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
Abstract Bit-vectors are widely used for indexing and summarizing data due to their efficient proces...
While a variety of lossy compression schemes have been developed for certain forms of digital data (...
Abstract. Evaluating a query can involve manipulation of large vol-umes of temporary data. When the ...
Abstract—Bitmap indices are widely used for large read-only repositories in data warehouses and scie...