Compression can sometimes improve performance by making more of the data available to the processors faster. We consider the compression of integer keys in a B+-tree index. For this purpose, systems such as IBM DB2 use variable-byte compression over differentially coded keys. We revisit this problem with various compression alternatives such as Google's VarIntGB, Binary Packing and Frame-of-Reference. In all cases, we describe algorithms that can operate directly on compressed data. Many of our alternatives exploit the single-instruction-multiple-data (SIMD) instructions supported by modern CPUs. We evaluate our techniques in a database environment provided by Upscaledb, a production-quality key-value database. Our best techniques are SIMD ...
Sorted lists of integers are commonly used in inverted in-dexes and database systems. They are often...
Abstract. The B-tree and its variants have been widely applied in many data management fields. When ...
Compression reduces both the size of indexes and the time needed to evaluate queries. In this paper,...
Columnar databases have dominated the data analysis market for their superior performance in query p...
Arrays of integers are often compressed in search engines. Though there are many ways to compress in...
Sorted lists of integers are commonly used in inverted indexes and database systems. They are often ...
Many database applications make extensive use of bitmap indexing schemes. In this paper, we study h...
We study algorithms for efficient compression and decompression of a sequence of integers on modern ...
We study algorithms for efficient compression and decompression of a sequence of integers on modern ...
Methods are discussed to enhance the efficiency and speed of data compression techniques in DBMS (da...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
Data compression is one way to gain better performance from a database. Compression is typically ach...
Many database applications make extensive use of bitmap indexing schemes. In this paper, we study ho...
Compressed bitmap indices are heavily used in scientific and commercial database systems because the...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
Sorted lists of integers are commonly used in inverted in-dexes and database systems. They are often...
Abstract. The B-tree and its variants have been widely applied in many data management fields. When ...
Compression reduces both the size of indexes and the time needed to evaluate queries. In this paper,...
Columnar databases have dominated the data analysis market for their superior performance in query p...
Arrays of integers are often compressed in search engines. Though there are many ways to compress in...
Sorted lists of integers are commonly used in inverted indexes and database systems. They are often ...
Many database applications make extensive use of bitmap indexing schemes. In this paper, we study h...
We study algorithms for efficient compression and decompression of a sequence of integers on modern ...
We study algorithms for efficient compression and decompression of a sequence of integers on modern ...
Methods are discussed to enhance the efficiency and speed of data compression techniques in DBMS (da...
Domain encoding is a common technique to compress the columns of a column store and to accelerate ma...
Data compression is one way to gain better performance from a database. Compression is typically ach...
Many database applications make extensive use of bitmap indexing schemes. In this paper, we study ho...
Compressed bitmap indices are heavily used in scientific and commercial database systems because the...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representation for integer se...
Sorted lists of integers are commonly used in inverted in-dexes and database systems. They are often...
Abstract. The B-tree and its variants have been widely applied in many data management fields. When ...
Compression reduces both the size of indexes and the time needed to evaluate queries. In this paper,...