Compression algorithms are important for data oriented tasks, especially in the era of Big Data. Modern processors equipped with powerful SIMD instruction sets, provide us an opportunity for achieving better compression performance. Previous research has shown that SIMD-based optimizations can multiply decoding speeds. Following these pioneering studies, we propose a general approach to accelerate compression algorithms. By instantiating the approach, we have developed several novel integer compression algorithms, called Group-Simple, Group-Scheme, Group-AFOR, and Group-PFD, and implemented their corresponding vectorized versions. We evaluate the proposed algorithms on two public TREC datasets, a Wikipedia dataset and a Twitter dataset. Wit...
Data movement has long been identified as the biggest challenge facing modern computer systems' desi...
We give a detailed algorithm for fast text compression. Our algorithm, related to the PPM method, si...
Data compression is the reduction of redundancy in data representation in order to decrease storage ...
Sorted lists of integers are commonly used in inverted indexes and database systems. They are often ...
We study algorithms for efficient compression and decompression of a sequence of integers on modern ...
We study algorithms for efficient compression and decompression of a sequence of integers on modern ...
Sorted lists of integers are commonly used in inverted in-dexes and database systems. They are often...
In many important applications—such as search engines and relational database systems—data are store...
Arrays of integers are often compressed in search engines. Though there are many ways to compress in...
The three generations of postings list compression strategies (Var-iable Byte Encoding, Word Aligned...
Compression can sometimes improve performance by making more of the data available to the processors...
Lightweight integer compression algorithms are frequently applied in in-memory database systems to t...
Today\u27s scientific simulations require a significant reduction of the data size because of extrem...
International audienceThe development of next-generation sequencing (NGS) technology presents a cons...
Data Compression is today essential for a wide range of applications: for example Internet and the W...
Data movement has long been identified as the biggest challenge facing modern computer systems' desi...
We give a detailed algorithm for fast text compression. Our algorithm, related to the PPM method, si...
Data compression is the reduction of redundancy in data representation in order to decrease storage ...
Sorted lists of integers are commonly used in inverted indexes and database systems. They are often ...
We study algorithms for efficient compression and decompression of a sequence of integers on modern ...
We study algorithms for efficient compression and decompression of a sequence of integers on modern ...
Sorted lists of integers are commonly used in inverted in-dexes and database systems. They are often...
In many important applications—such as search engines and relational database systems—data are store...
Arrays of integers are often compressed in search engines. Though there are many ways to compress in...
The three generations of postings list compression strategies (Var-iable Byte Encoding, Word Aligned...
Compression can sometimes improve performance by making more of the data available to the processors...
Lightweight integer compression algorithms are frequently applied in in-memory database systems to t...
Today\u27s scientific simulations require a significant reduction of the data size because of extrem...
International audienceThe development of next-generation sequencing (NGS) technology presents a cons...
Data Compression is today essential for a wide range of applications: for example Internet and the W...
Data movement has long been identified as the biggest challenge facing modern computer systems' desi...
We give a detailed algorithm for fast text compression. Our algorithm, related to the PPM method, si...
Data compression is the reduction of redundancy in data representation in order to decrease storage ...