Enormous datasets are common today, and compressing them is often beneficial. The field of compressed data structures requires fast direct access to any element of the compressed data, which traditional compression methods do not easily support. Variable-byte encoding is a method for compressing integers of different byte lengths: it removes unused leading bytes and adds a continuation bit to each byte to denote whether the compressed integer continues into the next byte. An existing solution using a rank data structure performs well in this task. This thesis introduces an alternative solution using a select data structure and compares the two implementations. An experimentation is also...
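The following is a minimal Python sketch of the general idea, not the thesis implementation. It assumes that each integer is split into 7-bit chunks and that the continuation information is kept in a separate bit list, so that a select query over that list can locate the chunks of the i-th integer. The names `vbyte_encode`, `build_select`, and `access` are hypothetical, and the select index here is a plain list of positions; a real compressed data structure would use a succinct select structure instead.

```python
def vbyte_encode(values):
    """Split each non-negative integer into 7-bit chunks, least significant first.

    Returns (chunks, last), where last[j] == 1 marks the final chunk of an
    integer. Keeping the marker bits separate from the chunks is an assumption
    made for this sketch.
    """
    chunks, last = [], []
    for v in values:
        if v < 0:
            raise ValueError("only non-negative integers are supported")
        while True:
            chunks.append(v & 0x7F)      # low 7 bits of the remaining value
            v >>= 7
            last.append(0 if v else 1)   # 1 when no further chunks follow
            if not v:
                break
    return chunks, last


def build_select(last):
    """Naive select index: the positions of all 1-bits in `last`.

    ones[i] answers select1(last, i + 1). This stand-in uses far more space
    than a succinct select structure would.
    """
    return [j for j, bit in enumerate(last) if bit]


def access(chunks, ones, i):
    """Decode the i-th integer (0-based) without decoding its predecessors."""
    start = ones[i - 1] + 1 if i > 0 else 0  # first chunk of integer i
    end = ones[i]                            # last chunk of integer i
    value = 0
    for shift, j in enumerate(range(start, end + 1)):
        value |= chunks[j] << (7 * shift)
    return value


if __name__ == "__main__":
    values = [5, 300, 0, 2**20, 127, 128]
    chunks, last = vbyte_encode(values)
    ones = build_select(last)
    # Direct access to each element should reproduce the input.
    print([access(chunks, ones, i) for i in range(len(values))] == values)
```

In this sketch, access costs two lookups in the select index plus one step per chunk of the requested integer, which is what makes direct access independent of the element's position in the sequence.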
Data acquired over long periods of time like High Definition (HD) videos or re...
Context-based compression methods are the most powerful approaches to squeeze arbitrary textual data...
We present a new universal source code for distributions of unlabeled binary and ordinal trees that ...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representations for integer se...
The ubiquitous Variable-Byte encoding is one of the fastest compressed representations for integer se...
Lightweight integer compression algorithms are frequently applied in in-memory database systems to t...
We investigate the problem of variable-length compression with random access for stationary and ergo...
Arrays of integers are often compressed in search engines. Though there are many ways to compress in...
We consider practical implementations of compressed bitvectors, which support rank and select operat...
A data compression scheme that exploits locality of reference, such as occurs when words are used fr...
Lightweight integer compression algorithms play an important role in in-memory database systems to t...
In this thesis we address the problem of efficiency in Information Retrieval by pres...
We address the well-known problem of designing, implementing and experimenting compressed data struc...
We study a new technique for optimal data compression subject to conditions of causality and differe...
Data compression has always been an essential aspect of computing. In recent times, with the increas...