Semistatic byte-oriented word-based compression codes have been shown to be an attractive alternative to compress natural language text databases, because of the combination of speed, effectiveness, and direct searchability they offer. In particular, our recently proposed family of dense compression codes has been shown to be superior to the more traditional byte-oriented word-based Huffman codes in most aspects. In this paper, we focus on the problem of transmitting texts among peers that do not share the vocabulary. This is the typical scenario for adaptive compression methods. We design adaptive variants of our semistatic dense codes, showing that they are much simpler and faster than dynamic Huffman codes and reach almost the same compr...
AbstractIn this paper we present the adaptation of a compression technique, specially designed to co...
Word-based context models for text compression have the capacity to outperform more simple character...
International audienceWe give a new text compression scheme based on Forbidden Words ("antidictionar...
Semistatic byte-oriented word-based compression codes have been shown to be an attractive alternativ...
We address the problem of adaptive compression of natural language text, focusing on the case where ...
Semistatic word-based byte-oriented compressors are known to be attractive alternatives to compress ...
Semistatic word-based byte-oriented compression codes are known to be attractive alternatives to com...
This work presents (s, c)-Dense Code, a new method for compressing natural language texts. This tec...
[Abstract] Text databases are growing in the last years due to the widespread use of digital librar...
An algorithm for very efficient compression of a set of natural language text files is presented. No...
An algorithm for very efficient compression of a set of natural language text files is presented. No...
AbstractWe adapted Word-based Tagged Code (WBTC) to obtain its dynamic version. The aim of designing...
Text compression over alphabet of words or syllables brings up a new concern to deal with - the alph...
. A new text compression scheme is presented in this paper. The main purpose of this scheme is to sp...
Dictionary-based compression algorithms include a parsing strategy to transform the input text into ...
AbstractIn this paper we present the adaptation of a compression technique, specially designed to co...
Word-based context models for text compression have the capacity to outperform more simple character...
International audienceWe give a new text compression scheme based on Forbidden Words ("antidictionar...
Semistatic byte-oriented word-based compression codes have been shown to be an attractive alternativ...
We address the problem of adaptive compression of natural language text, focusing on the case where ...
Semistatic word-based byte-oriented compressors are known to be attractive alternatives to compress ...
Semistatic word-based byte-oriented compression codes are known to be attractive alternatives to com...
This work presents (s, c)-Dense Code, a new method for compressing natural language texts. This tec...
[Abstract] Text databases are growing in the last years due to the widespread use of digital librar...
An algorithm for very efficient compression of a set of natural language text files is presented. No...
An algorithm for very efficient compression of a set of natural language text files is presented. No...
AbstractWe adapted Word-based Tagged Code (WBTC) to obtain its dynamic version. The aim of designing...
Text compression over alphabet of words or syllables brings up a new concern to deal with - the alph...
. A new text compression scheme is presented in this paper. The main purpose of this scheme is to sp...
Dictionary-based compression algorithms include a parsing strategy to transform the input text into ...
AbstractIn this paper we present the adaptation of a compression technique, specially designed to co...
Word-based context models for text compression have the capacity to outperform more simple character...
International audienceWe give a new text compression scheme based on Forbidden Words ("antidictionar...