An algorithm for very efficient compression of a set of natural language text files is presented. Not only a very good compression ratio is reached, the used compression method allows fast pattern matching in compressed text, which is an attractive property especially for search engines. Much information is stored in the form of a large collection of text files. The web search engines can store the web pages in the raw text form to build so-called snippets or to perform so-called positional ranking functions on them. Furthermore, there exist many other similar contexts such as the storage of emails, application logs or the databases of text files (literary works or technical reports). In this paper, we address the problem of the compression...
Nowadays we know how to effectively compress most basic components of any modern search engine, such...
Nowadays we know how to effectively compress most ba-sic components of any modern search engine, suc...
In recent times, we have witnessed an unprecedented growth of textual information via the Internet, ...
An algorithm for very efficient compression of a set of natural language text files is presented. No...
. A new text compression scheme is presented in this paper. The main purpose of this scheme is to sp...
Semistatic byte-oriented word-based compression codes have been shown to be an attractive alternativ...
Semistatic word-based byte-oriented compressors are known to be attractive alternatives to compress ...
Semistatic word-based byte-oriented compression codes are known to be attractive alternatives to com...
We address the problem of adaptive compression of natural language text, focusing on the case where ...
The present chapter describes a few standard algorithms used for processing texts. They apply, for.....
The idea of using data compression algorithms for machine learning has been reinvented many times. I...
This work presents (s, c)-Dense Code, a new method for compressing natural language texts. This tec...
In this Ph. D. Thesis we investigate several data compression methods on text in natural language. O...
AbstractIn this paper we present the adaptation of a compression technique, specially designed to co...
EGOTHOR is a search engine that indexes the Web and allows us to search the Web documents. Its hit l...
Nowadays we know how to effectively compress most basic components of any modern search engine, such...
Nowadays we know how to effectively compress most ba-sic components of any modern search engine, suc...
In recent times, we have witnessed an unprecedented growth of textual information via the Internet, ...
An algorithm for very efficient compression of a set of natural language text files is presented. No...
. A new text compression scheme is presented in this paper. The main purpose of this scheme is to sp...
Semistatic byte-oriented word-based compression codes have been shown to be an attractive alternativ...
Semistatic word-based byte-oriented compressors are known to be attractive alternatives to compress ...
Semistatic word-based byte-oriented compression codes are known to be attractive alternatives to com...
We address the problem of adaptive compression of natural language text, focusing on the case where ...
The present chapter describes a few standard algorithms used for processing texts. They apply, for.....
The idea of using data compression algorithms for machine learning has been reinvented many times. I...
This work presents (s, c)-Dense Code, a new method for compressing natural language texts. This tec...
In this Ph. D. Thesis we investigate several data compression methods on text in natural language. O...
AbstractIn this paper we present the adaptation of a compression technique, specially designed to co...
EGOTHOR is a search engine that indexes the Web and allows us to search the Web documents. Its hit l...
Nowadays we know how to effectively compress most basic components of any modern search engine, such...
Nowadays we know how to effectively compress most ba-sic components of any modern search engine, suc...
In recent times, we have witnessed an unprecedented growth of textual information via the Internet, ...