Information theorists maintain that typical English-language text is approximately 75 per cent predictable; for example, in a text-reconstruction experiment carried out by Claude Shannon, a subject presented with various incomplete sentences was able to guess the next letter correctly in 79 out of 102 attempts. However, if one attempts to retain the sense of a sentence after weeding out letters according to some predetermined rule, one cannot achieve such an extreme limit; in fact, the removal of even half the text is likely to introduce severe difficulties. Although Shannon cites an experiment in which six subjects restored an average of 93 per cent of the 50-per-cent-deleted FCTSSTRNGRHNFCTN, surely the triteness of the phrase made it mor...
TR-COSC 03/90The world-wide use of digital storage and communications devices is increasing the nee...
A text written using symbols from a given alphabet can be compressed using the Huffman code, which m...
It is known that the entropy of English text can be reduced by arranging the text into groups of two...
The ability of human operators to correct mutilations in printed English texts was studied for a var...
The goal of this paper is to show the dependency of the entropy of English text on the subject of th...
The procedure that predicts the mean information per letter in a long text by adding the constraint ...
In Compression of English Text in the May 1982 Word Ways, the editor showed how one can sometimes ...
Based on data from a large-scale experiment with human subjects, we conclude that the logarithm of...
In his work on the information content of English text in 1951, Shannon described a method of recodi...
The increasing practice of comitting large amounts of text, such as dictionaries and thesauri, to co...
Shannon estimates the entropy of the set of words in printed English as 11.82 bits per word. As this...
It is shown that optimal text compression is a harder problem than artificial intelligence as define...
We address the problem of improving the efficiency of natural language text input under degraded con...
Traditional methods for efficient text entry are based on prediction. Prediction requires a constant...
For many years, computational linguists have studied the statistical behavior of language -- the dis...
TR-COSC 03/90The world-wide use of digital storage and communications devices is increasing the nee...
A text written using symbols from a given alphabet can be compressed using the Huffman code, which m...
It is known that the entropy of English text can be reduced by arranging the text into groups of two...
The ability of human operators to correct mutilations in printed English texts was studied for a var...
The goal of this paper is to show the dependency of the entropy of English text on the subject of th...
The procedure that predicts the mean information per letter in a long text by adding the constraint ...
In Compression of English Text in the May 1982 Word Ways, the editor showed how one can sometimes ...
Based on data from a large-scale experiment with human subjects, we conclude that the logarithm of...
In his work on the information content of English text in 1951, Shannon described a method of recodi...
The increasing practice of comitting large amounts of text, such as dictionaries and thesauri, to co...
Shannon estimates the entropy of the set of words in printed English as 11.82 bits per word. As this...
It is shown that optimal text compression is a harder problem than artificial intelligence as define...
We address the problem of improving the efficiency of natural language text input under degraded con...
Traditional methods for efficient text entry are based on prediction. Prediction requires a constant...
For many years, computational linguists have studied the statistical behavior of language -- the dis...
TR-COSC 03/90The world-wide use of digital storage and communications devices is increasing the nee...
A text written using symbols from a given alphabet can be compressed using the Huffman code, which m...
It is known that the entropy of English text can be reduced by arranging the text into groups of two...