Information theory can be used to assess how efficiently a message is transmitted on the basis of different symbolic systems. In this paper, I estimate the information-theoretic efficiency of written language for parallel text data in more than 1000 different languages, both on the level of characters and on the level of words as information encoding units. The main results show that (i) the median efficiency is ∼29% on the character level and ∼45% on the word level, (ii) efficiency on both levels is strongly correlated with each other and (iii) efficiency tends to be higher for languages with more speakers
We review some recent progress on the characterisation of long-range patterns of word use in languag...
<p>A) Information in word distribution as a function of the scale for the Voynich manuscript compare...
<p>Efficiency is nearly inversely proportional to complexity over a nearly hundred-fold range. The h...
Written language is a complex communication signal capable of conveying information encoded in the f...
One of the fundamental questions about human language is whether all languages are equally complex. ...
We use n-gram language models to investigate how far language approximates an optimal code for human...
All living beings try to save effort, and humans are no exception. This groundbreaking book shows ho...
This study develops a probabilistic theory of efficiency in natural language. The first part is theo...
The procedure that predicts the mean information per letter in a long text by adding the constraint ...
We demonstrate a substantial improvement on one of the most celebrated empirical laws in the study o...
Languages employ different strategies to transmit structural and grammatical information. While, for...
Languages employ different strategies to transmit structural and grammatical information. While, for...
The choice associated with words is a fundamental property of natural languages. It lies at the hear...
The choice associated with words is a fundamental property of natural languages. It lies at the hear...
We provide evidence for a rational account of language pro-duction, Uniform Information Density (UID...
We review some recent progress on the characterisation of long-range patterns of word use in languag...
<p>A) Information in word distribution as a function of the scale for the Voynich manuscript compare...
<p>Efficiency is nearly inversely proportional to complexity over a nearly hundred-fold range. The h...
Written language is a complex communication signal capable of conveying information encoded in the f...
One of the fundamental questions about human language is whether all languages are equally complex. ...
We use n-gram language models to investigate how far language approximates an optimal code for human...
All living beings try to save effort, and humans are no exception. This groundbreaking book shows ho...
This study develops a probabilistic theory of efficiency in natural language. The first part is theo...
The procedure that predicts the mean information per letter in a long text by adding the constraint ...
We demonstrate a substantial improvement on one of the most celebrated empirical laws in the study o...
Languages employ different strategies to transmit structural and grammatical information. While, for...
Languages employ different strategies to transmit structural and grammatical information. While, for...
The choice associated with words is a fundamental property of natural languages. It lies at the hear...
The choice associated with words is a fundamental property of natural languages. It lies at the hear...
We provide evidence for a rational account of language pro-duction, Uniform Information Density (UID...
We review some recent progress on the characterisation of long-range patterns of word use in languag...
<p>A) Information in word distribution as a function of the scale for the Voynich manuscript compare...
<p>Efficiency is nearly inversely proportional to complexity over a nearly hundred-fold range. The h...