Recently, it was demonstrated that generalized entropies of order α, where α is a free parameter, offer novel and important opportunities to quantify the similarity of symbol sequences. Varying this parameter makes it possible to magnify differences between texts at specific scales of the corresponding word frequency spectrum. For the analysis of the statistical properties of natural languages, this is especially interesting because textual data are characterized by Zipf’s law, i.e., there are very few word types that occur very often (e.g., function words expressing grammatical relationships) and many word types with a very low frequency (e.g., content words carrying most of the meaning of a sentence). Here, this approach is syste...
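To make the role of α concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not state a formula, so the code assumes the Tsallis-type form H_α(p) = (Σ p_i^α − 1) / (1 − α), with the Shannon entropy (in nats) as the limit α → 1, and a Jensen-Shannon-style divergence built from it. The function names `word_distribution`, `generalized_entropy`, and `generalized_jsd` are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def word_distribution(tokens):
    """Relative frequency of each word type in a list of tokens."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def generalized_entropy(p, alpha):
    """Assumed form: H_alpha(p) = (sum_i p_i**alpha - 1) / (1 - alpha).

    For alpha close to 1 the Shannon entropy (natural log) is returned,
    which is the limit of the expression above.
    """
    probs = [x for x in p.values() if x > 0]
    if math.isclose(alpha, 1.0):
        return -sum(x * math.log(x) for x in probs)
    return (sum(x ** alpha for x in probs) - 1.0) / (1.0 - alpha)


def generalized_jsd(p, q, alpha):
    """Assumed divergence: H_alpha((p + q) / 2) - (H_alpha(p) + H_alpha(q)) / 2."""
    vocab = set(p) | set(q)
    mix = {w: 0.5 * (p.get(w, 0.0) + q.get(w, 0.0)) for w in vocab}
    return (generalized_entropy(mix, alpha)
            - 0.5 * generalized_entropy(p, alpha)
            - 0.5 * generalized_entropy(q, alpha))


if __name__ == "__main__":
    # Toy texts, invented purely for illustration.
    text_a = "the cat sat on the mat and the dog sat on the rug".split()
    text_b = "the fox ran over the hill and the hare ran under the hedge".split()
    p, q = word_distribution(text_a), word_distribution(text_b)
    for alpha in (0.5, 1.0, 2.0):
        print(alpha, generalized_jsd(p, q, alpha))
```

Under this assumed form, a larger α raises the probabilities to a higher power, so the few high-frequency word types dominate the sum, whereas a smaller α gives more weight to the many low-frequency types.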
The recent dramatic increase in online data availability has allowed researchers to explore human...
The present paper discusses the benefits and challenges of token-based typology, which takes into ac...
Written text is one of the fundamental manifestations of human language, and the study of its univer...
The choice associated with words is a fundamental property of natural languages. It lies at the hear...
Natural language is a remarkable example of a complex dynamical system which combines variation and ...
We show how generalized Gibbs-Shannon entropies can provide new insights on the statistical properti...
We estimate the n-gram entropies of natural language texts in word-length representation and find th...
Quantifying the similarity between symbolic sequences is a traditional problem in information theory...
We analyze the occurrence frequencies of over 15 million words recorded in millions of books publish...
© 2014 The Author(s) Published by the Royal Society. All rights reserved. The frequency with which w...
The relationship between the entropy of language and its complexity has been the subject of much spe...