Zipf’s law is a fundamental paradigm in the statistics of written and spoken natural language as well as in other communication systems. We raise the question of the elementary units for which Zipf’s law should hold in the most natural way, studying its validity for plain word forms and for the corresponding lemma forms. We analyze several long literary texts comprising four languages, with different levels of morphological complexity. In all cases Zipf’s law is fulfilled, in the sense that a power-law distribution of word or lemma frequencies is valid for several orders of magnitude. We investigate the extent to which the word-lemma transformation preserves two parameters of Zipf’s law: the exponent and the low-frequency cut-off. We are no...
Zipf's law states that the frequency of a word is a power function of its rank. The exponent of the ...
Zipf's law is found when the vocabulary of long written texts is ranked according to the frequency o...
The formation of sentences is a highly structured and history-dependent process. The probability of ...
(a) Probability mass functions f(n) of the absolute frequencies n of words a...
Despite being a paradigm of quantitative linguistics, Zipf's law for words suffers from three mai...
The frequency of words and letters in bodies of text has been heavily studied for several purposes, ...
Zipf’s Law is an empirical law according to which the frequency of occurrence of a word in a corpus ...
The dependence on text length of the statistical properties of word occurrences has long been consid...
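The rank-frequency statement quoted in the abstracts above (frequency as a power function of rank) can be illustrated with a minimal Python sketch. The function names `rank_frequency` and `zipf_exponent` are hypothetical, and the least-squares fit on log-log axes is only a crude stand-in chosen for brevity; it is not claimed to be the fitting procedure used in any of the listed papers, and it ignores the low-frequency cut-off they discuss.

```python
import math
from collections import Counter


def rank_frequency(tokens):
    """Return word counts sorted from most to least frequent (rank 1 first)."""
    return sorted(Counter(tokens).values(), reverse=True)


def zipf_exponent(frequencies):
    """Estimate the exponent as minus the least-squares slope of
    log(frequency) against log(rank).

    Assumption: a plain log-log regression, used here for illustration only.
    Requires at least two ranks.
    """
    xs = [math.log(rank) for rank in range(1, len(frequencies) + 1)]
    ys = [math.log(f) for f in frequencies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return -sxy / sxx


# Synthetic frequencies that follow f(r) = 1000 / r exactly,
# so the estimated exponent should be very close to 1.
synthetic = [1000 / r for r in range(1, 51)]
exponent = zipf_exponent(synthetic)
```

For a real text, one would pass a tokenized word list (or a lemmatized one, to compare word forms with lemmas) to `rank_frequency` and feed the result to `zipf_exponent`; the tokenization and lemmatization steps are left out here because the abstracts do not specify them.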