An important body of quantitative linguistics is constituted by a series of statistical laws about language usage. Despite the importance of these linguistic laws, some of them are poorly formulated, and, more importantly, there is no unified framework that encompasses all them. This paper presents a new perspective to establish a connection between different statistical linguistic laws. Characterizing each word type by two random variables-length (in number of characters) and absolute frequency-we show that the corresponding bivariate joint probability distribution shows a rich and precise phenomenology, with the type-length and the type-frequency distributions as its two marginals, and the conditional distribution of frequency at fixed le...
The frequency of words and letters in bodies of text has been heavily studied for several purposes, ...
Zipf’s law is a fundamental paradigm in the statistics of written and spoken natural language as wel...
This paper examines data from English, Swedish and German in order to find a theoretical distributio...
Zipf’s Law is an empirical law according to which the frequency of occurrence of a word in a corpus ...
Quantitative linguistics has provided us with a number of empirical laws that characterise the evolu...
The pioneering research of G. K. Zipf on the relationship between word frequency and other word feat...
The dependence with text length of the statistical properties of word occurrences has long been cons...
Brevity and frequency are two crucial factors in the processes of statistical learning. The compress...
The dependence on text length of the statistical properties of word occurrences has long been consid...
In his pioneering research, G.K. Zipf observed that more frequent words tend to have more meanings, ...
It is hard to imagine how the development of quantitative linguistics would have been after G.K. Zi...
The word-frequency distribution provides the fundamental building blocks that generate discourse in ...
Zipf's law is a fundamental paradigm in the statistics of written and spoken natural language as wel...
Brevity and frequency are two crucial factors in the processes of statistical learning in language. ...
The pioneering research of G.K. Zipf on the relationship between word frequency and other word featu...
The frequency of words and letters in bodies of text has been heavily studied for several purposes, ...
Zipf’s law is a fundamental paradigm in the statistics of written and spoken natural language as wel...
This paper examines data from English, Swedish and German in order to find a theoretical distributio...
Zipf’s Law is an empirical law according to which the frequency of occurrence of a word in a corpus ...
Quantitative linguistics has provided us with a number of empirical laws that characterise the evolu...
The pioneering research of G. K. Zipf on the relationship between word frequency and other word feat...
The dependence with text length of the statistical properties of word occurrences has long been cons...
Brevity and frequency are two crucial factors in the processes of statistical learning. The compress...
The dependence on text length of the statistical properties of word occurrences has long been consid...
In his pioneering research, G.K. Zipf observed that more frequent words tend to have more meanings, ...
It is hard to imagine how the development of quantitative linguistics would have been after G.K. Zi...
The word-frequency distribution provides the fundamental building blocks that generate discourse in ...
Zipf's law is a fundamental paradigm in the statistics of written and spoken natural language as wel...
Brevity and frequency are two crucial factors in the processes of statistical learning in language. ...
The pioneering research of G.K. Zipf on the relationship between word frequency and other word featu...
The frequency of words and letters in bodies of text has been heavily studied for several purposes, ...
Zipf’s law is a fundamental paradigm in the statistics of written and spoken natural language as wel...
This paper examines data from English, Swedish and German in order to find a theoretical distributio...