In the present work, we quantify the irregularity of different European languages belonging to four linguistic families (Romance, Germanic, Uralic and Slavic) and an artificial language (Esperanto). We modified a well-known method to calculate the approximate and sample entropy of written texts. We find differences in the degree of irregularity between the families and our method, which is based on the search of regularities in a sequence of symbols, and consistently distinguishes between natural and synthetic randomized texts. Moreover, we extended our study to the case where multiple scales are accounted for, such as the multiscale entropy analysis. Our results revealed that real texts have non-trivial structure compared to the ones obtai...
Background: The language faculty is probably the most distinctive feature of our species, and endows...
Even though the plagiarism identification issue remains relevant, modern detection methods are still...
Even though the plagiarism identification issue remains relevant, modern detection methods are still...
We compared entropy for texts written in natural languages (English, Spanish) and artificial languag...
<p>For each language, blue bars represent the average entropy of the random ...
<p>Each panel shows the distribution of the entropy of the random texts ...
While the use of statistical physics methods to analyze large corpora has been useful to unveil many...
While the use of statistical physics methods to analyze large corpora has been useful to unveil many...
The language faculty is probably the most distinctive feature of our species, and endows us with a u...
The relationship between the entropy of language and its complexity has been the subject of much spe...
The language faculty is probably the most distinctive feature of our species, ...
As is the case of many signals produced by complex systems, language presents a statistical structu...
Zipf's law is just one out of many universal laws proposed to describe statistical regularities in l...
Computational textual aesthetics aims at studying observable differences between aesthetic categorie...
<p>(A) Normalized histograms of the fluctuation exponent <i>α</i> co...
Background: The language faculty is probably the most distinctive feature of our species, and endows...
Even though the plagiarism identification issue remains relevant, modern detection methods are still...
Even though the plagiarism identification issue remains relevant, modern detection methods are still...
We compared entropy for texts written in natural languages (English, Spanish) and artificial languag...
<p>For each language, blue bars represent the average entropy of the random ...
<p>Each panel shows the distribution of the entropy of the random texts ...
While the use of statistical physics methods to analyze large corpora has been useful to unveil many...
While the use of statistical physics methods to analyze large corpora has been useful to unveil many...
The language faculty is probably the most distinctive feature of our species, and endows us with a u...
The relationship between the entropy of language and its complexity has been the subject of much spe...
The language faculty is probably the most distinctive feature of our species, ...
As is the case of many signals produced by complex systems, language presents a statistical structu...
Zipf's law is just one out of many universal laws proposed to describe statistical regularities in l...
Computational textual aesthetics aims at studying observable differences between aesthetic categorie...
<p>(A) Normalized histograms of the fluctuation exponent <i>α</i> co...
Background: The language faculty is probably the most distinctive feature of our species, and endows...
Even though the plagiarism identification issue remains relevant, modern detection methods are still...
Even though the plagiarism identification issue remains relevant, modern detection methods are still...