Although statistical physics methods applied to large corpora have revealed many patterns in texts, no comprehensive investigation has addressed the interdependence between syntactic and semantic factors. In this study, we propose a framework for determining whether a text (e.g., one written in an unknown alphabet) is compatible with a natural language and, if so, to which language it could belong. The approach is based on three types of statistical measurements: first-order statistics of word properties in a text, the topology of complex networks representing texts, and intermittency concepts in which the text is treated as a time series. Comparative experiments were performed with the New Testament in ...
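As a rough illustration of the three kinds of measurement named in the abstract, the Python sketch below computes one simple example of each. It is not the authors' implementation: the whitespace tokenizer, the word-adjacency network, the function names, and the definition of intermittency as the coefficient of variation of a word's recurrence intervals are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from statistics import mean, pstdev


def tokenize(text):
    # Assumption: lowercase whitespace splitting stands in for real tokenization.
    return text.lower().split()


def word_frequencies(tokens):
    # First-order statistic: how often each word occurs.
    return Counter(tokens)


def adjacency_degrees(tokens):
    # Network measurement (assumed model): words are nodes, and two words are
    # linked if they appear next to each other; the degree counts distinct neighbors.
    neighbors = defaultdict(set)
    for a, b in zip(tokens, tokens[1:]):
        if a != b:
            neighbors[a].add(b)
            neighbors[b].add(a)
    return {word: len(linked) for word, linked in neighbors.items()}


def intermittency(tokens, word):
    # Time-series measurement (assumed definition): standard deviation of the
    # gaps between successive occurrences of `word`, divided by the mean gap.
    positions = [i for i, t in enumerate(tokens) if t == word]
    if len(positions) < 3:
        return None  # too few occurrences to describe the spread of gaps
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return pstdev(gaps) / mean(gaps)


if __name__ == "__main__":
    sample = tokenize("x y x z z z x")
    print(word_frequencies(sample))
    print(adjacency_degrees(sample))
    print(intermittency(sample, "x"))
```

Under this assumed definition, a word that recurs at perfectly regular intervals scores zero, and larger values indicate occurrences clustered in bursts.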
In linguistic studies, the academic level of the vocabulary in a text can be described in terms of s...
Witnesses of medieval literary texts, preserved in manuscript, are layered obj...
The Voynich manuscript is a more than 600-year-old historical manuscript. It is considered one of th...
The Voynich manuscript has so far remained a mystery for linguists and cryptologists. While the t...
The Voynich manuscript is a medieval book written in an unknown script. This paper studies the relat...
This paper discusses the possible use of unconventional algorithms for the analysis and categorization of...
The statistical properties of letter frequencies in European literature texts a...
The purpose of the paper is to extend the general theory of translation to texts written in the same...
In the present work, we quantify the irregularity of different European languages belonging to four ...
Natural language is a remarkable example of a complex dynamical system which combines variation and ...
Written text is one of the fundamental manifestations of human language, and the study of its univer...
This paper presents a quantitative approach to poetry, based on the use of several statistical measu...