<p>(a) shows the word sequence of Les Miserables from word 31096 to word 31116. Punctuations are considered as words. The sequences beneath illustrate how the return intervals between rare words and their lengths are defined: For <i>R</i><sub><i>Q</i></sub> = 2 and 4, only words with ranks above the corresponding <i>Q</i> value (here: <i>Q</i> = 46 and 544, respectively) are picked out and denoted by the large bars. The other words are denoted by the small bars. The return intervals are the intervals between consecutive large bars, i.e. the number of small bars between 2 consecutive large bars plus 1, and are listed beneath the sequences. (b) shows, in a segment of 300 words, the position of words with ranks above <i>Q</i> = 544, 2731, and ...
We study the correlation properties of word lengths in large texts from 30 ebooks in the English lan...
International audienceWe analyse a sample text. By identifying compounds and other sequences of word...
22nd International Symposium on Computer and Information Sciences -- NOV 07-09, 2007 -- Ankara, TURK...
<p>For each text, the left-hand graphs show the (conditional) average length of a return interval in...
A fundamental problem in linguistics is how literary texts can be quantified mathematically. It is w...
We study regularities, to which the relative frequencies of the word lengths are subject, if the ent...
<p>The figure shows the autocorrelation function <i>C</i><sub><i>Q</i></sub>(<i>s</i>) that quantifi...
In natural language using short sentences is considered efficient for communication. However, a text...
International audienceAn interval of a permutation is a consecutive substring consisting of consecut...
A statistical physics study of punctuation effects on sentence lengths is presented for written text...
In natural language, several sequences of words are very frequent. A classical language model, like ...
The structure of written texts is analyzed by focusing on word sequences. As a method, word sequence...
Tout l’univers lexicométrique est fondé sur la répétition. La fréquence d’un mot dans un texte, c’es...
The distribution of a word in a collection of texts (corpus) is the set of locations where this term...
<p>The upper panel displays the aggregated response time distributions for the three text units, wor...
We study the correlation properties of word lengths in large texts from 30 ebooks in the English lan...
International audienceWe analyse a sample text. By identifying compounds and other sequences of word...
22nd International Symposium on Computer and Information Sciences -- NOV 07-09, 2007 -- Ankara, TURK...
<p>For each text, the left-hand graphs show the (conditional) average length of a return interval in...
A fundamental problem in linguistics is how literary texts can be quantified mathematically. It is w...
We study regularities, to which the relative frequencies of the word lengths are subject, if the ent...
<p>The figure shows the autocorrelation function <i>C</i><sub><i>Q</i></sub>(<i>s</i>) that quantifi...
In natural language using short sentences is considered efficient for communication. However, a text...
International audienceAn interval of a permutation is a consecutive substring consisting of consecut...
A statistical physics study of punctuation effects on sentence lengths is presented for written text...
In natural language, several sequences of words are very frequent. A classical language model, like ...
The structure of written texts is analyzed by focusing on word sequences. As a method, word sequence...
Tout l’univers lexicométrique est fondé sur la répétition. La fréquence d’un mot dans un texte, c’es...
The distribution of a word in a collection of texts (corpus) is the set of locations where this term...
<p>The upper panel displays the aggregated response time distributions for the three text units, wor...
We study the correlation properties of word lengths in large texts from 30 ebooks in the English lan...
International audienceWe analyse a sample text. By identifying compounds and other sequences of word...
22nd International Symposium on Computer and Information Sciences -- NOV 07-09, 2007 -- Ankara, TURK...