The recent availability of large corpora of digitized texts spanning several centuries opens the way to new kinds of studies on the evolution of languages. In this thesis, we study a corpus of 4 million press articles covering a period of 200 years. The thesis seeks to measure the evolution of written French over this period at the level of words and expressions, and also more globally, by attempting to define integrated measures of linguistic evolution. The methodological choice is to introduce a minimum of linguistic hypotheses by developing new measures around the simple notion of the n-gram, a sequence of n consecutive words. On this basis, the thesis explores the potential of already known concepts such as temporal frequency profil...
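As an illustration of the n-gram notion defined above, the following is a minimal sketch and not the thesis's actual pipeline. The function names (`tokenize`, `ngrams`, `relative_frequencies_by_year`), the lowercase letter-only tokenization, and the toy articles are all assumptions made for this example. It counts n-grams per year and normalizes them to relative frequencies, without letting an n-gram span two articles.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: tokens are lowercase runs of Unicode letters;
    # digits, punctuation and apostrophes act as separators.
    return re.findall(r"[^\W\d_]+", text.lower())


def ngrams(tokens, n):
    # All sequences of n consecutive tokens, as tuples.
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def relative_frequencies_by_year(articles, n):
    # articles: iterable of (year, text) pairs (hypothetical input format).
    counts = defaultdict(Counter)
    for year, text in articles:
        # Each article is processed on its own, so no n-gram
        # crosses an article boundary.
        counts[year].update(ngrams(tokenize(text), n))

    freqs = {}
    for year, counter in counts.items():
        total = sum(counter.values())
        freqs[year] = {g: c / total for g, c in counter.items()} if total else {}
    return freqs


# Toy data, invented for illustration only.
toy_articles = [
    (1850, "Le chemin de fer"),
    (1850, "Le chemin"),
    (1950, "Le train"),
]
bigram_freqs = relative_frequencies_by_year(toy_articles, 2)
# bigram_freqs[1850][("le", "chemin")] is 2 of 4 bigrams, i.e. 0.5
```

Reading the value of one n-gram across the years of such a table gives its frequency over time; on a real corpus the tokenization and normalization choices would need to be adapted to the data.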
The recent dramatic increase in online data availability has allowed researchers to explore human cu...
The present study deals with textual frequencies, considered from the point of...
This paper presents a methodology to analyze linguistic changes in a given textual corpus allowing t...
The relationship between the entropy of language and its complexity has been the subject of much spe...
This work aims to study grammaticalization, the process by which the functional items of a language ...
The aim of the present study is to assess the use of n-grams and Correspondenc...
While there has been little work yet on quantifying diachronic change in multi-word sequences (MWS) ...
This article shows how new approaches in corpus analysis could enrich traditional lexicographic desc...