How to measure proximities and oppositions in large text corpora? Intertextual distance provides a simple and interesting solution. Its properties make it a good tool for text classification, and especially for tree-analysis which is fully presented and discussed here. In order to measure the quality of this classification, two indices are proposed. The method presented provides an accurate tool for literary studies-as is demonstrated by applying it to two areas of French literature, Racine's tragedies and an authorship attribution experiment. Résumé Comment mesurer les proximités et les oppositions dans les grands corpus de texts? La distance intertextuelle offre une solution simple et intéressante. Ses propriétés en font un excellent...
18 pagesInternational audienceIn the 2001, Volume 8, Number 3, issue of the Journal of Quantitative ...
Version française préliminaire à la traduction anglaise acceptée par le Journal of Quantitative Ling...
When the aim of a study is comparing and contrasting texts of the same genre and achieving a good ar...
International audienceHow to measure proximities and oppositions in large text corpora? Intertextual...
Version préliminaire soumise au comité scientifique et retenue sans modificationInternational audien...
The purpose of this paper is to test and to compare various methods used by the statistical analyses...
With the collaboration of J. Savoy, a corpus has been compiled in order to test the methods of autho...
International audienceHow can it be said that texts are "near to" or "distant from" one another? Are...
Moving from Labb\ue9\u2019s proposal envisaging the use of intertextual distance to measure the simi...
Moving from Labb\ue9\u2019s proposal envisaging the use of intertextual distance to measure the simi...
version anglaise préliminaire à l'article paru sous ce titre dans le Journal of Quantitative Linguis...
In text clustering most distance-based methods summarize the occurrences of a set of linguistic feat...
This study proposes a thematic research method using statistics (a probabilistic test) : it is appli...
The main aim of this study is testing the performance of Labbe's intertextual distance in the case o...
International audienceAs of late, the network analysis of literary texts has grown into an independe...
18 pagesInternational audienceIn the 2001, Volume 8, Number 3, issue of the Journal of Quantitative ...
Version française préliminaire à la traduction anglaise acceptée par le Journal of Quantitative Ling...
When the aim of a study is comparing and contrasting texts of the same genre and achieving a good ar...
International audienceHow to measure proximities and oppositions in large text corpora? Intertextual...
Version préliminaire soumise au comité scientifique et retenue sans modificationInternational audien...
The purpose of this paper is to test and to compare various methods used by the statistical analyses...
With the collaboration of J. Savoy, a corpus has been compiled in order to test the methods of autho...
International audienceHow can it be said that texts are "near to" or "distant from" one another? Are...
Moving from Labb\ue9\u2019s proposal envisaging the use of intertextual distance to measure the simi...
Moving from Labb\ue9\u2019s proposal envisaging the use of intertextual distance to measure the simi...
version anglaise préliminaire à l'article paru sous ce titre dans le Journal of Quantitative Linguis...
In text clustering most distance-based methods summarize the occurrences of a set of linguistic feat...
This study proposes a thematic research method using statistics (a probabilistic test) : it is appli...
The main aim of this study is testing the performance of Labbe's intertextual distance in the case o...
International audienceAs of late, the network analysis of literary texts has grown into an independe...
18 pagesInternational audienceIn the 2001, Volume 8, Number 3, issue of the Journal of Quantitative ...
Version française préliminaire à la traduction anglaise acceptée par le Journal of Quantitative Ling...
When the aim of a study is comparing and contrasting texts of the same genre and achieving a good ar...