When the aim of a study is comparing and contrasting texts of the same genre and achieving a good arrangement for a text clustering, we often resort to lexical-based approaches and appropriate measures of similarity/distance (Burrows, 2002; Juola, 2008; Rudman, 1998, Stamatatos, 2009; Labb\ue9 and Labb\ue9, 2001; Tuzzi, 2010) between texts, e.g. cosine similarity, Burrows's Delta, Labb\ue9's intertextual distance, etc. Given the properties and the formula of a distance, we obtain a square matrix that includes n 7n cells and n(n-1)/2 positive non-zero non-redundant values that can be exploited for an automatic classification of the n available texts. This distance matrix might be read from an alternative perspective, i.e. as a ranking syste...
The focus of this thesis is comparison of analysis of text-document similarity using clustering algo...
Objective of the document clustering techniques is to assemble similar documents and segregate dissi...
International audienceWe propose a new similarity measure between texts which, contrary to the curre...
In text clustering most distance-based methods summarize the occurrences of a set of linguistic feat...
Moving from Labb\ue9\u2019s proposal envisaging the use of intertextual distance to measure the simi...
Moving from Labb\ue9\u2019s proposal envisaging the use of intertextual distance to measure the simi...
This study takes into account the issue of text clustering against the specific backgrou...
This study takes into account the issue of text clustering against the specific background of bag-of...
When dealing with authorship attribution (AA), famous cases of disputed authorship naturally come up...
How to measure proximities and oppositions in large text corpora? Intertextual distance provides a s...
The main aim of this study is testing the performance of Labbe's intertextual distance in the case o...
We applied hierarchical clustering using Rank distance, previously used in compu-tational stylometry...
Thematic organization of text is a natural practice of humans and a crucial task for today's vast re...
Data mining, also known as knowledge discovery in database (KDD), is the process to discover interes...
A fundamental problem in linguistics is how literary texts can be quantified mathematically. It is w...
The focus of this thesis is comparison of analysis of text-document similarity using clustering algo...
Objective of the document clustering techniques is to assemble similar documents and segregate dissi...
International audienceWe propose a new similarity measure between texts which, contrary to the curre...
In text clustering most distance-based methods summarize the occurrences of a set of linguistic feat...
Moving from Labb\ue9\u2019s proposal envisaging the use of intertextual distance to measure the simi...
Moving from Labb\ue9\u2019s proposal envisaging the use of intertextual distance to measure the simi...
This study takes into account the issue of text clustering against the specific backgrou...
This study takes into account the issue of text clustering against the specific background of bag-of...
When dealing with authorship attribution (AA), famous cases of disputed authorship naturally come up...
How to measure proximities and oppositions in large text corpora? Intertextual distance provides a s...
The main aim of this study is testing the performance of Labbe's intertextual distance in the case o...
We applied hierarchical clustering using Rank distance, previously used in compu-tational stylometry...
Thematic organization of text is a natural practice of humans and a crucial task for today's vast re...
Data mining, also known as knowledge discovery in database (KDD), is the process to discover interes...
A fundamental problem in linguistics is how literary texts can be quantified mathematically. It is w...
The focus of this thesis is comparison of analysis of text-document similarity using clustering algo...
Objective of the document clustering techniques is to assemble similar documents and segregate dissi...
International audienceWe propose a new similarity measure between texts which, contrary to the curre...