The paper goes through some tools considered nowadays classical in Text Mining procedures and software. We are speaking of Latent Semantic Indexing for dimensionality reduction, and the wide literature devoted to the problem of how to weight the word importance, and how to measure similarities between words and between words and queries. Visualisation is strongly affected by these choices. Here we compare some alternatives from a statistical viewpoint. A corpus consisting of six years of the Italian edition of Le Monde Diplomatique is analysed in order to show the effects of the different weighting systems together with the potentiality of Textual Data Analysis in summarising and representing newspaper information
In this paper, we investigate the impact of several local and global weighting schemes on Latent Sem...
Aim of the paper is to propose a Text Mining strategy based on statistical tools, which make more ef...
Recent research has highlighted a series of peculiar features in translated texts deriving from \u20...
The paper goes through some tools considered nowadays classical in Text Mining procedures and softwa...
This paper provides an overview of methods of analysis of textual data by applying quantitative meas...
In this paper, after reconstructing some essential phases in the evolution of automatic analysis of ...
This paper aims at exploring the capability of the so called Latent Semantic Analysis applied to a m...
The paper aims at showing the advantages of formulating lexical structures with variable elements (L...
Pairwise similarity judgement correlations between humans and Latent Semantic Analysis (LSA) were ex...
Analysis of large text data sets is gaining popularity providing the users some insights into their ...
The definition of good strategies for Text Retrieval has become in recent years more and more import...
Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribu...
Visualization is commonly used in data analysis to help the user in getting an initial idea about th...
The focus of this paper is to understand whether the words contained in a text corpus improves the e...
Adding semantic analysis in the process of comparing news articles enables a deeper level of analysi...
In this paper, we investigate the impact of several local and global weighting schemes on Latent Sem...
Aim of the paper is to propose a Text Mining strategy based on statistical tools, which make more ef...
Recent research has highlighted a series of peculiar features in translated texts deriving from \u20...
The paper goes through some tools considered nowadays classical in Text Mining procedures and softwa...
This paper provides an overview of methods of analysis of textual data by applying quantitative meas...
In this paper, after reconstructing some essential phases in the evolution of automatic analysis of ...
This paper aims at exploring the capability of the so called Latent Semantic Analysis applied to a m...
The paper aims at showing the advantages of formulating lexical structures with variable elements (L...
Pairwise similarity judgement correlations between humans and Latent Semantic Analysis (LSA) were ex...
Analysis of large text data sets is gaining popularity providing the users some insights into their ...
The definition of good strategies for Text Retrieval has become in recent years more and more import...
Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribu...
Visualization is commonly used in data analysis to help the user in getting an initial idea about th...
The focus of this paper is to understand whether the words contained in a text corpus improves the e...
Adding semantic analysis in the process of comparing news articles enables a deeper level of analysi...
In this paper, we investigate the impact of several local and global weighting schemes on Latent Sem...
Aim of the paper is to propose a Text Mining strategy based on statistical tools, which make more ef...
Recent research has highlighted a series of peculiar features in translated texts deriving from \u20...