The writing style used in social media usually contains informal elements that can lower the performance of Natural Language Processing applications. For this reason, text normalisation techniques have drawn a lot of attention recently when dealing with informal content. However, not all the texts present the same level of informality and may not require additional pre-processing steps. Therefore, in this paper we explore the results of applying lexical normalisation applied to a sentiment analysis classification task on Web 2.0 texts, shows more than a 2.6% improvement over average F1 for the most informal data.El tipo de lenguaje empleado en las redes sociales suele incluir elementos informales que pueden afectar el rendimiento de las her...
In the past decade, sentiment analysis research has thrived, especially on social media. While this ...
is one of the most important data sources in social data analysis. However, the text contained on Tw...
As social media constitute a valuable source for data analysis for a wide range of applications, the...
The language used in social media is often characterized by the abundance of informal and non-standa...
The language used in social media is often characterized by the abundance of informal and non-standa...
We present research aiming to build tools for the normalization of User-Generated Content (UGC). We ...
El análisis de textos de la Web 2.0 es un tema de investigación relevante hoy en día. Sin embargo, s...
National audienceThe boom of natural language processing (NLP) is taking place in a world where more...
We present work in progress aiming to build tools for the normalization of User-Generated Content (U...
We present work in progress aiming to build tools for the normalization of User-Generated Content (U...
Given the growing need to quickly process texts and extract information from the data for various pu...
Sentiment analysis in the most general sense refers to the classification of a piece of text into ei...
The proliferation of Web 2.0 technologies and the increasing use of computer-mediated communication ...
The informal nature of social media text renders it very difficult to be automati-cally processed by...
Existing natural language processing systems have often been designed with standard texts in mind. H...
In the past decade, sentiment analysis research has thrived, especially on social media. While this ...
is one of the most important data sources in social data analysis. However, the text contained on Tw...
As social media constitute a valuable source for data analysis for a wide range of applications, the...
The language used in social media is often characterized by the abundance of informal and non-standa...
The language used in social media is often characterized by the abundance of informal and non-standa...
We present research aiming to build tools for the normalization of User-Generated Content (UGC). We ...
El análisis de textos de la Web 2.0 es un tema de investigación relevante hoy en día. Sin embargo, s...
National audienceThe boom of natural language processing (NLP) is taking place in a world where more...
We present work in progress aiming to build tools for the normalization of User-Generated Content (U...
We present work in progress aiming to build tools for the normalization of User-Generated Content (U...
Given the growing need to quickly process texts and extract information from the data for various pu...
Sentiment analysis in the most general sense refers to the classification of a piece of text into ei...
The proliferation of Web 2.0 technologies and the increasing use of computer-mediated communication ...
The informal nature of social media text renders it very difficult to be automati-cally processed by...
Existing natural language processing systems have often been designed with standard texts in mind. H...
In the past decade, sentiment analysis research has thrived, especially on social media. While this ...
is one of the most important data sources in social data analysis. However, the text contained on Tw...
As social media constitute a valuable source for data analysis for a wide range of applications, the...