Abstract. Imbalanced training data always puzzles the supervised learning based emotion and sentiment classification. Several existing research showed that data sparseness and small disjuncts are the two major factors affecting the classification. Target to these two problems, this paper presents a word embedding based oversampling method. Firstly, a large-scale text corpus is used to train a continuous skip-gram model in order to form word embedding. A feature selection and linear combination algorithm is developed to construct text representation vector from word embedding. Based on this, the new minority class training samples are generated through calculating the mean vector of two text representation vectors in the same class until the...
Moving beyond the dominant bag-of-words approach to sentiment analysis we introduce an alternative p...
© 2018 by World Scientific Publishing Co. Pte. Ltd. All rights reserved. As an indispensable resourc...
The imbalanced sentiment distribution of microblogs induces bad performance of binary classifiers on...
Text classification often faces the problem of imbalanced training data. This is true in sentiment a...
Sentiment classification is an important task which gained extensive attention both in academia and ...
Word embeddings are effective intermediate representations for capturing semantic regularities betwe...
Since some sentiment words have similar syntactic and semantic features in the corpus, existing pre-...
International audienceMost existing continuous word representation learning algorithms usually only ...
Context-based word embedding learning approaches can model rich semantic and syntactic information. ...
In this article, how word embeddings can be used as features in Chinese sentiment classi-fication is...
Abstract—A sentiment classification method for Chinese microblog is presented. For short sentence mi...
In this paper, we propose an emotion separated method(SeTF·IDF) to assign the emotion labels of sent...
This paper proposes a novel approach using a coarse-to-fine analysis strategy for sentence-level emo...
Sentiment analysis is a well-known and rapidly expanding study topic in natural language processing ...
For sentiment analysis, lexicons play an important role in many related tasks. In this paper, aiming...
Moving beyond the dominant bag-of-words approach to sentiment analysis we introduce an alternative p...
© 2018 by World Scientific Publishing Co. Pte. Ltd. All rights reserved. As an indispensable resourc...
The imbalanced sentiment distribution of microblogs induces bad performance of binary classifiers on...
Text classification often faces the problem of imbalanced training data. This is true in sentiment a...
Sentiment classification is an important task which gained extensive attention both in academia and ...
Word embeddings are effective intermediate representations for capturing semantic regularities betwe...
Since some sentiment words have similar syntactic and semantic features in the corpus, existing pre-...
International audienceMost existing continuous word representation learning algorithms usually only ...
Context-based word embedding learning approaches can model rich semantic and syntactic information. ...
In this article, how word embeddings can be used as features in Chinese sentiment classi-fication is...
Abstract—A sentiment classification method for Chinese microblog is presented. For short sentence mi...
In this paper, we propose an emotion separated method(SeTF·IDF) to assign the emotion labels of sent...
This paper proposes a novel approach using a coarse-to-fine analysis strategy for sentence-level emo...
Sentiment analysis is a well-known and rapidly expanding study topic in natural language processing ...
For sentiment analysis, lexicons play an important role in many related tasks. In this paper, aiming...
Moving beyond the dominant bag-of-words approach to sentiment analysis we introduce an alternative p...
© 2018 by World Scientific Publishing Co. Pte. Ltd. All rights reserved. As an indispensable resourc...
The imbalanced sentiment distribution of microblogs induces bad performance of binary classifiers on...