AbstractIn this paper, we propose a novel approach to classify short texts by combining both their lexical and semantic features. We present an improved measurement method for lexical feature selection and furthermore obtain the semantic features with the background knowledge repository which covers target category domains. The combination of lexical and semantic features is achieved by mapping words to topics with different weights. In this way, the dimensionality of feature space is reduced to the number of topics. We here use Wikipedia as background knowledge and employ Support Vector Machine (SVM) as classifier. The experiment results show that our approach has better effectiveness compared with existing methods for classifying short te...
Currently, since the categorical distribution of short text corpus is not balanced, it is difficult ...
Text classification typically performs best with large training sets, but short texts are very commo...
Abstract. Data Sparseness, the evident characteristic of short text, is caused by the diversity of l...
AbstractIn this paper, we propose a novel approach to classify short texts by combining both their l...
The use of background knowledge is largely unexploited in text classification tasks. This paper expl...
Understanding short texts is crucial to many applications, but challenges abound. First, short texts...
Abstract-Understanding short texts is crucial to many applications, but challenges abound. First, sh...
With the exponential growth in the daily publication of scientific articles, automatic classificatio...
With the exponential growth in the daily publication of scientific articles, automatic classificatio...
Understanding short texts is crucial to many applications, but challenges abound. First, short texts...
With the exponential growth in the daily publication of scientific articles, automatic classificatio...
We address the problem of the categorization of short texts, like those posted by users on social ne...
With the exponential growth in the daily publication of scientific articles, automatic classificatio...
With the exponential growth in the daily publication of scientific articles, automatic classificatio...
Short text understanding and short text are always more ambiguous. These short texts are produced in...
Currently, since the categorical distribution of short text corpus is not balanced, it is difficult ...
Text classification typically performs best with large training sets, but short texts are very commo...
Abstract. Data Sparseness, the evident characteristic of short text, is caused by the diversity of l...
AbstractIn this paper, we propose a novel approach to classify short texts by combining both their l...
The use of background knowledge is largely unexploited in text classification tasks. This paper expl...
Understanding short texts is crucial to many applications, but challenges abound. First, short texts...
Abstract-Understanding short texts is crucial to many applications, but challenges abound. First, sh...
With the exponential growth in the daily publication of scientific articles, automatic classificatio...
With the exponential growth in the daily publication of scientific articles, automatic classificatio...
Understanding short texts is crucial to many applications, but challenges abound. First, short texts...
With the exponential growth in the daily publication of scientific articles, automatic classificatio...
We address the problem of the categorization of short texts, like those posted by users on social ne...
With the exponential growth in the daily publication of scientific articles, automatic classificatio...
With the exponential growth in the daily publication of scientific articles, automatic classificatio...
Short text understanding and short text are always more ambiguous. These short texts are produced in...
Currently, since the categorical distribution of short text corpus is not balanced, it is difficult ...
Text classification typically performs best with large training sets, but short texts are very commo...
Abstract. Data Sparseness, the evident characteristic of short text, is caused by the diversity of l...