Currently, since the categorical distribution of short text corpus is not balanced, it is difficult to obtain accurate classification results for long text classification. To solve this problem, this paper proposes a novel method of short text classification using comprehensive feature weights. This method takes into account the situation of the samples in the positive and negative categories, as well as the category correlation of words, so as to improve the existing feature weight calculation method and obtain a new method of calculating the comprehensive feature weight. The experimental result shows that the proposed method is significantly higher than other feature-weight methods in the micro and macro average value, which shows that th...
Text classification is a wide research field with existing ready-to-use solutions for supervised tra...
The use of background knowledge is largely unexploited in text classification tasks. This paper expl...
In text classification, providing an efficient classifier even if the number of documents involved i...
Currently, since the categorical distribution of short text corpus is not balanced, it is difficult ...
One decisive problem of short text classification is the serious dimensional disaster when utilizing...
Short text understanding and short text are always more ambiguous. These short texts are produced in...
AbstractIn this paper, we propose a novel approach to classify short texts by combining both their l...
In the information age, short texts are being encountered at numerous instances and in large quantit...
Abstract. Data Sparseness, the evident characteristic of short text, is caused by the diversity of l...
Selecting features that represent a specific class is important to achieve a high text classificatio...
Text preprocessing is one of the key problems in pattern recognition and plays an important role in ...
Text categorization is an important application of machine learning to the field of document informa...
Abstract As an important step in natural language processing (NLP), text classification system has b...
This paper proposes a new approach for text categorization, based on a feature projection technique....
In some specific fields, there are a lot of ultra-short texts that need to be categorized. This pape...
Text classification is a wide research field with existing ready-to-use solutions for supervised tra...
The use of background knowledge is largely unexploited in text classification tasks. This paper expl...
In text classification, providing an efficient classifier even if the number of documents involved i...
Currently, since the categorical distribution of short text corpus is not balanced, it is difficult ...
One decisive problem of short text classification is the serious dimensional disaster when utilizing...
Short text understanding and short text are always more ambiguous. These short texts are produced in...
AbstractIn this paper, we propose a novel approach to classify short texts by combining both their l...
In the information age, short texts are being encountered at numerous instances and in large quantit...
Abstract. Data Sparseness, the evident characteristic of short text, is caused by the diversity of l...
Selecting features that represent a specific class is important to achieve a high text classificatio...
Text preprocessing is one of the key problems in pattern recognition and plays an important role in ...
Text categorization is an important application of machine learning to the field of document informa...
Abstract As an important step in natural language processing (NLP), text classification system has b...
This paper proposes a new approach for text categorization, based on a feature projection technique....
In some specific fields, there are a lot of ultra-short texts that need to be categorized. This pape...
Text classification is a wide research field with existing ready-to-use solutions for supervised tra...
The use of background knowledge is largely unexploited in text classification tasks. This paper expl...
In text classification, providing an efficient classifier even if the number of documents involved i...