In this paper we propose a novel kernel for text categorization. This kernel is an inner product defined in the feature space generated by all word combinations of specified length. A word combination is a collection of unique words co-occurring in the same sentence. The word combination of length k is weighted by the k rm th root of the product of the inverse document frequencies (IDF) of its words. By discarding word order, the word combination features are more compatible with the flexibility of natural language and the feature dimensions of documents can be reduced significantly to improve the sparseness of feature representations. By restricting the words to the same sentence and considering multi-word combinations, the word combinatio...
This paper introduces a convolutional sen-tence kernel based on word embeddings. Our kernel overcome...
We carried out a series of experiments on text classification using multi-word features. A hand-craf...
The problem of combining different sources of information arises in several situations, for instance...
The expanding popularity of the Internet in recent years has lead to a corresponding increase in the...
We propose a novel approach for categorizing text documents based on the use of a special kernel. Th...
We propose a semantic kernel for Support Vector Machines (SVM) that takes advantage of higher-order ...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2013 10th International Conference on Elec...
We propose a novel approach for categorizing text documents based on the use of a special kernel. Th...
In text categorization, a document is usually represented by a vector space model which can accompli...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2014 IEEE International Symposium on Innov...
In this thesis text categorization is investigated in four dimensions of analysis: theoretically as ...
Traditional bag-of-words model and recent word-sequence kernel are two well-known techniques in the ...
The bag of words (BOW) representation of documents is very common in text classification systems. Ho...
University of Technology, Sydney. Faculty of Engineering and Information Technology.NO FULL TEXT AVA...
Text categorization plays a crucial role in both academic and commercial platforms due to the growin...
This paper introduces a convolutional sen-tence kernel based on word embeddings. Our kernel overcome...
We carried out a series of experiments on text classification using multi-word features. A hand-craf...
The problem of combining different sources of information arises in several situations, for instance...
The expanding popularity of the Internet in recent years has lead to a corresponding increase in the...
We propose a novel approach for categorizing text documents based on the use of a special kernel. Th...
We propose a semantic kernel for Support Vector Machines (SVM) that takes advantage of higher-order ...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2013 10th International Conference on Elec...
We propose a novel approach for categorizing text documents based on the use of a special kernel. Th...
In text categorization, a document is usually represented by a vector space model which can accompli...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2014 IEEE International Symposium on Innov...
In this thesis text categorization is investigated in four dimensions of analysis: theoretically as ...
Traditional bag-of-words model and recent word-sequence kernel are two well-known techniques in the ...
The bag of words (BOW) representation of documents is very common in text classification systems. Ho...
University of Technology, Sydney. Faculty of Engineering and Information Technology.NO FULL TEXT AVA...
Text categorization plays a crucial role in both academic and commercial platforms due to the growin...
This paper introduces a convolutional sen-tence kernel based on word embeddings. Our kernel overcome...
We carried out a series of experiments on text classification using multi-word features. A hand-craf...
The problem of combining different sources of information arises in several situations, for instance...