We propose a semantic kernel for Support Vector Machines (SVM) that takes advantage of higher-order relations between the words and between the documents. Conventional approach in text categorization systems is to represent documents as a "Bag of Words" (BOW) in which the relations between the words and their positions are lost. Additionally, traditional machine learning algorithms assume that instances, in our case documents, are independent and identically distributed. This approach simplifies the underlying models, but nevertheless it ignores the semantic connections between words as well as the semantic relations between documents that stem from the words. In this study, we improve the semantic knowledge capture capability of a previous...
Vector Space Models (VSM) are commonly used in language processing to represent certain aspects of n...
Abstract: This work presents kernel functions that can be used in conjunction with the Support Vecto...
International audienceNatural Language Processing has emerged as an active field of research in the ...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 13th International Conference on Artificia...
The bag of words (BOW) representation of documents is very common in text classification systems. Ho...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2013 10th International Conference on Elec...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2014 IEEE International Symposium on Innov...
In this thesis text categorization is investigated in four dimensions of analysis: theoretically as ...
Abstract. Typically, in textual document classification the documents are represented in the vector ...
Text categorization plays a crucial role in both academic and commercial platforms due to the growin...
In this study, we propose a novel methodology to build a semantic smoothing kernel to use with Suppo...
In text categorization, a document is usually represented by a vector space model which can accompli...
The expanding popularity of the Internet in recent years has lead to a corresponding increase in the...
In this paper we propose a novel kernel for text categorization. This kernel is an inner product def...
Abstract. Most text classification systems use bag-of-words represen-tation of documents to find the...
Vector Space Models (VSM) are commonly used in language processing to represent certain aspects of n...
Abstract: This work presents kernel functions that can be used in conjunction with the Support Vecto...
International audienceNatural Language Processing has emerged as an active field of research in the ...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 13th International Conference on Artificia...
The bag of words (BOW) representation of documents is very common in text classification systems. Ho...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2013 10th International Conference on Elec...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2014 IEEE International Symposium on Innov...
In this thesis text categorization is investigated in four dimensions of analysis: theoretically as ...
Abstract. Typically, in textual document classification the documents are represented in the vector ...
Text categorization plays a crucial role in both academic and commercial platforms due to the growin...
In this study, we propose a novel methodology to build a semantic smoothing kernel to use with Suppo...
In text categorization, a document is usually represented by a vector space model which can accompli...
The expanding popularity of the Internet in recent years has lead to a corresponding increase in the...
In this paper we propose a novel kernel for text categorization. This kernel is an inner product def...
Abstract. Most text classification systems use bag-of-words represen-tation of documents to find the...
Vector Space Models (VSM) are commonly used in language processing to represent certain aspects of n...
Abstract: This work presents kernel functions that can be used in conjunction with the Support Vecto...
International audienceNatural Language Processing has emerged as an active field of research in the ...