Abstract. Most text classification systems use a bag-of-words representation of documents to find the classification target function. Linguistic structures such as morphology, syntax and semantics are completely neglected in the learning process. This paper proposes a new document representation that, while including its context-independent sentence meaning, can be used by a structured kernel function, namely the direct product kernel. The proposal is evaluated using a dataset of articles from a Portuguese daily newspaper, and classifiers are built using the SVM algorithm. The results show that this structured representation, while only partially describing the document’s significance, has the same discriminative power over classes as the...
Text classification using semantic information is the latest trend of research due to its greater po...
In this study, we propose a novel methodology to build a semantic smoothing kernel to use with Suppo...
Content-based text classification is one of the basic tasks that arise in the domain of ...
Most text classification systems use bag-of-words representation of documents to find the classifi...
The most common approach to the text classification problem is to use a bag-of-words representation ...
We propose a semantic kernel for Support Vector Machines (SVM) that takes advantage of higher-order ...
The bag of words (BOW) representation of documents is very common in text classification systems. Ho...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2014 IEEE International Symposium on Innov...
In this thesis text categorization is investigated in four dimensions of analysis: theoretically as ...
Text categorization plays a crucial role in both academic and commercial platforms due to the growin...
Natural Language Processing has emerged as an active field of research in the ...
Over the past decade, text categorization has become an active field of research in ...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2013 10th International Conference on Elec...
The expanding popularity of the Internet in recent years has led to a corresponding increase in the...
Abstract. Typically, in textual document classification the documents are represented in the vector ...