Most text classification systems use bag-of-words represen- tation of documents to find the classification target function. Linguistic structures such as morphology, syntax and semantic are completely ne- glected in the learning process. This paper proposes a new document representation that, while includ- ing its context independent sentence meaning, is able to be used by a structured kernel function, namely the direct product kernel. The proposal is evaluated using a dataset of articles from a Portuguese daily newspaper and classifiers are built using the SVM algorithm. The results show that this structured representation, while only partially de- scribing document’s significance has the same discriminative power over classes as the tradi...
We propose a novel approach for categorizing text documents based on the use of a special kernel. Th...
Text classification using semantic information is the latest trend of research due to its greater po...
Most text classification research to date has used the standard'bag of words'model for text represen...
Abstract. Most text classification systems use bag-of-words represen-tation of documents to find the...
The most common approach to the text classification problem is to use a bag-of-words representation ...
We propose a semantic kernel for Support Vector Machines (SVM) that takes advantage of higher-order ...
In this thesis text categorization is investigated in four dimensions of analysis: theoretically as ...
This paper examines the role of various linguistic structures on text classification applying the st...
The bag of words (BOW) representation of documents is very common in text classification systems. Ho...
Text categorization plays a crucial role in both academic and commercial platforms due to the growin...
In text categorization, a document is usually represented by a vector space model which can accompli...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2014 IEEE International Symposium on Innov...
International audienceNatural Language Processing has emerged as an active field of research in the ...
Klasifikacija teksta temeljem sadržaja jedan je od osnovnih zadataka koji se javljaju u domeni dubin...
International audienceSince a decade, text categorization has become an active field of research in ...
We propose a novel approach for categorizing text documents based on the use of a special kernel. Th...
Text classification using semantic information is the latest trend of research due to its greater po...
Most text classification research to date has used the standard'bag of words'model for text represen...
Abstract. Most text classification systems use bag-of-words represen-tation of documents to find the...
The most common approach to the text classification problem is to use a bag-of-words representation ...
We propose a semantic kernel for Support Vector Machines (SVM) that takes advantage of higher-order ...
In this thesis text categorization is investigated in four dimensions of analysis: theoretically as ...
This paper examines the role of various linguistic structures on text classification applying the st...
The bag of words (BOW) representation of documents is very common in text classification systems. Ho...
Text categorization plays a crucial role in both academic and commercial platforms due to the growin...
In text categorization, a document is usually represented by a vector space model which can accompli...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2014 IEEE International Symposium on Innov...
International audienceNatural Language Processing has emerged as an active field of research in the ...
Klasifikacija teksta temeljem sadržaja jedan je od osnovnih zadataka koji se javljaju u domeni dubin...
International audienceSince a decade, text categorization has become an active field of research in ...
We propose a novel approach for categorizing text documents based on the use of a special kernel. Th...
Text classification using semantic information is the latest trend of research due to its greater po...
Most text classification research to date has used the standard'bag of words'model for text represen...