International audienceKernels are widely used in Natural Language Processing as similarity measures within inner-product based learning methods like the Support Vector Machine. The Vector Space Model (VSM) is extensively used for the spatial representation of the documents. However, it is purely a statistical representation. In this paper, we present a Concept Vector Space Model (CVSM) representation which uses linguistic prior knowledge to capture the meanings of the documents. We also propose a linear kernel and a latent kernel for this space. The linear kernel takes advantage of the linguistic concepts whereas the latent kernel combines statistical and linguistic concepts. Indeed, the latter kernel uses latent concepts extracted by the L...
We present an empirical study on the use of semantic information for Concept Seg-mentation and Label...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2014 IEEE International Symposium on Innov...
Models that represent meaning as high-dimensional numerical vectors—such as latent semantic analysis...
International audienceThe document similarity measure is a key point in textual data processing. It ...
International audienceNatural Language Processing has emerged as an active field of research in the ...
In this thesis text categorization is investigated in four dimensions of analysis: theoretically as ...
The Basic Vector Space Model (BVSM) is well known in information retrieval. Unfortunately, its retri...
Latent semantic analysis (LSA)is based on the concept of vector space mod-els, an approach using lin...
University of Minnesota Ph.D. dissertation. June 2010. Major: Computer Science. Advisor: William Edw...
We propose a semantic kernel for Support Vector Machines (SVM) that takes advantage of higher-order ...
Computers understand very little of the meaning of human language. This profoundly limits our abilit...
Text categorization plays a crucial role in both academic and commercial platforms due to the growin...
Symbolic approaches have dominated NLP as a means to model syntactic and semantic aspects of natural...
International audienceSince a decade, text categorization has become an active field of research in ...
The bag of words (BOW) representation of documents is very common in text classification systems. Ho...
We present an empirical study on the use of semantic information for Concept Seg-mentation and Label...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2014 IEEE International Symposium on Innov...
Models that represent meaning as high-dimensional numerical vectors—such as latent semantic analysis...
International audienceThe document similarity measure is a key point in textual data processing. It ...
International audienceNatural Language Processing has emerged as an active field of research in the ...
In this thesis text categorization is investigated in four dimensions of analysis: theoretically as ...
The Basic Vector Space Model (BVSM) is well known in information retrieval. Unfortunately, its retri...
Latent semantic analysis (LSA)is based on the concept of vector space mod-els, an approach using lin...
University of Minnesota Ph.D. dissertation. June 2010. Major: Computer Science. Advisor: William Edw...
We propose a semantic kernel for Support Vector Machines (SVM) that takes advantage of higher-order ...
Computers understand very little of the meaning of human language. This profoundly limits our abilit...
Text categorization plays a crucial role in both academic and commercial platforms due to the growin...
Symbolic approaches have dominated NLP as a means to model syntactic and semantic aspects of natural...
International audienceSince a decade, text categorization has become an active field of research in ...
The bag of words (BOW) representation of documents is very common in text classification systems. Ho...
We present an empirical study on the use of semantic information for Concept Seg-mentation and Label...
Ganiz, Murat Can (Dogus Author) -- Conference full title: 2014 IEEE International Symposium on Innov...
Models that represent meaning as high-dimensional numerical vectors—such as latent semantic analysis...