In this dissertation, we introduce a novel text representation method mainly used for text classification purpose. The presented representation method is initially based on a variety of closeness relationships between pairs of words in text passages within the entire corpus. This representation is then used as the basis for our multi-level lightweight ontological representation method (TOR-FUSE), in which documents are represented based on their contexts and the goal of the learning task. The method is unlike the traditional representation methods, in which all the documents are represented solely based on the constituent words of the documents, and are totally isolated from the goal that they are represented for. We believe choosing the co...
With the explosive growth of Internet and computing technology, human beings are confronted by a gre...
International audienceWe propose an approach for semi-automated construction of ontologies from text...
Unsupervised learning text representations aims at converting natural languages into vector represen...
The principal feature of ontology, which is developed for a text processing, is wider knowledge rep...
Text classification (TC) is an important foundation of information retrieval and text mining. The m...
Automatic text classification is the process of automatically classifying text documents into pre-de...
It is well known that synonymous and polysemous terms often bring in some noises when calculating th...
Most text classification research to date has used the standard'bag of words'model for text represen...
Researchers in many disciplines have been concerned with modeling textual data in order to account f...
Automatic text classification is the task of organizing documents into pre-determined classes, gener...
Most text classification research to date has used the standard "bag of words" model for text repres...
Structuring of text document knowledge frequently appears either by ontologies and metadata or by au...
We are living in the age of internet where massive amount of information is produced from various di...
Knowledge in textual form is always presented as visually and hierarchically structured units of tex...
Recent work has made much of using semantic knowledge, derived in particular from domain ontologies,...
With the explosive growth of Internet and computing technology, human beings are confronted by a gre...
International audienceWe propose an approach for semi-automated construction of ontologies from text...
Unsupervised learning text representations aims at converting natural languages into vector represen...
The principal feature of ontology, which is developed for a text processing, is wider knowledge rep...
Text classification (TC) is an important foundation of information retrieval and text mining. The m...
Automatic text classification is the process of automatically classifying text documents into pre-de...
It is well known that synonymous and polysemous terms often bring in some noises when calculating th...
Most text classification research to date has used the standard'bag of words'model for text represen...
Researchers in many disciplines have been concerned with modeling textual data in order to account f...
Automatic text classification is the task of organizing documents into pre-determined classes, gener...
Most text classification research to date has used the standard "bag of words" model for text repres...
Structuring of text document knowledge frequently appears either by ontologies and metadata or by au...
We are living in the age of internet where massive amount of information is produced from various di...
Knowledge in textual form is always presented as visually and hierarchically structured units of tex...
Recent work has made much of using semantic knowledge, derived in particular from domain ontologies,...
With the explosive growth of Internet and computing technology, human beings are confronted by a gre...
International audienceWe propose an approach for semi-automated construction of ontologies from text...
Unsupervised learning text representations aims at converting natural languages into vector represen...