Abstract—Document representation is a fundamental prob-lem for text mining. Many efforts have been done to generate concise yet semantic representation, such as bag-of-words, phrase, sentence and topic-level descriptions. Nevertheless, most existing techniques counter difficulties in handling mono-lingual comparable corpus, which is a collection of mono-lingual documents conveying the same topic. In this paper, we propose the use of frame, a high-level semantic unit, and construct frame-based representations to semantically describe documents by bags of frames, using an information network approach. One major challenge in this representation is that semantically similar frames may be of different forms. For example, “radiation leaked ” in o...
Most traditional text clustering methods are based on “bag of words ” (BOW) representation based on ...
This paper proposes an evolution over MERGILO, a tool for reconciling knowledge graphs extracted fro...
We propose a novel methodology for extracting semantic similarity knowledge from semi-structured sou...
Abstract — Conventional document mining systems mainly use the presence or absence of keywords to mi...
Conventional document mining systems mainly use the presence or absence of keywords to mine texts. H...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...
Two document representation methods are mainly used in solving text mining problems. Known for its i...
Traditional techniques of document clustering do not consider the semantic relationships between wor...
Abstract: Text-mining methods have become a key feature for homeland-security technologies, as they ...
The multiple ways in which propositional content can be expressed is often referred to as the paraph...
Abstract:Most of the common techniques of text mining are based on the statistical analysis of the t...
Semantic document clustering is a type of unsupervised learning in which documents are grouped toget...
Measuring semantic nearness of documents is important for accurate information retrieval, automated ...
Thematic organization of text is a natural practice of humans and a crucial task for today's vast re...
AGAP : équipe GE2popDocument Indexing is but not limited to summarizing document contents with a sma...
Most traditional text clustering methods are based on “bag of words ” (BOW) representation based on ...
This paper proposes an evolution over MERGILO, a tool for reconciling knowledge graphs extracted fro...
We propose a novel methodology for extracting semantic similarity knowledge from semi-structured sou...
Abstract — Conventional document mining systems mainly use the presence or absence of keywords to mi...
Conventional document mining systems mainly use the presence or absence of keywords to mine texts. H...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...
Two document representation methods are mainly used in solving text mining problems. Known for its i...
Traditional techniques of document clustering do not consider the semantic relationships between wor...
Abstract: Text-mining methods have become a key feature for homeland-security technologies, as they ...
The multiple ways in which propositional content can be expressed is often referred to as the paraph...
Abstract:Most of the common techniques of text mining are based on the statistical analysis of the t...
Semantic document clustering is a type of unsupervised learning in which documents are grouped toget...
Measuring semantic nearness of documents is important for accurate information retrieval, automated ...
Thematic organization of text is a natural practice of humans and a crucial task for today's vast re...
AGAP : équipe GE2popDocument Indexing is but not limited to summarizing document contents with a sma...
Most traditional text clustering methods are based on “bag of words ” (BOW) representation based on ...
This paper proposes an evolution over MERGILO, a tool for reconciling knowledge graphs extracted fro...
We propose a novel methodology for extracting semantic similarity knowledge from semi-structured sou...