Traditional techniques of document clustering do not consider the semantic relationships between words when assigning documents to clusters. For instance, if two documents talking about the same topic do that using different words (which may be synonyms or semantically associated), these techniques may assign documents to different clusters. Previous research has approached this problem by enriching the document representation with the background knowledge in an ontology. This paper presents a new approach to enhance document clustering by exploiting the semantic knowledge contained in Wikipedia. We first map terms within documents to their corresponding Wikipedia concepts. Then, similarity between each pair of terms is calculated by using ...
The constant success of the Internet made the number of text documents in electronic forms increases...
In this article, we examine an algorithm for document clustering using a similarity graph. The graph...
The constant success of the Internet made the number of text documents in electronic forms increases...
In traditional text clustering methods, documents are represented as “bags of words ” without consid...
Most traditional text clustering methods are based on “bag of words ” (BOW) representation based on ...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...
A graph-based distance between Wikipedia ar-ticles is defined using a random walk model, which estim...
Semantic document clustering is a type of unsupervised learning in which documents are grouped toget...
Wikipedia has been applied as a background knowledge base to various text mining problems, but very ...
Wikipedia has been applied as a background knowledge base to various text mining problems, but very ...
Wikipedia has been applied as a background knowledge base to various text mining problems, but very ...
Wikipedia has been applied as a background knowledge base to various text mining problems, but very ...
Abstract: This paper provides a solution to the issue: “How can we use Wikipedia based concepts in d...
The constant success of the Internet made the number of text documents in electronic forms increases...
In this article, we examine an algorithm for document clustering using a similarity graph. The graph...
The constant success of the Internet made the number of text documents in electronic forms increases...
In traditional text clustering methods, documents are represented as “bags of words ” without consid...
Most traditional text clustering methods are based on “bag of words ” (BOW) representation based on ...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...
This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document ...
A graph-based distance between Wikipedia ar-ticles is defined using a random walk model, which estim...
Semantic document clustering is a type of unsupervised learning in which documents are grouped toget...
Wikipedia has been applied as a background knowledge base to various text mining problems, but very ...
Wikipedia has been applied as a background knowledge base to various text mining problems, but very ...
Wikipedia has been applied as a background knowledge base to various text mining problems, but very ...
Wikipedia has been applied as a background knowledge base to various text mining problems, but very ...
Abstract: This paper provides a solution to the issue: “How can we use Wikipedia based concepts in d...
The constant success of the Internet made the number of text documents in electronic forms increases...
In this article, we examine an algorithm for document clustering using a similarity graph. The graph...
The constant success of the Internet made the number of text documents in electronic forms increases...