In this paper, we introduce a new clustering algorithm for discovering and describing the topics contained in a text collection. Our proposal relies on both the most probable term pairs generated from the collection and an estimate of the topic homogeneity associated with each pair. Topics and their descriptions are generated from those term pairs whose support sets are homogeneous enough to represent collection topics. Experimental results on three benchmark text collections demonstrate the effectiveness and utility of this new approach. © 2009
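The abstract does not say how pair probability or homogeneity are computed, so the following Python sketch only illustrates the general idea and is not the authors' algorithm. It assumes that a pair's probability is the fraction of documents containing both terms, that a pair's support set is the set of those documents, and that homogeneity is the mean pairwise cosine similarity of raw term-frequency vectors within the support set. The function names and both thresholds (`min_prob`, `min_homogeneity`) are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def term_pair_support(docs):
    """Map each unordered term pair to the indices of documents containing both terms."""
    support = {}
    for i, doc in enumerate(docs):
        terms = sorted(set(doc.lower().split()))
        for pair in combinations(terms, 2):
            support.setdefault(pair, set()).add(i)
    return support


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def homogeneity(doc_ids, vectors):
    """Assumed measure: mean pairwise cosine similarity inside a support set."""
    ids = sorted(doc_ids)
    if len(ids) < 2:
        return 0.0
    sims = [cosine(vectors[i], vectors[j]) for i, j in combinations(ids, 2)]
    return sum(sims) / len(sims)


def discover_topics(docs, min_prob=0.3, min_homogeneity=0.5):
    """Keep term pairs that are frequent enough and whose support set is homogeneous.

    Returns a list of (term_pair, sorted_document_indices); the pair serves as
    the topic description and the indices as the topic's documents.
    """
    vectors = [Counter(doc.lower().split()) for doc in docs]
    topics = []
    for pair, ids in term_pair_support(docs).items():
        prob = len(ids) / len(docs)  # assumed estimate of pair probability
        if prob >= min_prob and homogeneity(ids, vectors) >= min_homogeneity:
            topics.append((pair, sorted(ids)))
    return topics


docs = ["apple banana", "apple banana", "car engine", "car engine"]
topics = discover_topics(docs, min_prob=0.5, min_homogeneity=0.9)
```

On this toy collection, `topics` holds two entries: the pair `("apple", "banana")` with documents 0 and 1, and the pair `("car", "engine")` with documents 2 and 3. Enumerating all term pairs is quadratic in vocabulary size per document, so a realistic version would first prune to the most probable pairs.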
Topics extraction from documents has become increasingly important due to its effectiveness in many ...
When analyzing a document collection, a key piece of information is the number of distinct topics it...
The article addresses the problem of document clusterization. The author describes a technology for ...
The exponential growth of the size and popularity of the world wide web has increased the interest i...
Topics extraction has become increasingly important due to its effectiveness in many tasks, includin...