Topic detection and tracking (TDT) algorithms have long been developed for the discovery of topics. However, most existing TDT algorithms suffer from paying less attention to: (1) temporal distance between a pair of topics; (2) the mutual effect between highly correlated topic terms. In this paper, we proposed a novel topic detection approach by applying hierarchical clustering on the constructed concept graph (HCCG), which is able to solve aforementioned shortcomings simultaneously. In this approach, the concept is first defined as well as the concept behavior curve. Then, the temporal graph is constructed with concept as vertexes and connected by the edges sharing the same topic terms. By performing hierarchical clustering on this concept...
In this research, we extend probabilistic topic models, originally developed for a textual corpus an...
When analyzing a document collection, a key piece of information is the number of distinct topics it...
Abstract: "The world wide web represents vast stores of information. However, the sheer amount of su...
News topic detection is the process of organizing news story collections and real-time news/broadcas...
As large-scale digital text collections become abundant, the necessity of automatically summarizing ...
Topic hierarchies can help researchers to develop a quick and concise understanding of the main them...
International audienceThe most popular topic modelling algorithm, Latent Dirichlet Allocation, produ...
In this paper, we introduce a new clustering algorithm for discovering and describing the topics com...
Abstract—We are concerned with the general problem of concept mining – discovering useful associatio...
Abstract—We are concerned with the general problem of concept mining – discovering useful associatio...
Hierarchical topic detection is a new task in the TDT 2004 evaluation program, which aims to organiz...
<p>All topics with “language in their top 5 terms were first identified from the results for topic m...
Abstract—An algorithm named SMHP (Similarity Matrix based Hypergraph Partition) algorithm is propose...
PACLIC / The University of the Philippines Visayas Cebu College Cebu City, Philippines / November 20...
We address the problem of Topic Detection and Tracking (TDT) and subsequently detecting trends from...
In this research, we extend probabilistic topic models, originally developed for a textual corpus an...
When analyzing a document collection, a key piece of information is the number of distinct topics it...
Abstract: "The world wide web represents vast stores of information. However, the sheer amount of su...
News topic detection is the process of organizing news story collections and real-time news/broadcas...
As large-scale digital text collections become abundant, the necessity of automatically summarizing ...
Topic hierarchies can help researchers to develop a quick and concise understanding of the main them...
International audienceThe most popular topic modelling algorithm, Latent Dirichlet Allocation, produ...
In this paper, we introduce a new clustering algorithm for discovering and describing the topics com...
Abstract—We are concerned with the general problem of concept mining – discovering useful associatio...
Abstract—We are concerned with the general problem of concept mining – discovering useful associatio...
Hierarchical topic detection is a new task in the TDT 2004 evaluation program, which aims to organiz...
<p>All topics with “language in their top 5 terms were first identified from the results for topic m...
Abstract—An algorithm named SMHP (Similarity Matrix based Hypergraph Partition) algorithm is propose...
PACLIC / The University of the Philippines Visayas Cebu College Cebu City, Philippines / November 20...
We address the problem of Topic Detection and Tracking (TDT) and subsequently detecting trends from...
In this research, we extend probabilistic topic models, originally developed for a textual corpus an...
When analyzing a document collection, a key piece of information is the number of distinct topics it...
Abstract: "The world wide web represents vast stores of information. However, the sheer amount of su...