A challenge created by the recent development in information technology is that people are often faced with an overwhelming amount of information available to them, with blogs presenting the latest and most abundant source of such information. In this thesis, I approach the problem from a standpoint of organizing the newly created information into sensible groups. The first part of the thesis is an overview of the state of the art in the areas relevant to the problem and an analysis of shortcomings of different methods. The main contribution is the development of a new algorithm that pieces together various ideas presented in the first part. It is an online hierarchical clustering algorithm that is capable of incremental model updates th...
Recent advances and widespread usage of online web services and social media platforms, coupled with...
The continuous growth of social networks and the active use of social media services result in massi...
We present and analyze the star clustering algorithm. We discuss an implementation of this algorithm...
A challenge created by the recent development in information technology is that people are often fac...
Topic detection (TD) is an important area of research whose primary goal is to detect retrospective ...
PACLIC / The University of the Philippines Visayas Cebu College Cebu City, Philippines / November 20...
We investigate four hierarchical clustering methods (single-link, complete-link, groupwise-average, ...
The development of information technology brings numerous online news and events to our daily life. ...
Detecting events from one or more temporally-ordered stream(s) of documents (e.g. news articles, blo...
Real-world events of general interest trigger engaging discussions among peoplefor short bursts in t...
News topic detection is the process of organizing news story collections and real-time news/broadcas...
This paper describes the term frequency patterns found in online news summaries published over a se...
Clustering of related or similar objects has long been regarded as a potentially useful contribution...
Clustering of related or similar objects has long been regarded as a potentially useful contribution...
We present and analyze the off-line star algorithm for clustering static information systems and the...
Recent advances and widespread usage of online web services and social media platforms, coupled with...
The continuous growth of social networks and the active use of social media services result in massi...
We present and analyze the star clustering algorithm. We discuss an implementation of this algorithm...
A challenge created by the recent development in information technology is that people are often fac...
Topic detection (TD) is an important area of research whose primary goal is to detect retrospective ...
PACLIC / The University of the Philippines Visayas Cebu College Cebu City, Philippines / November 20...
We investigate four hierarchical clustering methods (single-link, complete-link, groupwise-average, ...
The development of information technology brings numerous online news and events to our daily life. ...
Detecting events from one or more temporally-ordered stream(s) of documents (e.g. news articles, blo...
Real-world events of general interest trigger engaging discussions among peoplefor short bursts in t...
News topic detection is the process of organizing news story collections and real-time news/broadcas...
This paper describes the term frequency patterns found in online news summaries published over a se...
Clustering of related or similar objects has long been regarded as a potentially useful contribution...
Clustering of related or similar objects has long been regarded as a potentially useful contribution...
We present and analyze the off-line star algorithm for clustering static information systems and the...
Recent advances and widespread usage of online web services and social media platforms, coupled with...
The continuous growth of social networks and the active use of social media services result in massi...
We present and analyze the star clustering algorithm. We discuss an implementation of this algorithm...