Bursty features in text streams are very useful in many text mining applications. Most existing studies detect bursty features based purely on term frequency changes without taking into account the semantic contexts of terms, and as a result the detected bursty features may not always be interesting or easy to interpret. In this paper we propose to model the contexts of bursty features using a language modeling approach. We then propose a novel topic diversity-based metric using the context models to find newsworthy bursty features. We also propose to use the context models to automatically assign meaningful tags to bursty features. Using a large corpus of a stream of news articles, we quantitatively show that the proposed context language ...
The increasing pace of change in languages affects many applications and algorithms for text process...
Bursty topics discovery in microblogs is important for people to grasp essential and valuable inform...
Collaborative tagging have emerged as a ubiquitous way to annotate and organize online resources. As...
Detecting and using bursty patterns to analyze text streams has been one of the fundamental approach...
Bursty features in text streams are very useful in many text mining applications. Most existing stud...
Text classification is a major data mining task. An advanced text classification technique is known ...
Nowadays, almost all text corpora, such as blogs, emails and RSS feeds, are a collection of text str...
Previous work on text mining has almost exclusively focused on a single stream. However, we often ha...
Burst detection is an important topic in temporal stream analysis. Usually, only the textual feature...
Many document collections are by nature dynamic, evolving as the topics or events they describe chan...
A fundamental problem in text data mining is to extract meaningful structure from document streams ...
Mining retrospective events from text streams has been an important research topic. Classic text rep...
Term-based approaches can extract many features in text documents, but most include noise. Many popu...
With the dramatic growth of text information, there is an increasing need for powerful text mining s...
With the dramatic growth of text information, there is an increasing need for powerful text mining s...
The increasing pace of change in languages affects many applications and algorithms for text process...
Bursty topics discovery in microblogs is important for people to grasp essential and valuable inform...
Collaborative tagging have emerged as a ubiquitous way to annotate and organize online resources. As...
Detecting and using bursty patterns to analyze text streams has been one of the fundamental approach...
Bursty features in text streams are very useful in many text mining applications. Most existing stud...
Text classification is a major data mining task. An advanced text classification technique is known ...
Nowadays, almost all text corpora, such as blogs, emails and RSS feeds, are a collection of text str...
Previous work on text mining has almost exclusively focused on a single stream. However, we often ha...
Burst detection is an important topic in temporal stream analysis. Usually, only the textual feature...
Many document collections are by nature dynamic, evolving as the topics or events they describe chan...
A fundamental problem in text data mining is to extract meaningful structure from document streams ...
Mining retrospective events from text streams has been an important research topic. Classic text rep...
Term-based approaches can extract many features in text documents, but most include noise. Many popu...
With the dramatic growth of text information, there is an increasing need for powerful text mining s...
With the dramatic growth of text information, there is an increasing need for powerful text mining s...
The increasing pace of change in languages affects many applications and algorithms for text process...
Bursty topics discovery in microblogs is important for people to grasp essential and valuable inform...
Collaborative tagging have emerged as a ubiquitous way to annotate and organize online resources. As...