Topic Detection and Tracking (TDT) is a variant of classification in which the classes are not known or fixed in advance. Consider for example an incoming stream of news articles or email messages that are to be classified by topic; new classes must be created as new topics arise. The problem is a challenging one for machine learning. Instances of new topics must be recognized as not belonging to any of the existing classes (detection), and instances of old topics must be correctly classified (tracking)---often with extremely little training data per class. This paper proposes a new approach to TDT based on probabilistic, generative models. Strong statistical techniques are used to address the many challenges: hierarchical shrinkage for spa...
Bursty topics discovery in microblogs is important for people to grasp essential and valuable inform...
Emerging topic detection from microblogs has developed into an attractive task because events usuall...
This paper explores a combination of machine learning, approximate text segmentation and a vector-sp...
Topic Detection and Tracking (TDT) is a variant of classication in which the set of classes grows ov...
Multi-source web news portals provide various advantages such as richness in news content and an opp...
In this paper we propose a probabilistic model for online document clustering. We use non-parametric...
Microblog is a popular and open platform for discovering and sharing the latest news about social is...
We present a unified model of what was traditionally viewed as two separate tasks: data association ...
Topic detection (TD) is an important area of research whose primary goal is to detect retrospective ...
Structured probabilistic inference has shown to be useful in modeling complex latent structures of d...
Cataloged from PDF version of article.Multisource web news portals provide various advantages such a...
Topic detection and tracking approaches monitor broadcast news in order to spot new, previously unre...
Abstract Graphical models have become the basic framework for topic based probabilistic modeling. Es...
In this paper we present recent advances in automatic categorization of noisy, unstructured text mes...
Topic detection is concerned with the unsupervised clustering of news stories over time. The TNO top...
Bursty topics discovery in microblogs is important for people to grasp essential and valuable inform...
Emerging topic detection from microblogs has developed into an attractive task because events usuall...
This paper explores a combination of machine learning, approximate text segmentation and a vector-sp...
Topic Detection and Tracking (TDT) is a variant of classication in which the set of classes grows ov...
Multi-source web news portals provide various advantages such as richness in news content and an opp...
In this paper we propose a probabilistic model for online document clustering. We use non-parametric...
Microblog is a popular and open platform for discovering and sharing the latest news about social is...
We present a unified model of what was traditionally viewed as two separate tasks: data association ...
Topic detection (TD) is an important area of research whose primary goal is to detect retrospective ...
Structured probabilistic inference has shown to be useful in modeling complex latent structures of d...
Cataloged from PDF version of article.Multisource web news portals provide various advantages such a...
Topic detection and tracking approaches monitor broadcast news in order to spot new, previously unre...
Abstract Graphical models have become the basic framework for topic based probabilistic modeling. Es...
In this paper we present recent advances in automatic categorization of noisy, unstructured text mes...
Topic detection is concerned with the unsupervised clustering of news stories over time. The TNO top...
Bursty topics discovery in microblogs is important for people to grasp essential and valuable inform...
Emerging topic detection from microblogs has developed into an attractive task because events usuall...
This paper explores a combination of machine learning, approximate text segmentation and a vector-sp...