Abstract. This paper presents a topic model that captures the tem-poral dynamics in the text data along with topical phrases. Previous approaches have relied upon bag-of-words assumption to model such property in a corpus. This has resulted in an inferior performance with less interpretable topics. Our topic model can not only capture changes in the way a topic structure changes over time but also maintains impor-tant contextual information in the text data. Finding topical n-grams, when possible based on context, instead of always presenting unigrams in topics does away with many ambiguities that individual words may carry. We derive a collapsed Gibbs sampler for posterior inference. Our experimental results show an improvement over the cu...
A large document collection that builds up over time usually contains a number of different themes. ...
While most topic modeling algorithms model text corpora with unigrams, human interpretation often re...
Recent work in statistical topic models has investigated richer structures to capture either tempora...
Most topic models, such as latent Dirichlet allocation, rely on the bag of words assumption. However...
This paper presents an LDA-style topic model that captures not only the low-dimensional structure of...
This paper presents an LDA-style topic model that captures not only the low-dimensional structure of...
Text corpora with documents from a range of time epochs are natu-ral and ubiquitous in many fields, ...
Abstract. Most nonparametric topic models such as Hierarchical Dirichlet Pro-cesses, when viewed as ...
Information extraction from large corpora can be a useful tool for many applications in industry and...
The abundance of data in the information age poses an immense challenge for us: how to perform large...
User generated content in the form of customer reviews, blogs or tweets is an emerging and rich sour...
A single, stationary topic model such as latent Dirichlet allocation is inappropriate for modeling c...
A single, stationary topic model such as la-tent Dirichlet allocation is inappropriate for modeling ...
Topic modeling is a probabilistic generation model to find the representative topic of a document an...
A large document collection that builds up over time usually contains a number of different ...
A large document collection that builds up over time usually contains a number of different themes. ...
While most topic modeling algorithms model text corpora with unigrams, human interpretation often re...
Recent work in statistical topic models has investigated richer structures to capture either tempora...
Most topic models, such as latent Dirichlet allocation, rely on the bag of words assumption. However...
This paper presents an LDA-style topic model that captures not only the low-dimensional structure of...
This paper presents an LDA-style topic model that captures not only the low-dimensional structure of...
Text corpora with documents from a range of time epochs are natu-ral and ubiquitous in many fields, ...
Abstract. Most nonparametric topic models such as Hierarchical Dirichlet Pro-cesses, when viewed as ...
Information extraction from large corpora can be a useful tool for many applications in industry and...
The abundance of data in the information age poses an immense challenge for us: how to perform large...
User generated content in the form of customer reviews, blogs or tweets is an emerging and rich sour...
A single, stationary topic model such as latent Dirichlet allocation is inappropriate for modeling c...
A single, stationary topic model such as la-tent Dirichlet allocation is inappropriate for modeling ...
Topic modeling is a probabilistic generation model to find the representative topic of a document an...
A large document collection that builds up over time usually contains a number of different ...
A large document collection that builds up over time usually contains a number of different themes. ...
While most topic modeling algorithms model text corpora with unigrams, human interpretation often re...
Recent work in statistical topic models has investigated richer structures to capture either tempora...