In this paper, we propose a new probabilistic model, Bag of Timestamps (BoT), for chronological text mining. BoT is an extension of latent Dirichlet allocation (LDA), and has two remarkable features when compared with a previously proposed Topics over Time (ToT), which is also an extension of LDA. First, we can avoid overfitting to temporal data, because temporal data are modeled in a Bayesian manner similar to word frequencies. Second, BoT has a conditional probability where no functions requiring time-consuming computations appear. The experiments using newswire documents show that BoT achieves more moderate fitting to temporal data in shorter execution time than ToT.Advances in Data and Web Management. Joint International Conferences, AP...
Abstract. In this work, we consider the problem of classifying time-sensitive queries at different t...
Dynamic Topic Models (DTM) are a way to extract time-variant information from a collection of docume...
Timeline Generation, through generating news timelines from the massive data of news corpus, aims at...
Abstract. In this paper, we propose a new probabilistic model, Bag of Timestamps (BoT), for chronolo...
In this paper, we propose a new method for topical trend analysis. We model topical trends by per-to...
This paper provides a new approach to topical trend analysis. Our aim is to improve the generalizati...
This paper presents a new Bayesian topical trend analysis. We regard the parameters of topic Dirichl...
Abstract—We consider the problem of inferring and modeling topics in a sequence of documents with kn...
textThis thesis explores the temporal analysis of text using the implicit temporal cues present in d...
We consider the problem of modeling temporal textual data taking endogenous and exogenous processes ...
A single, stationary topic model such as latent Dirichlet allocation is inappropriate for modeling c...
This paper introduces a novel probabilistic activity modeling approach that mines recurrent sequenti...
International audienceDiscriminant chronicle mining (DCM) tackles temporal sequence classification b...
This paper studies the problem of latent periodic topic anal-ysis from timestamped documents. The ex...
In recent years probabilistic topic models have gained tremendous attention in data mining and natur...
Abstract. In this work, we consider the problem of classifying time-sensitive queries at different t...
Dynamic Topic Models (DTM) are a way to extract time-variant information from a collection of docume...
Timeline Generation, through generating news timelines from the massive data of news corpus, aims at...
Abstract. In this paper, we propose a new probabilistic model, Bag of Timestamps (BoT), for chronolo...
In this paper, we propose a new method for topical trend analysis. We model topical trends by per-to...
This paper provides a new approach to topical trend analysis. Our aim is to improve the generalizati...
This paper presents a new Bayesian topical trend analysis. We regard the parameters of topic Dirichl...
Abstract—We consider the problem of inferring and modeling topics in a sequence of documents with kn...
textThis thesis explores the temporal analysis of text using the implicit temporal cues present in d...
We consider the problem of modeling temporal textual data taking endogenous and exogenous processes ...
A single, stationary topic model such as latent Dirichlet allocation is inappropriate for modeling c...
This paper introduces a novel probabilistic activity modeling approach that mines recurrent sequenti...
International audienceDiscriminant chronicle mining (DCM) tackles temporal sequence classification b...
This paper studies the problem of latent periodic topic anal-ysis from timestamped documents. The ex...
In recent years probabilistic topic models have gained tremendous attention in data mining and natur...
Abstract. In this work, we consider the problem of classifying time-sensitive queries at different t...
Dynamic Topic Models (DTM) are a way to extract time-variant information from a collection of docume...
Timeline Generation, through generating news timelines from the massive data of news corpus, aims at...