The publication time of a document carries a relevant information about its semantic content. The Dirichlet-Hawkes process has been proposed to jointly model textual information and publication dynamics. This approach has been used with success in several recent works, and extended to tackle specific challenging problems --typically for short texts or entangled publication dynamics. However, the prior in its current form does not allow for complex publication dynamics. In particular, inferred topics are independent from each other --a publication about finance is assumed to have no influence on publications about politics, for instance. In this work, we develop the Multivariate Powered Dirichlet-Hawkes Process (MPDHP), that alleviates thi...
Editor: We describe distributed algorithms for two widely-used topic models, namely the Latent Diric...
textSince its introduction, topic modeling has been a fundamental tool in analyzing corpus structure...
Dynamic Topic Models (DTM) are a way to extract time-variant information from a collection of docume...
International audienceThe publication time of a document carries a relevant information about its se...
International audienceThe textual content of a document and its publication date are intertwined. Fo...
Clusters in document streams, such as online news articles, can be induced by their textual contents...
Most models of information diffusion online rely on the assumption that pieces of information spread...
Information spread on networks can be efficiently modeled by considering three features: documents' ...
International audienceInformation spread on networks can be efficiently modeled by considering three...
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP ...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
Understanding how topics within a document evolve over the structure of the document is an interesti...
Abstract Understanding how topics within a document evolve over the structure of the document is an ...
We describe distributed algorithms for two widely-used topic models, namely the Latent Dirichlet All...
The multivariate Hawkes process (MHP) is widely used for analyzing data streams that interact with e...
Editor: We describe distributed algorithms for two widely-used topic models, namely the Latent Diric...
textSince its introduction, topic modeling has been a fundamental tool in analyzing corpus structure...
Dynamic Topic Models (DTM) are a way to extract time-variant information from a collection of docume...
International audienceThe publication time of a document carries a relevant information about its se...
International audienceThe textual content of a document and its publication date are intertwined. Fo...
Clusters in document streams, such as online news articles, can be induced by their textual contents...
Most models of information diffusion online rely on the assumption that pieces of information spread...
Information spread on networks can be efficiently modeled by considering three features: documents' ...
International audienceInformation spread on networks can be efficiently modeled by considering three...
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP ...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
Understanding how topics within a document evolve over the structure of the document is an interesti...
Abstract Understanding how topics within a document evolve over the structure of the document is an ...
We describe distributed algorithms for two widely-used topic models, namely the Latent Dirichlet All...
The multivariate Hawkes process (MHP) is widely used for analyzing data streams that interact with e...
Editor: We describe distributed algorithms for two widely-used topic models, namely the Latent Diric...
textSince its introduction, topic modeling has been a fundamental tool in analyzing corpus structure...
Dynamic Topic Models (DTM) are a way to extract time-variant information from a collection of docume...