Editor: We describe distributed algorithms for two widely-used topic models, namely the Latent Dirich-let Allocation (LDA) model, and the Hierarchical Dirichet Process (HDP) model. In our distributed algorithms the data is partitioned across separate processors and inference is done in a parallel, dis-tributed fashion. We propose two distributed algorithms for LDA. The first algorithm is a straight-forward mapping of LDA to a distributed processor setting. In this algorithm processors concur-rently perform Gibbs sampling over local data followed by a global update of topic counts. The al-gorithm is simple to implement and can be viewed as an approximation to Gibbs-sampled LDA. The second version is a model that uses a hierarchical Bayesian ...
Learning meaningful topic models with massive document collections which contain millions of documen...
The hierarchical Dirichlet process (HDP) is a Bayesian nonparametric model that can be used to model...
Learning meaningful topic models with massive document collections which contain millions of documen...
We describe distributed algorithms for two widely-used topic models, namely the Latent Dirichlet All...
Distributed learning is a problem of fundamental interest in machine learning and cognitive science....
Distributed learning is a problem of fundamental interest in machine learning and cognitive science....
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
Topic modeling algorithms (like Latent Dirichlet Allocation) tend to be very slow when run over larg...
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP ...
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of d...
We present an extension to the Hierarchical Dirichlet Process (HDP), which allows for the inclusion ...
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of ...
The sizes of modern digital libraries have grown beyond our capacity to comprehend manually. Thus we...
The hierarchical Dirichlet process (HDP) is a Bayesian nonparametric model that can be used to model...
Latent Dirichlet Allocation (LDA) is a popular topic modeling tech-nique for exploring document coll...
Learning meaningful topic models with massive document collections which contain millions of documen...
The hierarchical Dirichlet process (HDP) is a Bayesian nonparametric model that can be used to model...
Learning meaningful topic models with massive document collections which contain millions of documen...
We describe distributed algorithms for two widely-used topic models, namely the Latent Dirichlet All...
Distributed learning is a problem of fundamental interest in machine learning and cognitive science....
Distributed learning is a problem of fundamental interest in machine learning and cognitive science....
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
Topic modeling algorithms (like Latent Dirichlet Allocation) tend to be very slow when run over larg...
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP ...
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of d...
We present an extension to the Hierarchical Dirichlet Process (HDP), which allows for the inclusion ...
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of ...
The sizes of modern digital libraries have grown beyond our capacity to comprehend manually. Thus we...
The hierarchical Dirichlet process (HDP) is a Bayesian nonparametric model that can be used to model...
Latent Dirichlet Allocation (LDA) is a popular topic modeling tech-nique for exploring document coll...
Learning meaningful topic models with massive document collections which contain millions of documen...
The hierarchical Dirichlet process (HDP) is a Bayesian nonparametric model that can be used to model...
Learning meaningful topic models with massive document collections which contain millions of documen...