We present an extension to the Hierarchical Dirichlet Process (HDP), which allows for the inclusion of super-vision. Our model marries the non-parametric benefits of HDP with those of Supervised Latent Dirichlet Allocation (SLDA) to enable learning the topic space directly from data while simultaneously including the labels within the model. The proposed model is learned using variational inference which allows for the efficient use of a large training dataset. We also present the online version of variational inference, which makes the method scalable to very large datasets. We show results comparing our model to a traditional super-vised parametric topic model, SLDA, and show that it out-performs SLDA on a number of benchmark datasets. 1
Topic models for text analysis are most commonly trained using either Gibbs sampling or variational ...
We study the problem of topic modeling in corpora whose documents are organized in a multi-level hie...
In many unsupervised machine learning algorithms where labelling a large quantity of data is unfeasi...
We present an extension to the Hierarchical Dirichlet Process (HDP), which allows for the inclusion ...
We present an extension to the Hierarchical Dirichlet Process (HDP), which allows for the inclusion ...
The hierarchical Dirichlet process (HDP) is a Bayesian nonparametric model that can be used to model...
We introduce a new variational inference ob-jective for hierarchical Dirichlet process ad-mixture mo...
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP ...
A wide variety of Dirichlet-multinomial 'topic' models have found interesting applications in recent...
We present the hierarchical Dirichlet scal-ing process (HDSP), a Bayesian nonparametric mixed member...
Abstract—We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. ...
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP ...
Appropriate tools for managing large-scale data, like online texts, images and user pro-files, are b...
Abstract—We propose the supervised hierarchical Dirichlet process (sHDP), a nonparametric generative...
Recent advances have made it feasible to apply the stochastic variational paradigm to a collapsed re...
Topic models for text analysis are most commonly trained using either Gibbs sampling or variational ...
We study the problem of topic modeling in corpora whose documents are organized in a multi-level hie...
In many unsupervised machine learning algorithms where labelling a large quantity of data is unfeasi...
We present an extension to the Hierarchical Dirichlet Process (HDP), which allows for the inclusion ...
We present an extension to the Hierarchical Dirichlet Process (HDP), which allows for the inclusion ...
The hierarchical Dirichlet process (HDP) is a Bayesian nonparametric model that can be used to model...
We introduce a new variational inference ob-jective for hierarchical Dirichlet process ad-mixture mo...
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP ...
A wide variety of Dirichlet-multinomial 'topic' models have found interesting applications in recent...
We present the hierarchical Dirichlet scal-ing process (HDSP), a Bayesian nonparametric mixed member...
Abstract—We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. ...
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP ...
Appropriate tools for managing large-scale data, like online texts, images and user pro-files, are b...
Abstract—We propose the supervised hierarchical Dirichlet process (sHDP), a nonparametric generative...
Recent advances have made it feasible to apply the stochastic variational paradigm to a collapsed re...
Topic models for text analysis are most commonly trained using either Gibbs sampling or variational ...
We study the problem of topic modeling in corpora whose documents are organized in a multi-level hie...
In many unsupervised machine learning algorithms where labelling a large quantity of data is unfeasi...