In real world industrial applications of topic modeling, the ability to capture gigantic conceptual space by learning an ultra-high dimensional topical representation, i.e., the so-called “big model”, is becoming the next desideratum after enthusiasms on ”big data”, especially for fine-grained downstream tasks such as online advertising, where good performances are usually achieved by regression-based predictors built on millions if not billions of input features. The conventional data-parallel approach for training gigantic topic models turns out to be rather inefficient in utilizing the power of parallelism, due to the heavy dependency on a centralized image of “model”. Big model size also poses another challenge on the storage, where ava...
Topic models provide a powerful tool for analyzing large text collections by representing high dimen...
Scalable and effective analysis of large text corpora remains a chal-lenging problem as our ability ...
We present a scheme for fast, distributed learn-ing on big (i.e. high-dimensional) models ap-plied t...
In real world industrial applications of topic modeling, the ability to capture gigantic conceptual ...
<p>When building large-scale machine learning (ML) programs, such as big topic models or deep neural...
Abstract—Over the past few years we have witnessed an increasing popularity in the use of graphical ...
This paper describes a high performance sampling archi-tecture for inference of latent topic models ...
<p>Topic models, and more specifically the class of latent Dirichlet allocation (LDA), are widely us...
Logistic-normal topic models can effectively discover correlation structures among latent topics. Ho...
When building large-scale machine learning (ML) programs, such as massive topic models or deep neura...
ABSTRACT Topic models have played a pivotal role in analyzing large collections of complex data. Bes...
Inference in topic models typically involves a sampling step to associate latent variables with obse...
There has been a recent push for a new framework of learning, due in part to the availability of sto...
What type of algorithms and statistical techniques support learning from very large datasets over lo...
The recent emergence of Graphics Processing Units (GPUs) as general-purpose parallel computing devic...
Topic models provide a powerful tool for analyzing large text collections by representing high dimen...
Scalable and effective analysis of large text corpora remains a chal-lenging problem as our ability ...
We present a scheme for fast, distributed learn-ing on big (i.e. high-dimensional) models ap-plied t...
In real world industrial applications of topic modeling, the ability to capture gigantic conceptual ...
<p>When building large-scale machine learning (ML) programs, such as big topic models or deep neural...
Abstract—Over the past few years we have witnessed an increasing popularity in the use of graphical ...
This paper describes a high performance sampling archi-tecture for inference of latent topic models ...
<p>Topic models, and more specifically the class of latent Dirichlet allocation (LDA), are widely us...
Logistic-normal topic models can effectively discover correlation structures among latent topics. Ho...
When building large-scale machine learning (ML) programs, such as massive topic models or deep neura...
ABSTRACT Topic models have played a pivotal role in analyzing large collections of complex data. Bes...
Inference in topic models typically involves a sampling step to associate latent variables with obse...
There has been a recent push for a new framework of learning, due in part to the availability of sto...
What type of algorithms and statistical techniques support learning from very large datasets over lo...
The recent emergence of Graphics Processing Units (GPUs) as general-purpose parallel computing devic...
Topic models provide a powerful tool for analyzing large text collections by representing high dimen...
Scalable and effective analysis of large text corpora remains a chal-lenging problem as our ability ...
We present a scheme for fast, distributed learn-ing on big (i.e. high-dimensional) models ap-plied t...