We present LDA*, a system that has been deployed in one of the largest Internet companies to fulfil their requirements of "topic modeling as an internal service"-relying on thousands of machines, engineers in different sectors submit their data, some are as large as 1.8TB, to LDA* and get results back in hours. LDA* is motivated by the observation that none of the existing topic modeling systems is robust enough-Each of these existing systems is designed for a specific point in the tradeoffspace that can be suboptimal, sometimes by up to 10×, across workloads. Our first contribution is a systematic study of all recently proposed samplers: AliasLDA, F+LDA, LightLDA, and WarpLDA. We discovered a novel system tradeoffamong these samplers. Each...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
The demand for Natural Language Processing has been thriving rapidly due to the various emerging Int...
As a quantitative text analytic method, Latent Dirichlet Allocation (LDA) topic modeling has been wi...
Topic models such as Latent Dirichlet Allocation (LDA) have been widely used in information retrieva...
When building large-scale machine learning (ML) programs, such as big topic models or deep neural ne...
When building large-scale machine learning (ML) programs, such as massive topic models or deep neura...
Learning meaningful topic models with massive document collections which contain millions of documen...
Abstract—Electronic documents on the Internet are always generated with many kinds of side informati...
With the rapid growth of information technology, the amount of unstructured text data in digital lib...
Topic Modeling for Research Software ABSTRACT Currently, the amount of daily publications in diffe...
Topics discovered by the latent Dirichlet allocation (LDA) method are sometimes not meaningful for h...
We present the design and implementation of GLDA, a library that utilizes the GPU (Graphics Processi...
In real world industrial applications of topic modeling, the ability to capture gigantic conceptual ...
This paper describes a high performance sampling archi-tecture for inference of latent topic models ...
Topic models help make sense of large text collections. Automatically evaluating their output and de...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
The demand for Natural Language Processing has been thriving rapidly due to the various emerging Int...
As a quantitative text analytic method, Latent Dirichlet Allocation (LDA) topic modeling has been wi...
Topic models such as Latent Dirichlet Allocation (LDA) have been widely used in information retrieva...
When building large-scale machine learning (ML) programs, such as big topic models or deep neural ne...
When building large-scale machine learning (ML) programs, such as massive topic models or deep neura...
Learning meaningful topic models with massive document collections which contain millions of documen...
Abstract—Electronic documents on the Internet are always generated with many kinds of side informati...
With the rapid growth of information technology, the amount of unstructured text data in digital lib...
Topic Modeling for Research Software ABSTRACT Currently, the amount of daily publications in diffe...
Topics discovered by the latent Dirichlet allocation (LDA) method are sometimes not meaningful for h...
We present the design and implementation of GLDA, a library that utilizes the GPU (Graphics Processi...
In real world industrial applications of topic modeling, the ability to capture gigantic conceptual ...
This paper describes a high performance sampling archi-tecture for inference of latent topic models ...
Topic models help make sense of large text collections. Automatically evaluating their output and de...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
The demand for Natural Language Processing has been thriving rapidly due to the various emerging Int...
As a quantitative text analytic method, Latent Dirichlet Allocation (LDA) topic modeling has been wi...