Unsupervised probabilistic Bayesian models are powerful tools for statistical analysis, especially in the area of information retrieval, document analysis and text processing. Despite their success, unsupervised probabilistic Bayesian models are often slow in inference due to inter-entangled mutually dependent latent variables. In addition, the parameter space of these models is usually very large. As the data from various different media sources--for example, internet, electronic books, digital films, etc--become widely accessible, lack of scalability for these unsupervised probabilistic Bayesian models becomes a critical bottleneck. The primary focus of this dissertation is to speed up the inference process in unsupervised probabilistic ...
Intrinsically, topic models have always their likelihood functions fixed to multinomial distributio...
With the physical constraints of semiconductor-based electronics becoming increasingly limiting in t...
There has been an explosion in the amount of digital text information available in recent years, lea...
We present a hybrid algorithm for Bayesian topic models that combines the efficiency of sparse Gibbs...
Given the overwhelming quantities of data generated every day, there is a pressing need for tools th...
What type of algorithms and statistical techniques support learning from very large datasets over lo...
Understanding large collections of unstructured documents remains a persistent problem. Users need t...
Latent Dirichlet allocation (LDA) is an important probabilistic generative model and has usually use...
Latent variable models of text, such as topic models, have been explored in many areas of natural la...
In the era of "big data", scalable statistical inference is necessary to learn from new and growing ...
Online content have become an important medium to disseminate information and express opinions. With...
<p>Topic models, and more specifically the class of latent Dirichlet allocation (LDA), are widely us...
Probabilistic topic models have proven to be an extremely versatile class of mixed-membership models...
Online content have become an important medium to disseminate information and express opinions. With...
ABSTRACT Topic models have played a pivotal role in analyzing large collections of complex data. Bes...
Intrinsically, topic models have always their likelihood functions fixed to multinomial distributio...
With the physical constraints of semiconductor-based electronics becoming increasingly limiting in t...
There has been an explosion in the amount of digital text information available in recent years, lea...
We present a hybrid algorithm for Bayesian topic models that combines the efficiency of sparse Gibbs...
Given the overwhelming quantities of data generated every day, there is a pressing need for tools th...
What type of algorithms and statistical techniques support learning from very large datasets over lo...
Understanding large collections of unstructured documents remains a persistent problem. Users need t...
Latent Dirichlet allocation (LDA) is an important probabilistic generative model and has usually use...
Latent variable models of text, such as topic models, have been explored in many areas of natural la...
In the era of "big data", scalable statistical inference is necessary to learn from new and growing ...
Online content have become an important medium to disseminate information and express opinions. With...
<p>Topic models, and more specifically the class of latent Dirichlet allocation (LDA), are widely us...
Probabilistic topic models have proven to be an extremely versatile class of mixed-membership models...
Online content have become an important medium to disseminate information and express opinions. With...
ABSTRACT Topic models have played a pivotal role in analyzing large collections of complex data. Bes...
Intrinsically, topic models have always their likelihood functions fixed to multinomial distributio...
With the physical constraints of semiconductor-based electronics becoming increasingly limiting in t...
There has been an explosion in the amount of digital text information available in recent years, lea...