We investigate two important properties of real data: diversity and log-normality. Log-normality accounts for the fact that data follow the lognormal distribution, whereas diversity measures variations of the attributes in the data. To our knowledge, these two inherent properties have not been paid much attention from the machine learning community, especially from the topic modeling community. In this article, we fill in this gap in the framework of topic modeling. We first investigate whether or not these two properties can be captured by the most well-known Latent Dirichlet Allocation model (LDA), and find that LDA behaves inconsistently with respect to diversity. Particularly, it favors data of low diversity, but works badly on data of ...
This research project aims to provide a clear and concise guide to latent dirichlet allocation which...
Abstract. We present the discrete infinite logistic normal distribution (DILN), a Bayesian nonparame...
It is important to identify the ``correct'' number of topics in mechanisms like Latent Dirichlet All...
Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical ana...
Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical ana...
Topic models, such as latent Dirichlet allocation (LDA), have been an effective tool for the statist...
Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical ana...
Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical ana...
Despite many years of research into latent Dirichlet allocation (LDA), applying LDA to collections o...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
Latent Dirichlet Allocation (LDA) is a popular machine-learning technique that identifies latent str...
Recent work on variational autoencoders (VAEs) has enabled the development of generative topic model...
2014 Topic models such as the latent Dirichlet allocation (LDA) have become a standard staple in the...
In latent Dirichlet allocation (LDA), topics are multino-mial distributions over the entire vocabula...
Latent Dirichlet analysis, or topic modeling, is a flexible latent variable framework for modeling h...
This research project aims to provide a clear and concise guide to latent dirichlet allocation which...
Abstract. We present the discrete infinite logistic normal distribution (DILN), a Bayesian nonparame...
It is important to identify the ``correct'' number of topics in mechanisms like Latent Dirichlet All...
Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical ana...
Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical ana...
Topic models, such as latent Dirichlet allocation (LDA), have been an effective tool for the statist...
Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical ana...
Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical ana...
Despite many years of research into latent Dirichlet allocation (LDA), applying LDA to collections o...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
Latent Dirichlet Allocation (LDA) is a popular machine-learning technique that identifies latent str...
Recent work on variational autoencoders (VAEs) has enabled the development of generative topic model...
2014 Topic models such as the latent Dirichlet allocation (LDA) have become a standard staple in the...
In latent Dirichlet allocation (LDA), topics are multino-mial distributions over the entire vocabula...
Latent Dirichlet analysis, or topic modeling, is a flexible latent variable framework for modeling h...
This research project aims to provide a clear and concise guide to latent dirichlet allocation which...
Abstract. We present the discrete infinite logistic normal distribution (DILN), a Bayesian nonparame...
It is important to identify the ``correct'' number of topics in mechanisms like Latent Dirichlet All...