In many domains data items are represented by vectors of counts; count data arises for example in bioinformatics or analysis of text documents represented as word count vectors. However, often the amount of data available from an interesting data source is too small to model the data source well. When several data sets are available from related sources, exploiting their similarities by trans-fer learning can improve the resulting models compared to modeling sources independently. We introduce a Bayesian generative transfer learning model which represents similarity across document collections by sparse sharing of latent topics controlled by an Indian Buffet Process. Unlike a prominent previous model, Hierarchical Dirichlet Process (HDP) ba...
We present a nonparametric hierarchical Bayesian model of document collections that decouples sparsi...
Since the deep learning revolution, a general trend in machine learning literature has been that lar...
textIn several applications, scarcity of labeled data is a challenging problem that hinders the pred...
In many domains data items are represented by vectors of counts: count data arises, for example, in ...
Latent dirichlet allocation Transfer learning a b s t r a c t Due to the scarcity of user interest i...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
We present a hybrid algorithm for Bayesian topic models that combines the efficiency of sparse Gibbs...
The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, ...
The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, ...
We describe distributed algorithms for two widely-used topic models, namely the Latent Dirichlet All...
Topic modeling is a suite of algorithms, which aims to discover the hidden structures in large digit...
In this paper, we show how using the Dirichlet Process mixture model as a generative model of data s...
Editor: We describe distributed algorithms for two widely-used topic models, namely the Latent Diric...
Latent Dirichlet analysis, or topic modeling, is a flexible latent variable framework for modeling h...
We study the problem of constructing the topic-based model over different domains for text classific...
We present a nonparametric hierarchical Bayesian model of document collections that decouples sparsi...
Since the deep learning revolution, a general trend in machine learning literature has been that lar...
textIn several applications, scarcity of labeled data is a challenging problem that hinders the pred...
In many domains data items are represented by vectors of counts: count data arises, for example, in ...
Latent dirichlet allocation Transfer learning a b s t r a c t Due to the scarcity of user interest i...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
We present a hybrid algorithm for Bayesian topic models that combines the efficiency of sparse Gibbs...
The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, ...
The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, ...
We describe distributed algorithms for two widely-used topic models, namely the Latent Dirichlet All...
Topic modeling is a suite of algorithms, which aims to discover the hidden structures in large digit...
In this paper, we show how using the Dirichlet Process mixture model as a generative model of data s...
Editor: We describe distributed algorithms for two widely-used topic models, namely the Latent Diric...
Latent Dirichlet analysis, or topic modeling, is a flexible latent variable framework for modeling h...
We study the problem of constructing the topic-based model over different domains for text classific...
We present a nonparametric hierarchical Bayesian model of document collections that decouples sparsi...
Since the deep learning revolution, a general trend in machine learning literature has been that lar...
textIn several applications, scarcity of labeled data is a challenging problem that hinders the pred...