In many domains data items are represented by vectors of counts: count data arises, for example, in bioinformatics or analysis of text documents represented as word count vectors. However, often the amount of data available from an interesting data source is too small to model the data source well. When several data sets are available from related sources, exploiting their similarities by transfer learning can improve the resulting models compared to modeling sources independently. We introduce a Bayesian generative transfer learning model which represents similarity across document collections by sparse sharing of latent topics controlled by an Indian buffet process. Unlike a prominent previous model, hierarchical Dirichlet process (HDP) b...
Topic modeling is a suite of algorithms, which aims to discover the hidden structures in large digit...
Abstract. Knowledge transfer from multiple source domains to a target domain is crucial in transfer ...
We present a hybrid algorithm for Bayesian topic models that combines the efficiency of sparse Gibbs...
In many domains data items are represented by vectors of counts; count data arises for example in bi...
Latent dirichlet allocation Transfer learning a b s t r a c t Due to the scarcity of user interest i...
The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, ...
The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, ...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
We describe distributed algorithms for two widely-used topic models, namely the Latent Dirichlet All...
In this paper, we show how using the Dirichlet Process mixture model as a generative model of data s...
Since the deep learning revolution, a general trend in machine learning literature has been that lar...
In applications we may want to compare different document collections: they could have shared conten...
We study the problem of constructing the topic-based model over different domains for text classific...
textIn several applications, scarcity of labeled data is a challenging problem that hinders the pred...
Editor: We describe distributed algorithms for two widely-used topic models, namely the Latent Diric...
Topic modeling is a suite of algorithms, which aims to discover the hidden structures in large digit...
Abstract. Knowledge transfer from multiple source domains to a target domain is crucial in transfer ...
We present a hybrid algorithm for Bayesian topic models that combines the efficiency of sparse Gibbs...
In many domains data items are represented by vectors of counts; count data arises for example in bi...
Latent dirichlet allocation Transfer learning a b s t r a c t Due to the scarcity of user interest i...
The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, ...
The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, ...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
We describe distributed algorithms for two widely-used topic models, namely the Latent Dirichlet All...
In this paper, we show how using the Dirichlet Process mixture model as a generative model of data s...
Since the deep learning revolution, a general trend in machine learning literature has been that lar...
In applications we may want to compare different document collections: they could have shared conten...
We study the problem of constructing the topic-based model over different domains for text classific...
textIn several applications, scarcity of labeled data is a challenging problem that hinders the pred...
Editor: We describe distributed algorithms for two widely-used topic models, namely the Latent Diric...
Topic modeling is a suite of algorithms, which aims to discover the hidden structures in large digit...
Abstract. Knowledge transfer from multiple source domains to a target domain is crucial in transfer ...
We present a hybrid algorithm for Bayesian topic models that combines the efficiency of sparse Gibbs...