The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, creates a great demand for effective and efficient topic mod-eling approaches. While latent Dirichlet allocation (LDA) can be applied, it is not optimal due to its weakness in handling short texts with fast-changing topics and scalability concerns. In this paper, we pro-pose a transfer learning approach that utilizes abun-dant labeled documents from other domains (such as Yahoo! News or Wikipedia) to improve topic mod-eling, with better model fitting and result interpre-tation. Specifically, we develop Transfer Hierarchical LDA (thLDA) model, which incorporates the label information from other domains via informative pri-ors. In addition, we ...
E-petitions have become a popular vehicle for political activism, but studying them has been difficu...
Ekinci, Ekin/0000-0003-0658-592X; ilhan omurca, sevinc/0000-0003-1214-9235Topic models, such as late...
The aim of this bachelor thesis is to compare and empirically test the use of classification to impr...
The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, ...
Latent dirichlet allocation Transfer learning a b s t r a c t Due to the scarcity of user interest i...
Twitter, or the world of 140 characters poses serious challenges to the efficacy of topic models on ...
Much of human knowledge sits in large databases of unstructured text. Leveraging this knowledge requ...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
Latent topic analysis has emerged as one of the most effective methods for classifying, clustering a...
In many domains data items are represented by vectors of counts; count data arises for example in bi...
In many domains data items are represented by vectors of counts: count data arises, for example, in ...
In this paper, I apply latent dirichlet allocation(LDA) to cluster 100,000 health related articles u...
Topics discovered by the latent Dirichlet allocation (LDA) method are sometimes not meaningful for h...
Latent Dirichlet allocation (LDA) is a topic model that has been applied to var-ious fields, includi...
We present a novel topic mixture-based language model adaptation approach that uses La-tent Dirichle...
E-petitions have become a popular vehicle for political activism, but studying them has been difficu...
Ekinci, Ekin/0000-0003-0658-592X; ilhan omurca, sevinc/0000-0003-1214-9235Topic models, such as late...
The aim of this bachelor thesis is to compare and empirically test the use of classification to impr...
The increasing volume of short texts generated on so-cial media sites, such as Twitter or Facebook, ...
Latent dirichlet allocation Transfer learning a b s t r a c t Due to the scarcity of user interest i...
Twitter, or the world of 140 characters poses serious challenges to the efficacy of topic models on ...
Much of human knowledge sits in large databases of unstructured text. Leveraging this knowledge requ...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
Latent topic analysis has emerged as one of the most effective methods for classifying, clustering a...
In many domains data items are represented by vectors of counts; count data arises for example in bi...
In many domains data items are represented by vectors of counts: count data arises, for example, in ...
In this paper, I apply latent dirichlet allocation(LDA) to cluster 100,000 health related articles u...
Topics discovered by the latent Dirichlet allocation (LDA) method are sometimes not meaningful for h...
Latent Dirichlet allocation (LDA) is a topic model that has been applied to var-ious fields, includi...
We present a novel topic mixture-based language model adaptation approach that uses La-tent Dirichle...
E-petitions have become a popular vehicle for political activism, but studying them has been difficu...
Ekinci, Ekin/0000-0003-0658-592X; ilhan omurca, sevinc/0000-0003-1214-9235Topic models, such as late...
The aim of this bachelor thesis is to compare and empirically test the use of classification to impr...