Ranking of resources in social tagging systems is a difficult problem due to the inherent sparsity of the data and the vo-cabulary problems introduced by having a completely unre-stricted lexicon. In this paper we propose to use hidden topic models as a principled way of reducing the dimensionality of this data to provide more accurate resource rankings with higher recall. We first describe Latent Dirichlet Allocation (LDA) and then show how it can be used to rank resources in a social bookmarking system. We test the LDA tagging model and compare it with 3 non-topic model baselines on a large data sample obtained from the Delicious social book-marking site. Our evaluations show that our LDA-based method significantly outperforms all of the ...
Microblogging as a kind of social network has become more and more important in our daily lives. Eno...
Latent dirichlet allocation Transfer learning a b s t r a c t Due to the scarcity of user interest i...
Abstract-Text categorization is the task of automatically assigning unlabeled text documents to some...
Ranking of resources in social tagging systems is a difficult problem due to the inherent sparsity o...
Much of human knowledge sits in large databases of unstructured text. Leveraging this knowledge requ...
In this paper, I apply latent dirichlet allocation(LDA) to cluster 100,000 health related articles u...
Today we are living in modern Internet era. We can get all our information from the internet anytime...
It is challenging to discover relevant features from long documents that describe user information n...
Probabilistic topic models, such as LDA, are standard text analysis algorithms that provide predicti...
Search algorithms incorporating some form of topic model have a long history in information retrieva...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
Latent topic analysis has emerged as one of the most effective methods for classifying, clustering a...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
Microblogging as a kind of social network has become more and more important in our daily lives. Eno...
Microblogging as a kind of social network has become more and more important in our daily lives. Eno...
Latent dirichlet allocation Transfer learning a b s t r a c t Due to the scarcity of user interest i...
Abstract-Text categorization is the task of automatically assigning unlabeled text documents to some...
Ranking of resources in social tagging systems is a difficult problem due to the inherent sparsity o...
Much of human knowledge sits in large databases of unstructured text. Leveraging this knowledge requ...
In this paper, I apply latent dirichlet allocation(LDA) to cluster 100,000 health related articles u...
Today we are living in modern Internet era. We can get all our information from the internet anytime...
It is challenging to discover relevant features from long documents that describe user information n...
Probabilistic topic models, such as LDA, are standard text analysis algorithms that provide predicti...
Search algorithms incorporating some form of topic model have a long history in information retrieva...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
Latent topic analysis has emerged as one of the most effective methods for classifying, clustering a...
Thesis (Master's)--University of Washington, 2014In their 2001 work Latent Dirichlet Allocation, Ble...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
Microblogging as a kind of social network has become more and more important in our daily lives. Eno...
Microblogging as a kind of social network has become more and more important in our daily lives. Eno...
Latent dirichlet allocation Transfer learning a b s t r a c t Due to the scarcity of user interest i...
Abstract-Text categorization is the task of automatically assigning unlabeled text documents to some...