Large organizations often face the critical challenge of sharing information and maintaining connections between disparate subunits. Tools for automated analysis of document collections, such as topic models, can provide an important means for communication. The value of topic modeling is in its ability to discover interpretable, coherent themes from unstructured document sets, yet it is not unusual to find semantic mismatches that substantially reduce user confidence. In this paper, we first present an expert-driven topic annotation study, undertaken in order to obtain an annotated set of baseline topics and their distinguishing characteristics. We then present a metric for detecting poor-quality topics that does not rely on human feedback...
National audienceTopic modeling is a growing research field and novel ways of interpreting and evalu...
Abstract—Given a topic and its top-k most relevant words generated by a topic model, how can we tell...
When developing topic models, a critical question that should be asked is: How well will this model ...
Large organizations often face the critical challenge of sharing information and maintaining connect...
Managing large collections of documents is an important problem for many areas of science, industry,...
Topic modeling is a popular technique for exploring large document collections. It has proven useful...
Topic models can learn topics that are highly interpretable, semantically-coherent and can be used s...
Topic models could have a huge impact on improving the ways users find and discover content in digit...
Topic models extract representative word sets—called topics—from word counts in documents without re...
Topic modeling is an important tool in social media anal-ysis, allowing researchers to quickly under...
Topic models have the potential to improve search and browsing by extracting useful semantic themes ...
Topic modeling algorithms, such as LDA, find topics, hidden structures, in document corpora in an un...
Abstract—Given a topic and its top-k most relevant words generated by a topic model, how can we tell...
Topic models are widely used unsupervised models capable of learning topics – weighted lists of word...
Topic models are unsupervised techniques that extract likely topics from text corpora, by creating p...
National audienceTopic modeling is a growing research field and novel ways of interpreting and evalu...
Abstract—Given a topic and its top-k most relevant words generated by a topic model, how can we tell...
When developing topic models, a critical question that should be asked is: How well will this model ...
Large organizations often face the critical challenge of sharing information and maintaining connect...
Managing large collections of documents is an important problem for many areas of science, industry,...
Topic modeling is a popular technique for exploring large document collections. It has proven useful...
Topic models can learn topics that are highly interpretable, semantically-coherent and can be used s...
Topic models could have a huge impact on improving the ways users find and discover content in digit...
Topic models extract representative word sets—called topics—from word counts in documents without re...
Topic modeling is an important tool in social media anal-ysis, allowing researchers to quickly under...
Topic models have the potential to improve search and browsing by extracting useful semantic themes ...
Topic modeling algorithms, such as LDA, find topics, hidden structures, in document corpora in an un...
Abstract—Given a topic and its top-k most relevant words generated by a topic model, how can we tell...
Topic models are widely used unsupervised models capable of learning topics – weighted lists of word...
Topic models are unsupervised techniques that extract likely topics from text corpora, by creating p...
National audienceTopic modeling is a growing research field and novel ways of interpreting and evalu...
Abstract—Given a topic and its top-k most relevant words generated by a topic model, how can we tell...
When developing topic models, a critical question that should be asked is: How well will this model ...