Topics models, such as LDA, are widely used in Natural Language Processing. Making their output interpretable is an important area of research with applications to areas such as the enhancement of exploratory search interfaces and the development of interpretable machine learning models. Conventionally, topics are represented by their n most probable words, however, these representations are often difficult for humans to interpret. This paper explores the re-ranking of topic words to generate more interpretable topic representations. A range of approaches are compared and evaluated in two experiments. The first uses crowdworkers to associate topics represented by different word rankings with related documents. The second experiment is an au...
Readitopics provides a new tool for browsing a textual corpus that showcases several recent work on ...
Topic models arise from the need of understanding and exploring large text document collections and...
Topic models arise from the need of understanding and exploring large text document collections and...
Topic models are widely used in natural language processing (NLP). Ensuring that their output is int...
Probabilistic topic models, such as LDA, are standard text analysis algorithms that provide predicti...
Abstract. How to improve the rankings of the relevant documents plays a key role in information retr...
Probabilistic topic models have become one of the most widespread machine learning techniques in te...
Managing large collections of documents is an important problem for many areas of science, industry,...
Topic modeling has been widely utilized in the fields of information retrieval, text mining, text cl...
Recent work suggests knowledge sources can be added into the topic modeling process to label topics ...
Probabilistic topic models have become one of the most widespread machine learning technique for tex...
Topic models have the potential to improve search and browsing by extracting useful semantic themes ...
Probabilistic topic models have become one of the most widespread machine learning technique for tex...
In domain-specific information retrieval (IR), an emerging problem is how to provide different users...
In many domains that employ machine learning models, both high performing and interpretable models a...
Readitopics provides a new tool for browsing a textual corpus that showcases several recent work on ...
Topic models arise from the need of understanding and exploring large text document collections and...
Topic models arise from the need of understanding and exploring large text document collections and...
Topic models are widely used in natural language processing (NLP). Ensuring that their output is int...
Probabilistic topic models, such as LDA, are standard text analysis algorithms that provide predicti...
Abstract. How to improve the rankings of the relevant documents plays a key role in information retr...
Probabilistic topic models have become one of the most widespread machine learning techniques in te...
Managing large collections of documents is an important problem for many areas of science, industry,...
Topic modeling has been widely utilized in the fields of information retrieval, text mining, text cl...
Recent work suggests knowledge sources can be added into the topic modeling process to label topics ...
Probabilistic topic models have become one of the most widespread machine learning technique for tex...
Topic models have the potential to improve search and browsing by extracting useful semantic themes ...
Probabilistic topic models have become one of the most widespread machine learning technique for tex...
In domain-specific information retrieval (IR), an emerging problem is how to provide different users...
In many domains that employ machine learning models, both high performing and interpretable models a...
Readitopics provides a new tool for browsing a textual corpus that showcases several recent work on ...
Topic models arise from the need of understanding and exploring large text document collections and...
Topic models arise from the need of understanding and exploring large text document collections and...