In this paper, we propose a novel algorithm that rearrange the topic assignment results obtained from topic modeling algorithms, including NMF and LDA. The effectiveness of the algorithm is measured by how much the results conform to expert opinion, which is a data structure called TDAG that we defined to represent the probability that a pair of highly correlated words appear together. In order to make sure that the internal structure does not get changed too much from the rearrangement, coherence, which is a well known metric for measuring the effectiveness of topic modeling, is used to control the balance of the internal structure. We developed two ways to systematically obtain the expert opinion from data, depending on whether the data h...
Theoretical thesis.Spine title: Topic coherence after document restructuring.Bibliography: leaves 52...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
National audienceTopic modeling is a growing research field and novel ways of interpreting and evalu...
Probabilistic topic models have become one of the most widespread machine learning technique for tex...
Probabilistic topic models have become one of the most widespread machine learning technique for tex...
Probabilistic topic models have become one of the most widespread machine learning technique for te...
Probabilistic topic models have become one of the most widespread machine learning technique for te...
Topic models arise from the need of understanding and exploring large text document collections and...
In recent years, topic modeling has become an established method in the analysis of text corpora, wi...
Topic models arise from the need of understanding and exploring large text document collections and...
This paper studies how to incorporate the ex-ternal word correlation knowledge to improve the cohere...
Topics discovered by the latent Dirichlet allocation (LDA) method are sometimes not meaningful for h...
Large organizations often face the critical challenge of sharing information and maintaining connect...
Large organizations often face the critical challenge of sharing information and maintaining connect...
Theoretical thesis.Spine title: Topic coherence after document restructuring.Bibliography: leaves 52...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
National audienceTopic modeling is a growing research field and novel ways of interpreting and evalu...
Probabilistic topic models have become one of the most widespread machine learning technique for tex...
Probabilistic topic models have become one of the most widespread machine learning technique for tex...
Probabilistic topic models have become one of the most widespread machine learning technique for te...
Probabilistic topic models have become one of the most widespread machine learning technique for te...
Topic models arise from the need of understanding and exploring large text document collections and...
In recent years, topic modeling has become an established method in the analysis of text corpora, wi...
Topic models arise from the need of understanding and exploring large text document collections and...
This paper studies how to incorporate the ex-ternal word correlation knowledge to improve the cohere...
Topics discovered by the latent Dirichlet allocation (LDA) method are sometimes not meaningful for h...
Large organizations often face the critical challenge of sharing information and maintaining connect...
Large organizations often face the critical challenge of sharing information and maintaining connect...
Theoretical thesis.Spine title: Topic coherence after document restructuring.Bibliography: leaves 52...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...