Automated generation of high-quality topical hierarchies for a text collection is a dream problem in knowledge engineer-ing with many valuable applications. In this paper a scalable and robust algorithm is proposed for constructing a hierar-chy of topics from a text collection. We divide and conquer the problem using a top-down recursive framework, based on a tensor orthogonal decomposition technique. We solve a critical challenge to perform scalable inference for our newly designed hierarchical topic model. Experiments with various real-world datasets illustrate its ability to generate robust, high-quality hierarchies efficiently. Our method reduces the time of construction by several orders of magnitude, and its robust feature renders it ...
Lots of document collections are well organized in hierarchical structure, and such structure can he...
Topic models such as latent Dirichlet allocation (LDA) and hierarchical Dirichlet processes (HDP) ar...
We propose an unsupervised divisive partitioning algorithm for document data sets which enjoys many ...
Automatic construction of user-desired topical hierarchies over large volumes of text data is a high...
It is crucial in many information systems to organize short text segments, such as keywords in docum...
Topic hierarchies can help researchers to develop a quick and concise understanding of the main them...
The sizes of modern digital libraries have grown beyond our capacity to comprehend manually. Thus we...
Hierarchies have long been used for organization, summarization, and access to information. In this ...
Hierarchical topic modeling is a potentially powerful instrument for determining topical structures ...
The OLAP technology emerged 20 years ago and recently has been redesigned so that its dimensions, h...
This thesis proposes a novel model for automatically generate topic map for a document corpus with n...
Developing intuition for the content of a digital collection is difficult. Hierarchies of subject te...
Topic models such as latent Dirichlet allocation (LDA) and hierarchical Dirichlet processes (HDP) ar...
In this paper, we present a novel method for automatically building hierarchical topic structures of...
International audienceThe most popular topic modelling algorithm, Latent Dirichlet Allocation, produ...
Lots of document collections are well organized in hierarchical structure, and such structure can he...
Topic models such as latent Dirichlet allocation (LDA) and hierarchical Dirichlet processes (HDP) ar...
We propose an unsupervised divisive partitioning algorithm for document data sets which enjoys many ...
Automatic construction of user-desired topical hierarchies over large volumes of text data is a high...
It is crucial in many information systems to organize short text segments, such as keywords in docum...
Topic hierarchies can help researchers to develop a quick and concise understanding of the main them...
The sizes of modern digital libraries have grown beyond our capacity to comprehend manually. Thus we...
Hierarchies have long been used for organization, summarization, and access to information. In this ...
Hierarchical topic modeling is a potentially powerful instrument for determining topical structures ...
The OLAP technology emerged 20 years ago and recently has been redesigned so that its dimensions, h...
This thesis proposes a novel model for automatically generate topic map for a document corpus with n...
Developing intuition for the content of a digital collection is difficult. Hierarchies of subject te...
Topic models such as latent Dirichlet allocation (LDA) and hierarchical Dirichlet processes (HDP) ar...
In this paper, we present a novel method for automatically building hierarchical topic structures of...
International audienceThe most popular topic modelling algorithm, Latent Dirichlet Allocation, produ...
Lots of document collections are well organized in hierarchical structure, and such structure can he...
Topic models such as latent Dirichlet allocation (LDA) and hierarchical Dirichlet processes (HDP) ar...
We propose an unsupervised divisive partitioning algorithm for document data sets which enjoys many ...