The OLAP technology emerged 20 years ago and recently has been redesigned so that its dimensions, hierarchies and measures can support the particularities of textual data. Organizing textual data hierarchically can be solved with topic hierarchies. Currently, the topic hierarchy is defined only once in the data cube, i.e., for the entire lattice of cuboids. However, such hierarchy is sensitive to the document collection content. Thus, a data cube cell can contain a collection of documents distinct from others in the same cube, causing potential changes in the topic hierarchy. Furthermore, the text segment used in OLAP analysis also changes this hierarchy. In this work, we present a textual data cube with multiple dynamic topic hierar...
The exponential growth of the size and popularity of the world wide web has increased the interest i...
In the last few years, numerous proposals for modelling and querying Multidimensional Databases (MDD...
International audienceTopic segmentation traditionally relies on lexical cohesion measured through w...
It is crucial in many information systems to organize short text segments, such as keywords in docum...
Automated generation of high-quality topical hierarchies for a text collection is a dream problem in...
Topic models such as latent Dirichlet allocation (LDA) and hierarchical Dirichlet processes (HDP) ar...
This paper deals with the problem of physical clustering of multidimensional data that are organized...
Abstract. The standard approach to OLAP requires measures and di-mensions of a cube to be known at t...
Topic models such as latent Dirichlet allocation (LDA) and hierarchical Dirichlet processes (HDP) ar...
Hierarchies have long been used for organization, summarization, and access to information. In this ...
This thesis proposes a novel model for automatically generate topic map for a document corpus with n...
A large portion of real world data is either text or structured (\eg, relational) data. Such data o...
Uncovering the topics over short text corpus has become increasingly important with the bursty devel...
The sizes of modern digital libraries have grown beyond our capacity to comprehend manually. Thus we...
Abstract. We present a novel framework for comprehensive exploration of OLAP data by means of user-d...
The exponential growth of the size and popularity of the world wide web has increased the interest i...
In the last few years, numerous proposals for modelling and querying Multidimensional Databases (MDD...
International audienceTopic segmentation traditionally relies on lexical cohesion measured through w...
It is crucial in many information systems to organize short text segments, such as keywords in docum...
Automated generation of high-quality topical hierarchies for a text collection is a dream problem in...
Topic models such as latent Dirichlet allocation (LDA) and hierarchical Dirichlet processes (HDP) ar...
This paper deals with the problem of physical clustering of multidimensional data that are organized...
Abstract. The standard approach to OLAP requires measures and di-mensions of a cube to be known at t...
Topic models such as latent Dirichlet allocation (LDA) and hierarchical Dirichlet processes (HDP) ar...
Hierarchies have long been used for organization, summarization, and access to information. In this ...
This thesis proposes a novel model for automatically generate topic map for a document corpus with n...
A large portion of real world data is either text or structured (\eg, relational) data. Such data o...
Uncovering the topics over short text corpus has become increasingly important with the bursty devel...
The sizes of modern digital libraries have grown beyond our capacity to comprehend manually. Thus we...
Abstract. We present a novel framework for comprehensive exploration of OLAP data by means of user-d...
The exponential growth of the size and popularity of the world wide web has increased the interest i...
In the last few years, numerous proposals for modelling and querying Multidimensional Databases (MDD...
International audienceTopic segmentation traditionally relies on lexical cohesion measured through w...