International audienceThe LDA topic model describes a corpus on the basis of its vocabulary. Our experiment aims at determining whether LDA outputs' quality can be estimated through text similarity metrics, and if so determining the most relevant one. To do so, we use a categorized corpus on which we apply these metrics on every pair of categories. We present correlation scores between several metrics and the quality of the topic model. The experiments also include a comparison between simple and complex term extraction within our framework. We observed very high correlations with the Hellinger distance with or without complex terms, while the Soergel distance is most efficient when including complex terms. These experiments are a case stud...
As a quantitative text analytic method, Latent Dirichlet Allocation (LDA) topic modeling has been wi...
Ekinci, Ekin/0000-0003-0658-592X; ilhan omurca, sevinc/0000-0003-1214-9235Topic models, such as late...
Probabilistic topic models, such as LDA, are standard text analysis algorithms that provide predicti...
International audienceThe LDA topic model describes a corpus on the basis of its vocabulary. Our exp...
Probabilistic topic models have become one of the most widespread machine learning technique for tex...
We present in this paper experiments with several semantic similarity measures based on the unsuperv...
Topic modelling approaches such as LDA, when applied on a tweet corpus, can often generate a topic m...
Latent Dirichlet Allocation (LDA) has become the most stable and widely used topic model to derive t...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
With the vast amount of information available on the Internet today, helping users find relevant con...
Approaches for estimating the similarity between individual publications are an area of long -standi...
Topic models arise from the need of understanding and exploring large text document collections and...
Text mining has a wide range of applications in education. In this paper, we review Latent Dirichlet...
Probabilistic topic models have become one of the most widespread machine learning technique for te...
Abstract Weak topic correlation across document collections with different numbers of topics in indi...
As a quantitative text analytic method, Latent Dirichlet Allocation (LDA) topic modeling has been wi...
Ekinci, Ekin/0000-0003-0658-592X; ilhan omurca, sevinc/0000-0003-1214-9235Topic models, such as late...
Probabilistic topic models, such as LDA, are standard text analysis algorithms that provide predicti...
International audienceThe LDA topic model describes a corpus on the basis of its vocabulary. Our exp...
Probabilistic topic models have become one of the most widespread machine learning technique for tex...
We present in this paper experiments with several semantic similarity measures based on the unsuperv...
Topic modelling approaches such as LDA, when applied on a tweet corpus, can often generate a topic m...
Latent Dirichlet Allocation (LDA) has become the most stable and widely used topic model to derive t...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
With the vast amount of information available on the Internet today, helping users find relevant con...
Approaches for estimating the similarity between individual publications are an area of long -standi...
Topic models arise from the need of understanding and exploring large text document collections and...
Text mining has a wide range of applications in education. In this paper, we review Latent Dirichlet...
Probabilistic topic models have become one of the most widespread machine learning technique for te...
Abstract Weak topic correlation across document collections with different numbers of topics in indi...
As a quantitative text analytic method, Latent Dirichlet Allocation (LDA) topic modeling has been wi...
Ekinci, Ekin/0000-0003-0658-592X; ilhan omurca, sevinc/0000-0003-1214-9235Topic models, such as late...
Probabilistic topic models, such as LDA, are standard text analysis algorithms that provide predicti...