This work presents an alternative method to represent documents based on LDA (Latent Dirichlet Allocation) and how it affects to classification algorithms, in comparison to common text representation. LDA assumes that each document deals with a set of predefined topics, which are distributions over an entire vocabulary. Our main objective is to use the probability of a document belonging to each topic to implement a new text representation model. This proposed technique is deployed as an extension of the Weka software as a new filter. To demonstrate its performance, the created filter is tested with different classifiers such as a Support Vector Machine (SVM), k-Nearest Neighbors (k-NN), and Naive Bayes in different documental corpora (OHSU...
This paper introduces the ldagibbs command which implements Latent Dirichlet Allocation in Stata. La...
Latent Dirichlet Allocation is a generative probabilistic model that can be used to describe and ana...
Many approaches have been introduced to enable Latent Dirichlet Allocation (LDA) models to be update...
This work presents an alternative method to represent documents based on LDA (Latent Dirichlet Alloc...
Latent Dirichlet Allocation (henceforth LDA) is a statistical model that can be used to represent na...
Probabilistic topic models such as latent Dirichlet allocation (LDA) are widespread tools to analyse...
Latent Dirichlet Allocation (LDA) is a probability model for grouping hidden topics in documents by ...
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of ...
Latent Dirichlet Allocation (LDA) is a scheme which may be used to estimate topics and their probabi...
To assess critically the scientific literature is a very challenging task; in general it requires an...
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of d...
In the Information Age, a proliferation of unstructured text electronic documents exists. Processin...
Search algorithms incorporating some form of topic model have a long history in information retrieva...
Abstract-Text categorization is the task of automatically assigning unlabeled text documents to some...
This paper introduces the ldagibbs command which implements Latent Dirichlet Allocation in Stata. La...
This paper introduces the ldagibbs command which implements Latent Dirichlet Allocation in Stata. La...
Latent Dirichlet Allocation is a generative probabilistic model that can be used to describe and ana...
Many approaches have been introduced to enable Latent Dirichlet Allocation (LDA) models to be update...
This work presents an alternative method to represent documents based on LDA (Latent Dirichlet Alloc...
Latent Dirichlet Allocation (henceforth LDA) is a statistical model that can be used to represent na...
Probabilistic topic models such as latent Dirichlet allocation (LDA) are widespread tools to analyse...
Latent Dirichlet Allocation (LDA) is a probability model for grouping hidden topics in documents by ...
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of ...
Latent Dirichlet Allocation (LDA) is a scheme which may be used to estimate topics and their probabi...
To assess critically the scientific literature is a very challenging task; in general it requires an...
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of d...
In the Information Age, a proliferation of unstructured text electronic documents exists. Processin...
Search algorithms incorporating some form of topic model have a long history in information retrieva...
Abstract-Text categorization is the task of automatically assigning unlabeled text documents to some...
This paper introduces the ldagibbs command which implements Latent Dirichlet Allocation in Stata. La...
This paper introduces the ldagibbs command which implements Latent Dirichlet Allocation in Stata. La...
Latent Dirichlet Allocation is a generative probabilistic model that can be used to describe and ana...
Many approaches have been introduced to enable Latent Dirichlet Allocation (LDA) models to be update...