This article presents a probabilistic generative model for text based on semantic topics and syntactic classes called Part-of-Speech LDA (POSLDA). POSLDA simultaneously uncovers short-range syntactic patterns (syntax) and long-range semantic patterns (topics) that exist in document collections. This results in word distributions that are specific to both topics (sports, education,...) and parts-of-speech (nouns, verbs,...). For example, multinomial distributions over words are uncovered that can be understood as “nouns about weather ” or “verbs about law”. We describe the model and an approximate inference algorithm and then demonstrate the quality of the learned topics both qualitatively and quantitatively. Then, we discuss an NLP applicat...
We present a novel topic mixture-based language model adaptation approach that uses La-tent Dirichle...
Probabilistic topic models are widely used to discover latent topics in document collec-tions, while...
Probabilistic topic models are machine learning tools for processing and understanding large text d...
The syntactic topic model (STM) is a Bayesian nonparametric model of language that discovers latent ...
Probabilistic topic models are unsupervised generative models which model document content as a two-...
We investigate the use of topic models, such as probabilistic latent semantic analysis (PLSA) and la...
We investigate the use of topic models, such as probabilistic latent semantic analysis (PLSA) and la...
This paper establishes a connection between two apparently very different kinds of probabilistic mod...
We describe a probabilistic framework for acquiring selectional preferences of linguistic predi-cate...
International audienceProbabilistic topic models are generative models that describe the content of ...
International audienceProbabilistic topic models are generative models that describe the content of ...
Probabilistic topic modeling is a powerful tool to uncover hidden thematic structure of documents. T...
Topic Models like Latent Dirichlet Allocation have been widely used for their robustness in estimati...
Topic Models like Latent Dirichlet Allocation have been widely used for their robustness in estimati...
Topic Models like Latent Dirichlet Allocation have been widely used for their robustness in estimati...
We present a novel topic mixture-based language model adaptation approach that uses La-tent Dirichle...
Probabilistic topic models are widely used to discover latent topics in document collec-tions, while...
Probabilistic topic models are machine learning tools for processing and understanding large text d...
The syntactic topic model (STM) is a Bayesian nonparametric model of language that discovers latent ...
Probabilistic topic models are unsupervised generative models which model document content as a two-...
We investigate the use of topic models, such as probabilistic latent semantic analysis (PLSA) and la...
We investigate the use of topic models, such as probabilistic latent semantic analysis (PLSA) and la...
This paper establishes a connection between two apparently very different kinds of probabilistic mod...
We describe a probabilistic framework for acquiring selectional preferences of linguistic predi-cate...
International audienceProbabilistic topic models are generative models that describe the content of ...
International audienceProbabilistic topic models are generative models that describe the content of ...
Probabilistic topic modeling is a powerful tool to uncover hidden thematic structure of documents. T...
Topic Models like Latent Dirichlet Allocation have been widely used for their robustness in estimati...
Topic Models like Latent Dirichlet Allocation have been widely used for their robustness in estimati...
Topic Models like Latent Dirichlet Allocation have been widely used for their robustness in estimati...
We present a novel topic mixture-based language model adaptation approach that uses La-tent Dirichle...
Probabilistic topic models are widely used to discover latent topics in document collec-tions, while...
Probabilistic topic models are machine learning tools for processing and understanding large text d...