Topic models are a useful tool for analyzing large text collections, but have previously been applied in only monolingual, or at most bilingual, contexts. Meanwhile, massive collections of interlinked documents in dozens of languages, such as Wikipedia, are now widely available, calling for tools that can characterize content in many languages. We introduce a polylingual topic model that discovers topics aligned across multiple languages. We explore the model\u27s characteristics using two large corpora, each with over ten different languages, and demonstrate its usefulness in supporting machine translation and tracking topic trends across languages
Topic models, an unsupervised technique for inferring translation domains improve machine translatio...
Topic models, an unsupervised technique for inferring translation domains improve machine translatio...
Probabilistic topic models are unsupervised generative models which model document content as a two-...
Topic models are a useful tool for analyzing large text collections, but have previously been applie...
Topic models are a useful tool for analyzing large text collections, but have previously been applie...
Topic models are a useful tool for analyzing large text collections, but have previously been applie...
Topic models are a useful tool for analyzing large text collections, but have previously been applie...
Dynamic topic models (DTMs) capture the evolution of topics and trends in time series data. Current ...
Dynamic topic models (DTMs) capture the evolution of topics and trends in time series data. Current ...
Dynamic topic models (DTMs) capture the evolution of topics and trends in time series data. Current ...
Abstract. In this paper, we present the Polylingual Labeled Topic Model, a model which combines the ...
A machine-readable bilingual dictionary plays a crucial role in many natural language processing tas...
This paper explores bridging the content of two different languages via latent topics. Specifically,...
Abstract. This paper explores bridging the content of two different languages via latent topics. Spe...
Topic models, an unsupervised technique for inferring translation domains improve machine translatio...
Topic models, an unsupervised technique for inferring translation domains improve machine translatio...
Topic models, an unsupervised technique for inferring translation domains improve machine translatio...
Probabilistic topic models are unsupervised generative models which model document content as a two-...
Topic models are a useful tool for analyzing large text collections, but have previously been applie...
Topic models are a useful tool for analyzing large text collections, but have previously been applie...
Topic models are a useful tool for analyzing large text collections, but have previously been applie...
Topic models are a useful tool for analyzing large text collections, but have previously been applie...
Dynamic topic models (DTMs) capture the evolution of topics and trends in time series data. Current ...
Dynamic topic models (DTMs) capture the evolution of topics and trends in time series data. Current ...
Dynamic topic models (DTMs) capture the evolution of topics and trends in time series data. Current ...
Abstract. In this paper, we present the Polylingual Labeled Topic Model, a model which combines the ...
A machine-readable bilingual dictionary plays a crucial role in many natural language processing tas...
This paper explores bridging the content of two different languages via latent topics. Specifically,...
Abstract. This paper explores bridging the content of two different languages via latent topics. Spe...
Topic models, an unsupervised technique for inferring translation domains improve machine translatio...
Topic models, an unsupervised technique for inferring translation domains improve machine translatio...
Topic models, an unsupervised technique for inferring translation domains improve machine translatio...
Probabilistic topic models are unsupervised generative models which model document content as a two-...