For centuries, scholars have explored the deep links among human languages. In this paper, we present a class of probabilistic models that use these links as a form of naturally occurring supervision. These models allow us to substantially improve performance for core text processing tasks, such as morphological segmentation, part-of-speech tagging, and syntactic parsing. Besides these traditional NLP tasks, we also present a multilingual model for the computational decipherment of lost languages
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Language is central to human intelligence. We review recent break- throughs in machine language proc...
We consider the problem of fully unsupervised learning of grammatical (part-of-speech) categories fr...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Recent research has shown promise in multilingual modeling, demonstrating how a single model is capa...
With the rising amount of available multilingual text data, computational linguistics faces an oppor...
In real scenarios, a multilingual model trained to solve NLP tasks on a set of languages can be requ...
textBuilding a computer system that can understand human languages has been one of the long-standing...
How cross-linguistically applicable are NLP models, specifically language models? A fair comparison ...
This thesis focuses on unsupervised morphological seg- mentation, the fundamental task in NLP which ...
Building models of language is a central task in natural language processing. Traditionally, languag...
Computational approaches to linguistic analysis have been used for more than half a century. The mai...
NLP technologies are uneven for the world's languages as the state-of-the-art models are only availa...
We investigate the problem of unsupervised part-of-speech tagging when raw parallel data is availabl...
We demonstrate the effectiveness of multilingual learning for unsupervised part-of-speech tagging. ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Language is central to human intelligence. We review recent break- throughs in machine language proc...
We consider the problem of fully unsupervised learning of grammatical (part-of-speech) categories fr...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Recent research has shown promise in multilingual modeling, demonstrating how a single model is capa...
With the rising amount of available multilingual text data, computational linguistics faces an oppor...
In real scenarios, a multilingual model trained to solve NLP tasks on a set of languages can be requ...
textBuilding a computer system that can understand human languages has been one of the long-standing...
How cross-linguistically applicable are NLP models, specifically language models? A fair comparison ...
This thesis focuses on unsupervised morphological seg- mentation, the fundamental task in NLP which ...
Building models of language is a central task in natural language processing. Traditionally, languag...
Computational approaches to linguistic analysis have been used for more than half a century. The mai...
NLP technologies are uneven for the world's languages as the state-of-the-art models are only availa...
We investigate the problem of unsupervised part-of-speech tagging when raw parallel data is availabl...
We demonstrate the effectiveness of multilingual learning for unsupervised part-of-speech tagging. ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Language is central to human intelligence. We review recent break- throughs in machine language proc...
We consider the problem of fully unsupervised learning of grammatical (part-of-speech) categories fr...