The development of rich, multi-lingual corpora is essential for enabling new types of large-scale inquiry into the nature of language (Abney and Bird, 2010; Lewis and Xia, 2010). However, significant digital resources currently exist for only a handful of the world's languages. The present dissertation addresses this issue by introducing new techniques for creating rich corpora by enriching existing resources via automated processing. As a way of leveraging existing resources, this dissertation describes an automated method for extracting bitext (text accompanied by a translation) from bilingual documents. Digitized copies of printed books are mined for foreign-language material, using statistical methods for language identification and wo...
Deposited with permission of the author. ©2004 Charlotte WilsonLinguistic information is useful in ...
In statistical machine translation, estimating word-to-word alignment probabilities for the translat...
Theoretical work in morphological typology offers the possibility of measuring morpholog- ical diver...
The development of rich, multi-lingual corpora is essential for enabling new types of large-scale in...
The world-wide proliferation of digital communications has created the need for language and speech ...
This thesis contains work on a specific problem in field of LanguageTechnology. The problem can be d...
This article surveys resource-light monolingual approaches to morphological analysis and tagging. Wh...
Language documentation involves linguistic analysis of the collected material, which is typically do...
In this paper, a novel algorithm for incorporating morpho-logical knowledge into statistical machine...
A core issue that hampers development and use of language technology for underresourced and morpholo...
This paper presents an algorithm for the unsuper-vised learning of a simple morphology of a nat-ural...
Abstract We propose a language-independent approach for improving statistical machine translation fo...
The morphology of a language is a knowledge of the ways in which the language’s words can change in ...
We present a novel method of statisti-cal morphological generation, i.e. the pre-diction of inflecte...
The induction program we have crafted relies primarily on the linguistic notion of productivity to f...
Deposited with permission of the author. ©2004 Charlotte WilsonLinguistic information is useful in ...
In statistical machine translation, estimating word-to-word alignment probabilities for the translat...
Theoretical work in morphological typology offers the possibility of measuring morpholog- ical diver...
The development of rich, multi-lingual corpora is essential for enabling new types of large-scale in...
The world-wide proliferation of digital communications has created the need for language and speech ...
This thesis contains work on a specific problem in field of LanguageTechnology. The problem can be d...
This article surveys resource-light monolingual approaches to morphological analysis and tagging. Wh...
Language documentation involves linguistic analysis of the collected material, which is typically do...
In this paper, a novel algorithm for incorporating morpho-logical knowledge into statistical machine...
A core issue that hampers development and use of language technology for underresourced and morpholo...
This paper presents an algorithm for the unsuper-vised learning of a simple morphology of a nat-ural...
Abstract We propose a language-independent approach for improving statistical machine translation fo...
The morphology of a language is a knowledge of the ways in which the language’s words can change in ...
We present a novel method of statisti-cal morphological generation, i.e. the pre-diction of inflecte...
The induction program we have crafted relies primarily on the linguistic notion of productivity to f...
Deposited with permission of the author. ©2004 Charlotte WilsonLinguistic information is useful in ...
In statistical machine translation, estimating word-to-word alignment probabilities for the translat...
Theoretical work in morphological typology offers the possibility of measuring morpholog- ical diver...