We present an unsupervised learning algorithm that acquires a natural-language lexicon from raw speech. The algorithm is based on the optimal encoding of symbol sequences in an MDL framework, and uses a hierarchical representation of language that overcomes many of the problems that have stymied previous grammar-induction procedures. The forward mapping from symbol sequences to the speech stream is modeled using features based on articulatory gestures. We present results on the acquisition of lexicons and language models from raw speech, text, and phonetic transcripts, and demonstrate that our algorithm compares very favorably to other reported results with respect to segmentation performance and statistical efficiency
The identification of words in continuous speech, known as speech segmentation, is a critical early ...
Human speakers encode information into raw speech which is then decoded by the listeners. This compl...
© 2014 IEEE. In this paper, a spoken command and control interface that acquires spoken language thr...
We present an unsupervised learning algorithm that acquires a natural-language lexicon from raw spee...
We present an algorithm that acquires words (pairings of phonological forms and semantic representat...
Natural language processing systems such as speech recognition and ma-chine translation conventional...
Preparing words in speech production is normally a fast and accurate process. We generate them two o...
The process of human spoken language acquisition is still being studied up to this day—the most popu...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In the domain of unsupervised learning most work on speech has focused on discovering low-level cons...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
We present a model of unsupervised phonological lexicon discovery -- the problem of simultaneously l...
This paper reports the on-going research of a thesis project investigating a computational model of ...
Roy [1] developed a computational model of early lexical learning to address three questions: First,...
We consider the problem of fully unsupervised learning of grammatical (part-of-speech) categories fr...
The identification of words in continuous speech, known as speech segmentation, is a critical early ...
Human speakers encode information into raw speech which is then decoded by the listeners. This compl...
© 2014 IEEE. In this paper, a spoken command and control interface that acquires spoken language thr...
We present an unsupervised learning algorithm that acquires a natural-language lexicon from raw spee...
We present an algorithm that acquires words (pairings of phonological forms and semantic representat...
Natural language processing systems such as speech recognition and ma-chine translation conventional...
Preparing words in speech production is normally a fast and accurate process. We generate them two o...
The process of human spoken language acquisition is still being studied up to this day—the most popu...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In the domain of unsupervised learning most work on speech has focused on discovering low-level cons...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
We present a model of unsupervised phonological lexicon discovery -- the problem of simultaneously l...
This paper reports the on-going research of a thesis project investigating a computational model of ...
Roy [1] developed a computational model of early lexical learning to address three questions: First,...
We consider the problem of fully unsupervised learning of grammatical (part-of-speech) categories fr...
The identification of words in continuous speech, known as speech segmentation, is a critical early ...
Human speakers encode information into raw speech which is then decoded by the listeners. This compl...
© 2014 IEEE. In this paper, a spoken command and control interface that acquires spoken language thr...