Current supervised speech technology relies heavily on tran-scribed speech and pronunciation dictionaries. In settings where unlabelled speech data alone is available, unsupervised methods are required to discover categorical linguistic structure directly from the audio. We present a novel Bayesian model which seg-ments unlabelled input speech into word-like units, resulting in a complete unsupervised transcription of the speech in terms of discovered word types. In our approach, a potential word segment (of arbitrary length) is embedded in a fixed-dimensional space; the model (implemented as a Gibbs sampler) then builds a whole-word acoustic model in this space while jointly doing seg-mentation. We report word error rates in a connected di...
In this paper, we present an unsupervised method for automatically discovering words from speech usi...
A Bayesian model of continuous speech recognition is presented. It is based on Shortlist ( D. Norris...
This work investigates subspace non-parametric models for the task of learning a set of acoustic uni...
Unsupervised speech processing methods are essential for ap-plications ranging from zero-resource sp...
Abstract — In this paper we consider the unsupervised word discovery from phonetic input. We employ ...
Accepted to ICASSP 2018International audienceDeveloping speech technologies for low-resource languag...
Documenting languages helps to prevent the extinction of endangered dialects, many of which are othe...
Automatic speech recognition has matured into a commercially successful technology, enabling voice-b...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
We present a novel approach to speech processing based on the principle of pattern discovery. Our wo...
Zero resource speech processing refers to a scenario where no or minimal transcribed da...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
The ability to infer linguistic structures from noisy speech streams seems to be an innate human cap...
The creation of a pronunciation lexicon re-mains the most inefficient process in develop-ing an Auto...
Zero-resource speech processing is a growing research area which aims to develop methods that can d...
In this paper, we present an unsupervised method for automatically discovering words from speech usi...
A Bayesian model of continuous speech recognition is presented. It is based on Shortlist ( D. Norris...
This work investigates subspace non-parametric models for the task of learning a set of acoustic uni...
Unsupervised speech processing methods are essential for ap-plications ranging from zero-resource sp...
Abstract — In this paper we consider the unsupervised word discovery from phonetic input. We employ ...
Accepted to ICASSP 2018International audienceDeveloping speech technologies for low-resource languag...
Documenting languages helps to prevent the extinction of endangered dialects, many of which are othe...
Automatic speech recognition has matured into a commercially successful technology, enabling voice-b...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
We present a novel approach to speech processing based on the principle of pattern discovery. Our wo...
Zero resource speech processing refers to a scenario where no or minimal transcribed da...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
The ability to infer linguistic structures from noisy speech streams seems to be an innate human cap...
The creation of a pronunciation lexicon re-mains the most inefficient process in develop-ing an Auto...
Zero-resource speech processing is a growing research area which aims to develop methods that can d...
In this paper, we present an unsupervised method for automatically discovering words from speech usi...
A Bayesian model of continuous speech recognition is presented. It is based on Shortlist ( D. Norris...
This work investigates subspace non-parametric models for the task of learning a set of acoustic uni...