Accepted to ICASSP 2018International audienceDeveloping speech technologies for low-resource languages has become a very active research field over the last decade. Among others, Bayesian models have shown some promising results on artificial examples but still lack of in situ experiments. Our work applies state-of-the-art Bayesian models to unsupervised Acoustic Unit Discovery (AUD) in a real low-resource language scenario. We also show that Bayesian models can naturally integrate information from other resourceful languages by means of informative prior leading to more consistent discovered units. Finally, discovered acoustic units are used, either as the 1-best sequence or as a lattice, to perform word segmentation. Word segmentation res...
We summarize the accomplishments of a multi-disciplinary work-shop exploring the computational and s...
Many techniques in speech processing require inference based on observations that are of- ten noisy,...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
Accepted to ICASSP 2018International audienceDeveloping speech technologies for low-resource languag...
This work investigates subspace non-parametric models for the task of learning a set of acoustic uni...
Current supervised speech technology relies heavily on tran-scribed speech and pronunciation diction...
The ability to infer linguistic structures from noisy speech streams seems to be an innate human cap...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
AbstractRecently, several nonparametric Bayesian models have been proposed to automatically discover...
Abstract — In this paper we consider the unsupervised word discovery from phonetic input. We employ ...
Documenting languages helps to prevent the extinction of endangered dialects, many of which are othe...
Documenting languages helps to prevent the extinction of endangered dialects – many of which are oth...
Automatic speech recognition has matured into a commercially successful technology, enabling voice-b...
One of the key challenges involved in building statistical automatic speech recog-nition (ASR) syste...
International audienceWe present a first attempt to perform attentional word segmen-tation directly ...
We summarize the accomplishments of a multi-disciplinary work-shop exploring the computational and s...
Many techniques in speech processing require inference based on observations that are of- ten noisy,...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
Accepted to ICASSP 2018International audienceDeveloping speech technologies for low-resource languag...
This work investigates subspace non-parametric models for the task of learning a set of acoustic uni...
Current supervised speech technology relies heavily on tran-scribed speech and pronunciation diction...
The ability to infer linguistic structures from noisy speech streams seems to be an innate human cap...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
AbstractRecently, several nonparametric Bayesian models have been proposed to automatically discover...
Abstract — In this paper we consider the unsupervised word discovery from phonetic input. We employ ...
Documenting languages helps to prevent the extinction of endangered dialects, many of which are othe...
Documenting languages helps to prevent the extinction of endangered dialects – many of which are oth...
Automatic speech recognition has matured into a commercially successful technology, enabling voice-b...
One of the key challenges involved in building statistical automatic speech recog-nition (ASR) syste...
International audienceWe present a first attempt to perform attentional word segmen-tation directly ...
We summarize the accomplishments of a multi-disciplinary work-shop exploring the computational and s...
Many techniques in speech processing require inference based on observations that are of- ten noisy,...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...