Due to abundant resources not always being available for resource-limited languages, training an acoustic model with unbalanced training data for multilingual speech recognition is an interesting research issue. In this paper, we propose a three-step data-driven phone clustering method to train a multilingual acoustic model. The first step is to obtain a clustering rule of context independent phone models driven from a well-trained acoustic model using a similarity measurement. For the second step, we further clustered the sub-phone units using hierarchical agglomerative clustering with delta Bayesian information criteria according to the clustering rules. Then, we chose a parametric modeling technique-- model complexity selection-- to adju...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
The development of automatic speech recognition systems requires significant quantities of annotated...
Despite the success of deep learning in speech recognition, multi-dialect speech recognition remains...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Summarization: This work presents techniques for improved cross-language transfer of speech...
In this paper, we explore how different acoustic modeling tech-niques can benefit from data in langu...
Summarization: The porting of a speech recognition system to a new language is usually a time-consum...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
Although research has previously been done on multilingual speech recognition, it has been found to ...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
Today, speech synthesizers in new languages are typically built by collecting several hours of well ...
Title from PDF of title page (University of Missouri--Columbia, viewed on May 25, 2012).The entire t...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
A state-of-the-art automatic speech recognition (ASR) system can often achieve high accuracy for mos...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
The development of automatic speech recognition systems requires significant quantities of annotated...
Despite the success of deep learning in speech recognition, multi-dialect speech recognition remains...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Summarization: This work presents techniques for improved cross-language transfer of speech...
In this paper, we explore how different acoustic modeling tech-niques can benefit from data in langu...
Summarization: The porting of a speech recognition system to a new language is usually a time-consum...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recogni...
Although research has previously been done on multilingual speech recognition, it has been found to ...
This paper studies cross-lingual acoustic modeling in the context of subspace Gaussian mixture model...
Today, speech synthesizers in new languages are typically built by collecting several hours of well ...
Title from PDF of title page (University of Missouri--Columbia, viewed on May 25, 2012).The entire t...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
A state-of-the-art automatic speech recognition (ASR) system can often achieve high accuracy for mos...
Over the past decades, speech recognition has dramatically improved in a large variety of applicatio...
The development of automatic speech recognition systems requires significant quantities of annotated...
Despite the success of deep learning in speech recognition, multi-dialect speech recognition remains...