We explore the integration of multiple factors such as genre and speaker gender for acoustic model adaptation tasks to improve Mandarin ASR system performance on broadcast news and broadcast conversation audio. We investigate the use of mul-tifactor clustering of acoustic model training data and the ap-plication of MPE-MAP and fMPE-MAP acoustic model adap-tations. We found that by effectively combining these adapta-tion approaches, we achieve 6 % relative reduction in recogni-tion error rate compared to a Mandarin recognition system that does not use genre-specific acoustic models, and 5 % relative improvement if the genre-adaptive system is combined with an-other, genre-independent state-of-the-art system. Index Terms: large vocabulary aut...
The paper constructs a prosody adaptation model for Mandarin text to speech system by using the pros...
Lack of data is a problem in training language models for conversational speech recognition, particu...
This paper presents a new framework for improved large vocabulary Mandarin speech recognition using ...
This paper considers dynamic language model adaptation for Mandarin broadcast news recognition. Both...
In this paper, a basic Mandarin broadcast news speech recognition system is constructed using the MA...
[[abstract]]N-gram language modeling is a crucial component in any speech recognizer since it is exp...
We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and ...
[[abstract]]This study investigates language modeling for Mandarin continuous speech recognition. Fi...
This paper presents an empirical study of word error minimization approaches for Mandarin large voca...
[[abstract]]Discriminative training of acoustic models has been an active focus of much current rese...
[[abstract]]Statistical language modeling, which aims to capture the regularities in human natural l...
This paper discusses the development of the CU-HTK Mandarin Broadcast News (BN) transcription system...
This article investigates the use of several lightly supervised and data-driven approaches to Mandar...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
The predominant language model for speech recognition is n-gram language model, which is locally lea...
The paper constructs a prosody adaptation model for Mandarin text to speech system by using the pros...
Lack of data is a problem in training language models for conversational speech recognition, particu...
This paper presents a new framework for improved large vocabulary Mandarin speech recognition using ...
This paper considers dynamic language model adaptation for Mandarin broadcast news recognition. Both...
In this paper, a basic Mandarin broadcast news speech recognition system is constructed using the MA...
[[abstract]]N-gram language modeling is a crucial component in any speech recognizer since it is exp...
We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and ...
[[abstract]]This study investigates language modeling for Mandarin continuous speech recognition. Fi...
This paper presents an empirical study of word error minimization approaches for Mandarin large voca...
[[abstract]]Discriminative training of acoustic models has been an active focus of much current rese...
[[abstract]]Statistical language modeling, which aims to capture the regularities in human natural l...
This paper discusses the development of the CU-HTK Mandarin Broadcast News (BN) transcription system...
This article investigates the use of several lightly supervised and data-driven approaches to Mandar...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
The predominant language model for speech recognition is n-gram language model, which is locally lea...
The paper constructs a prosody adaptation model for Mandarin text to speech system by using the pros...
Lack of data is a problem in training language models for conversational speech recognition, particu...
This paper presents a new framework for improved large vocabulary Mandarin speech recognition using ...