In this paper, a novel on-line incremental speaker adaptation technique is proposed for real time transcription applications such as automatic closed-captioning of live TV programs. Differently from previously proposed methods, our technique does not operate at utterance level but instead speaker change detection and clustering as well as speaker adaptation occur over a short chunk of the incoming audio signal. Incremental adaptation based on feature space maximum likelihood linear regression (fMLLR) is conducted w. r. t. a Gaussian mixture model (GMM) modeling the acoustic training data. Individual speakers are represented by fMLLR transforms, and these transforms are used for speaker clustering and for performing speaker adapt...
Abstract. In this paper a speaker adaptation methodology is proposed, which first automatically dete...
In real-time speech recognition applications, there is a need to implement a fast and reliable adapt...
ICASSP2006: IEEE International Conference on Acoustics, Speech, and Signal Processing, May 14-19, ...
In order to improve the performance of speech recognition systems when speakers change frequently an...
The use of the PC and Internet for placing telephone calls will present new opportunities to capture...
For many applications, it is necessary to produce speech transcriptions in a causal fashion. To prod...
Abstract: This paper deals with speaker adaptation techniques well suited for the task of online sub...
This paper deals with speaker adaptation techniques well suited for the task of online subtitling. T...
This paper addresses a novel algorithm of incremental speaker adaptation (ISA) based on universal ba...
This paper deals with adaptation techniques based on maximum likelihood linear transformations, whic...
In this paper, an approach for unsupervised dynamic adaptation of the language model used in an auto...
This paper addresses speaker adaptive acoustic modeling, based on feature space maximum likelih...
This paper introduces two novel techniques for instantaneous speaker adaptation, reference speaker w...
This paper presents a technical speaker adaptation method called WMLLR, which is based on maximum li...
The general goal of this thesis is to improve the performance of state-of-the-art statistical automa...
Abstract. In this paper a speaker adaptation methodology is proposed, which first automatically dete...
In real-time speech recognition applications, there is a need to implement a fast and reliable adapt...
ICASSP2006: IEEE International Conference on Acoustics, Speech, and Signal Processing, May 14-19, ...
In order to improve the performance of speech recognition systems when speakers change frequently an...
The use of the PC and Internet for placing telephone calls will present new opportunities to capture...
For many applications, it is necessary to produce speech transcriptions in a causal fashion. To prod...
Abstract: This paper deals with speaker adaptation techniques well suited for the task of online sub...
This paper deals with speaker adaptation techniques well suited for the task of online subtitling. T...
This paper addresses a novel algorithm of incremental speaker adaptation (ISA) based on universal ba...
This paper deals with adaptation techniques based on maximum likelihood linear transformations, whic...
In this paper, an approach for unsupervised dynamic adaptation of the language model used in an auto...
This paper addresses speaker adaptive acoustic modeling, based on feature space maximum likelih...
This paper introduces two novel techniques for instantaneous speaker adaptation, reference speaker w...
This paper presents a technical speaker adaptation method called WMLLR, which is based on maximum li...
The general goal of this thesis is to improve the performance of state-of-the-art statistical automa...
Abstract. In this paper a speaker adaptation methodology is proposed, which first automatically dete...
In real-time speech recognition applications, there is a need to implement a fast and reliable adapt...
ICASSP2006: IEEE International Conference on Acoustics, Speech, and Signal Processing, May 14-19, ...