This paper introduces two novel techniques for instantaneous speaker adaptation, reference speaker weighting and consistency modeling. An approach to hierarchical speaker clustering using gender and speaking rate as the clustering criteria is also presented. All three methods attempt to utilize the underlying within-speaker correlations that are present between the acoustic realizations of different phones. By accounting for these correlations a limited amount of adaptation data can be used to adapt the models of every phonetic acoustic model including those for phones which have not been observed in the adaptation data. In instantaneous adaptation experiments using the DARPA Resource Management corpus, a reduction in word error rate of 20...
EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, ...
ICASSP2001: IEEE International Conference on Acoustics, Speech and Signal Processing, May 7-11, 20...
This paper examines an approach to speaker adaptation called speaker cluster weighting (SCW) for ra...
A method for unsupervised instantaneous speaker adaptation is presented and evaluated on a continuou...
The use of the PC and Internet for placing telephone calls will present new opportunities to capture...
Abstract. In this paper a speaker adaptation methodology is proposed, which first automatically dete...
For the problem of speaker adaptation in speech recognition, the performance depends on the availabi...
In real-time speech recognition applications, there is a need to implement a fast and reliable adapt...
ICASSP2006: IEEE International Conference on Acoustics, Speech, and Signal Processing, May 14-19, ...
This paper describes the method of using multi-template unsupervised speaker adaptation based on HMM...
In the past few years numerous techniques have been proposed to improve the efficiency of basic adap...
Recently, we revisited the fast adaptation method called reference speaker weighting (RSW), and sugg...
Automatic speech recognition (ASR) converts human speech to readable text. Acoustic model adaptation...
This paper describes an efficient method for unsupervised speaker adaptation. This method is based o...
This paper addresses speaker adaptive acoustic modeling, based on feature space maximum likelih...
EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, ...
ICASSP2001: IEEE International Conference on Acoustics, Speech and Signal Processing, May 7-11, 20...
This paper examines an approach to speaker adaptation called speaker cluster weighting (SCW) for ra...
A method for unsupervised instantaneous speaker adaptation is presented and evaluated on a continuou...
The use of the PC and Internet for placing telephone calls will present new opportunities to capture...
Abstract. In this paper a speaker adaptation methodology is proposed, which first automatically dete...
For the problem of speaker adaptation in speech recognition, the performance depends on the availabi...
In real-time speech recognition applications, there is a need to implement a fast and reliable adapt...
ICASSP2006: IEEE International Conference on Acoustics, Speech, and Signal Processing, May 14-19, ...
This paper describes the method of using multi-template unsupervised speaker adaptation based on HMM...
In the past few years numerous techniques have been proposed to improve the efficiency of basic adap...
Recently, we revisited the fast adaptation method called reference speaker weighting (RSW), and sugg...
Automatic speech recognition (ASR) converts human speech to readable text. Acoustic model adaptation...
This paper describes an efficient method for unsupervised speaker adaptation. This method is based o...
This paper addresses speaker adaptive acoustic modeling, based on feature space maximum likelih...
EUROSPEECH2001: the 7th European Conference on Speech Communication and Technology, September 3-7, ...
ICASSP2001: IEEE International Conference on Acoustics, Speech and Signal Processing, May 7-11, 20...
This paper examines an approach to speaker adaptation called speaker cluster weighting (SCW) for ra...