Linear regression based speaker adaptation approaches can improve Automatic Speech Recognition (ASR) accuracy significantly for a target speaker. However, when the available adaptation data is limited to a few seconds, the accuracy of the speaker adapted models is often worse compared with speaker independent models. In this paper, we propose an approach to select a set of reference speakers acoustically close to the target speaker whose data can be used to augment the adaptation data. To determine the acoustic similarity of two speakers, we propose a distance metric based on transforming sample points in the acoustic space with the regression matrices of the two speakers. We show the validity of this approach through a speaker identificati...
Automatic speech recognition (ASR) incorporates knowledge and research in linguistics, computer scie...
The speaker-dependent HMM-based recognizers gives lower word error rates in comparison with the corr...
In this paper an effective technique for speaker adaptation on the feature domain is presented. This...
A robust ASR system needs to perform well in different environment and with different speakers. For ...
<p>Speaker dependent (SD) ASR systems have significantly lower word error rates (WER) compared to sp...
This paper presents a technical speaker adaptation method called WMLLR, which is based on maximum li...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
Speaker dependent (SD) ASR systems have significantly lower word error rates (WER) compared to speak...
ICASSP2006: IEEE International Conference on Acoustics, Speech, and Signal Processing, May 14-19, ...
Though speaker adaptation has long been an importing topic in automatic speech recognition, the brea...
This paper investigates techniques to compensate for the effects of regional accents of British Engl...
This paper proposes state-of the-art Automatic Speaker Recognition System (ASR) based on Bayesian Di...
In real-time speech recognition applications, there is a need to implement a fast and reliable adapt...
Abstract—We present a new modeling approach for speaker recognition that uses the maximum-likelihood...
This paper proposes state-of the-art Automatic Speaker Recognition System (ASR) based on Bayesian Di...
Automatic speech recognition (ASR) incorporates knowledge and research in linguistics, computer scie...
The speaker-dependent HMM-based recognizers gives lower word error rates in comparison with the corr...
In this paper an effective technique for speaker adaptation on the feature domain is presented. This...
A robust ASR system needs to perform well in different environment and with different speakers. For ...
<p>Speaker dependent (SD) ASR systems have significantly lower word error rates (WER) compared to sp...
This paper presents a technical speaker adaptation method called WMLLR, which is based on maximum li...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
Speaker dependent (SD) ASR systems have significantly lower word error rates (WER) compared to speak...
ICASSP2006: IEEE International Conference on Acoustics, Speech, and Signal Processing, May 14-19, ...
Though speaker adaptation has long been an importing topic in automatic speech recognition, the brea...
This paper investigates techniques to compensate for the effects of regional accents of British Engl...
This paper proposes state-of the-art Automatic Speaker Recognition System (ASR) based on Bayesian Di...
In real-time speech recognition applications, there is a need to implement a fast and reliable adapt...
Abstract—We present a new modeling approach for speaker recognition that uses the maximum-likelihood...
This paper proposes state-of the-art Automatic Speaker Recognition System (ASR) based on Bayesian Di...
Automatic speech recognition (ASR) incorporates knowledge and research in linguistics, computer scie...
The speaker-dependent HMM-based recognizers gives lower word error rates in comparison with the corr...
In this paper an effective technique for speaker adaptation on the feature domain is presented. This...