Inter-speaker variation can be coped rather well in speech recognition by speaker adaptation techniques such as MLLR and MAP. However, when dealing with speech other than reading style, such as conversational speech, emo-tional speech and so on, current recognition systems can-not achieve a satisfactory performance even after speaker adaptation. In view of this situation, two-level adaptation method was newly proposed, where adaptation technique was applied in two levels to handle inter-speaker and in-tra-speaker variations. A speaker independent model is first adapted to a specific speaker to generate a speaker dependent model. Then, after classifying the training data into several categories, the speaker dependent model is fur-ther adapte...
A robust ASR system needs to perform well in different environment and with different speakers. For ...
Automatic speech recognition (ASR) converts human speech to readable text. Acoustic model adaptation...
High level features such as phone and word n-grams have been shown to be effective for speaker recog...
Speech recognition systems are usually speaker-inde-pendent, but they are not as good as speaker-dep...
This paper describes a new speaker adaptation strategy that we term speaker specific compensation. T...
For the problem of speaker adaptation in speech recognition, the performance depends on the availabi...
Abstract—We present a new modeling approach for speaker recognition that uses the maximum-likelihood...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
Acoustic variability across speakers is one of the challenges of speaker independent speech recognit...
In this paper an effective technique for speaker adaptation on the feature domain is presented. This...
This paper examines techniques for speaker normalisation and adaptation that are applied in training...
When we want to develop a recognition system for a new environment, we have to decide which is the b...
The speaker-dependent HMM-based recognizers gives lower word error rates in comparison with the corr...
Summarization: Several adaptation approaches have been proposed in an effort to improve the speech r...
W ork carried out as visiting student at M SR Asia. This paper presents a 3-stage adaptation framewo...
A robust ASR system needs to perform well in different environment and with different speakers. For ...
Automatic speech recognition (ASR) converts human speech to readable text. Acoustic model adaptation...
High level features such as phone and word n-grams have been shown to be effective for speaker recog...
Speech recognition systems are usually speaker-inde-pendent, but they are not as good as speaker-dep...
This paper describes a new speaker adaptation strategy that we term speaker specific compensation. T...
For the problem of speaker adaptation in speech recognition, the performance depends on the availabi...
Abstract—We present a new modeling approach for speaker recognition that uses the maximum-likelihood...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
Acoustic variability across speakers is one of the challenges of speaker independent speech recognit...
In this paper an effective technique for speaker adaptation on the feature domain is presented. This...
This paper examines techniques for speaker normalisation and adaptation that are applied in training...
When we want to develop a recognition system for a new environment, we have to decide which is the b...
The speaker-dependent HMM-based recognizers gives lower word error rates in comparison with the corr...
Summarization: Several adaptation approaches have been proposed in an effort to improve the speech r...
W ork carried out as visiting student at M SR Asia. This paper presents a 3-stage adaptation framewo...
A robust ASR system needs to perform well in different environment and with different speakers. For ...
Automatic speech recognition (ASR) converts human speech to readable text. Acoustic model adaptation...
High level features such as phone and word n-grams have been shown to be effective for speaker recog...