This paper is concerned with automatic speech recognition (ASR) for accented speech. Given a small amount of speech from a new speaker, is it better to apply speaker adaptation to the baseline, or to use accent identification (AID) to identify the speaker’s accent and select an accent-dependent acoustic model? Three accent-based model selection methods are investigated: using the ‘true’ accent model, and unsupervised model selection using i-Vector and phonotactic-based AID. All three methods outperform the unadapted baseline. Most significantly, AID-based model selection using 43s of speech performs better than unsupervised speaker adaptation, even if the latter uses five times more adaptation data. Combining unsupervised AID-based mod...
(Now with TEMIC SDS GmbH, Ulm, Germany). It has been demonstrated repeatedly that the acoustic model...
Advances in speech technology, speech signal processing and phonetic representation are leading to n...
The performance of the speech recognition systems to translate voice to text is still an issue in la...
This paper investigates techniques to compensate for the effects of regional accents of British Engl...
Accent is cited as an issue for speech recognition systems. Our experiments showed that the ASR word...
The ability to automatically identify a speaker's accent would be very useful for a speech recogniti...
Current automatic speech recognition (ASR) systems trained on native speech of...
Automatic speech recognition technology has developed rapidly in the past decade. Applications of th...
Accent variability is an important factor in speech that can significantly degrade automatic speech...
State-of-the-art Automatic Speech Recognition (ASR) models struggle to handle accented speech, parti...
Traditionally, work in automatic accent recognition has followed a similar research trajectory to th...
Accent is the pattern of pronunciation which can identify a person's linguistic, social or cultural ...
Accent is the pattern of pronunciation which can identify a person’s linguistic, social or cultural ...
Several adaptation approaches have been proposed in an effort to improve the speech recognition perfor...
LVCSR performance is consistently poor on low-proficiency non-native speech. While gains from speake...