We describe a new approach to automatic dialect and accent recognition which exceeds state-of-the-art performance in three recognition tasks. This approach improves the accuracy and substantially lower the time complexity of our earlier phonetic based kernel approach for dialect recognition. In contrast to state-of-the-art acoustic-based systems, our approach employs phone labels and segmentation to constrain the acoustic models. Given a speaker’s utterance, we first obtain phone hypotheses using a phone recognizer and then extract GMM-supervectors for each phone type, effectively summarizing the speaker’s phonetic characteristics in a single vector of phone-type supervectors. Using these vectors, we design a kernel function that computes t...
Speech dialects refer to linguistic and pronunciation variations in the speech of the same language....
Despite the success of deep learning in speech recognition, multi-dialect speech recognition remains...
Building NLP systems that serve everyone requires accounting for dialect differences. But dialects a...
In this paper, we introduce a new approach to dialect recognition which relies on the hypothesis tha...
In this paper, we introduce a new approach to dialect recognition which relies on the hypothesis tha...
A fundamental challenge for current research on speech science and technology is understanding and m...
In this paper, we introduce a new approach to dialect recognition that relies on context-dependent (...
In this paper, we introduce a new approach to dialect recognition that relies on context-dependent (...
We study the effectiveness of recently developed language recognition techniques based on speech rec...
Dialect variation spans different linguistic levels of analysis. Two examples include the typical ph...
Traditionally, work in automatic accent recognition has followed a similar research trajectory to th...
Thesis (Ph. D.)--Harvard-MIT Division of Health Sciences and Technology, 2011.Cataloged from PDF ver...
In this paper, we develop methods to identify accents of native speakers. Accent identification diff...
In this paper, three utterance modelling approaches, namely Gaussian Mean Supervector (GMS), i-vecto...
State-of-the-art Automatic Speech Recognition (ASR) models struggle to handle accented speech, parti...
Speech dialects refer to linguistic and pronunciation variations in the speech of the same language....
Despite the success of deep learning in speech recognition, multi-dialect speech recognition remains...
Building NLP systems that serve everyone requires accounting for dialect differences. But dialects a...
In this paper, we introduce a new approach to dialect recognition which relies on the hypothesis tha...
In this paper, we introduce a new approach to dialect recognition which relies on the hypothesis tha...
A fundamental challenge for current research on speech science and technology is understanding and m...
In this paper, we introduce a new approach to dialect recognition that relies on context-dependent (...
In this paper, we introduce a new approach to dialect recognition that relies on context-dependent (...
We study the effectiveness of recently developed language recognition techniques based on speech rec...
Dialect variation spans different linguistic levels of analysis. Two examples include the typical ph...
Traditionally, work in automatic accent recognition has followed a similar research trajectory to th...
Thesis (Ph. D.)--Harvard-MIT Division of Health Sciences and Technology, 2011.Cataloged from PDF ver...
In this paper, we develop methods to identify accents of native speakers. Accent identification diff...
In this paper, three utterance modelling approaches, namely Gaussian Mean Supervector (GMS), i-vecto...
State-of-the-art Automatic Speech Recognition (ASR) models struggle to handle accented speech, parti...
Speech dialects refer to linguistic and pronunciation variations in the speech of the same language....
Despite the success of deep learning in speech recognition, multi-dialect speech recognition remains...
Building NLP systems that serve everyone requires accounting for dialect differences. But dialects a...