In this paper, we propose a discriminative dynamic Gaussian mixture selection (DGMS) strategy to generate reliable accent-specific units (ASUs) for multi-accent speech recognition. Time-aligned phone recognition is used to generate the ASUs that model accent variations explicitly and accurately. DGMS reconstructs and adjusts a pre-trained set of hidden Markov model (HMM) state densities to build dynamic observation densities for each input speech frame. A discriminative minimum classification error criterion is adopted to optimize the sizes of the HMM state observation densities with a genetic algorithm (GA). To the author's knowledge, the discriminative optimization for DGMS accomplishes discriminative training of discrete variables that i...
In this paper, methods of Gaussian Mixture Model (GMM) are presented for both silence/voiced/voicele...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
In this paper, we propose a state-dependent tied mixture (SDTM) models with variable codebook size t...
Multiple accents are often present in Mandarin speech, as most Chi-nese have learned Mandarin as a s...
As speech recognition systems are used in ever more applications, it is crucial for the systems to b...
As speech recognition systems are used in ever more applications, it is crucial for the systems to b...
Automatic speech recognition technology has developed rapidly in the past decade. Applications of th...
Due to abundant resources not always being available for resource-limited languages, training an aco...
Phonetic differences always exist between any Chinese dialect and standard Chinese (Putonghua). In t...
To make full use of a small development data set to build a robust dialectal Chinese speech recogniz...
In this paper, the GMM-based text-independent speaker identification system for Mandarin speech is m...
In this paper, the GMM-based text-independent speaker identification system for Mandarin speech is m...
In mandarin, the words are composed by the concatenation of Chinese characters. In this paper, we pr...
In mandarin, the words are composed by the concatenation of Chinese characters. In this paper, we pr...
In this paper, several special speech recognition approaches based on hidden Markov models (HMMs) ar...
In this paper, methods of Gaussian Mixture Model (GMM) are presented for both silence/voiced/voicele...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
In this paper, we propose a state-dependent tied mixture (SDTM) models with variable codebook size t...
Multiple accents are often present in Mandarin speech, as most Chi-nese have learned Mandarin as a s...
As speech recognition systems are used in ever more applications, it is crucial for the systems to b...
As speech recognition systems are used in ever more applications, it is crucial for the systems to b...
Automatic speech recognition technology has developed rapidly in the past decade. Applications of th...
Due to abundant resources not always being available for resource-limited languages, training an aco...
Phonetic differences always exist between any Chinese dialect and standard Chinese (Putonghua). In t...
To make full use of a small development data set to build a robust dialectal Chinese speech recogniz...
In this paper, the GMM-based text-independent speaker identification system for Mandarin speech is m...
In this paper, the GMM-based text-independent speaker identification system for Mandarin speech is m...
In mandarin, the words are composed by the concatenation of Chinese characters. In this paper, we pr...
In mandarin, the words are composed by the concatenation of Chinese characters. In this paper, we pr...
In this paper, several special speech recognition approaches based on hidden Markov models (HMMs) ar...
In this paper, methods of Gaussian Mixture Model (GMM) are presented for both silence/voiced/voicele...
This paper investigates employment of Subspace Gaussian Mixture Models (SGMMs) for acoustic model ad...
In this paper, we propose a state-dependent tied mixture (SDTM) models with variable codebook size t...