In this paper, we propose to use Gaussian mixture model (GMM) supervectors in a feed-forward deep neural network (DNN) for age identification from voice. The GMM is trained with short-term mel-frequency cepstral coefficients (MFCC). The proposed GMM/DNN method is compared with a feed-forward DNN and a recurrent neural network (RNN) in which the MFCC features are directly used. We also make a comparison with the classical GMM and GMM/support vector machine (SVM) methods. Baseline results are obtained with a set of long-term features which are commonly used for age identification in previous studies. A feed-forward DNN and an SVM are trained using the long term features. All the systems are tested using a speech database which consists...
Systems that automatically detect voice pathologies are usually trained with recordings belonging to...
A recent trend in speech processing is embedding generation models, which can encode much of the rel...
In this paper, large vocabulary children’s speech recognition is investigated by using the Deep Neur...
The objective of this research was to develop deep learning classifiers and various parameters that ...
This paper compares two approaches of automatic age and gen-der classification with 7 classes. The f...
The most successful systems in previous comparative studies on speaker age recognition used short-te...
This paper focuses on the automatic recognition of a per-son’s age and gender based only on his or h...
DNN-HMMs, are recently very promising acoustic models achieving good speech recognition results over...
The process of ageing causes changes to the voice over time. There have been significant research ef...
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion...
This paper proposes a novel approach for automatic estimation of four important traits of speakers, ...
G ender and age estimation based on Gaussian Mixture Models (GMM) is introduced. Records from the Cz...
Automatic age and gender recognition for speech applications is very important for a number of reaso...
This paper presents a comparative study of four different ap-proaches to automatic age and gender cl...
This paper introduces deep neural network (DNN)–hidden Markov model (HMM)-based methods to tackle sp...
Systems that automatically detect voice pathologies are usually trained with recordings belonging to...
A recent trend in speech processing is embedding generation models, which can encode much of the rel...
In this paper, large vocabulary children’s speech recognition is investigated by using the Deep Neur...
The objective of this research was to develop deep learning classifiers and various parameters that ...
This paper compares two approaches of automatic age and gen-der classification with 7 classes. The f...
The most successful systems in previous comparative studies on speaker age recognition used short-te...
This paper focuses on the automatic recognition of a per-son’s age and gender based only on his or h...
DNN-HMMs, are recently very promising acoustic models achieving good speech recognition results over...
The process of ageing causes changes to the voice over time. There have been significant research ef...
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion...
This paper proposes a novel approach for automatic estimation of four important traits of speakers, ...
G ender and age estimation based on Gaussian Mixture Models (GMM) is introduced. Records from the Cz...
Automatic age and gender recognition for speech applications is very important for a number of reaso...
This paper presents a comparative study of four different ap-proaches to automatic age and gender cl...
This paper introduces deep neural network (DNN)–hidden Markov model (HMM)-based methods to tackle sp...
Systems that automatically detect voice pathologies are usually trained with recordings belonging to...
A recent trend in speech processing is embedding generation models, which can encode much of the rel...
In this paper, large vocabulary children’s speech recognition is investigated by using the Deep Neur...