This paper focuses on the automatic recognition of a per-son’s age and gender based only on his or her voice. Up to five different systems are compared and combined in dif-ferent configurations: three systems model the speaker’s characteristics in different feature spaces, i.e., MFCC, PLP, TRAPS, by Gaussian mixture models. The features of these systems are the concatenated mean vectors. Sys-tem number 4 uses a physical two-mass vocal model and estimates in a data-driven optimization procedure 9 glot-tal features from voiced speech sections. For each ut-terance the minimum, maximum and mean vectors form a 27-dimensional feature vector. The last system calcu-lates a 219-dimensional prosodic feature set for each ut-terance based on voice and ...
Automatic recognition of paralinguistic information from speech is important. Speaker identity, gend...
Important problems in speech soft biometrics include the prediction of speaker's age or gender. Here...
The human voice is comprised of sound made by a human being using the vocal cord for talking,singing...
This paper presents a description of the INESC-ID Spoken Language Systems Laboratory (L2F) Age and G...
This paper presents a comparative study of four different ap-proaches to automatic age and gender cl...
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion...
Automatic age and gender recognition for speech applications is very important for a number of reaso...
This paper compares two approaches of automatic age and gen-der classification with 7 classes. The f...
Extracting speaker-dependent paralinguistic information out of a person's voice, provides an opportu...
This work deals with speaker´s age and gender recognition. At the beginning it introduces the practi...
We develop an acoustic feature set for the estimation of a per-son’s age from a recorded speech sign...
In this paper, we propose to use Gaussian mixture model (GMM) supervectors in a feed-forward deep ne...
A recent trend in speech processing is embedding generation models, which can encode much of the rel...
The most successful systems in previous comparative studies on speaker age recognition used short-te...
Speech signals carry important information about a speaker such as age, gender, language, accent and...
Automatic recognition of paralinguistic information from speech is important. Speaker identity, gend...
Important problems in speech soft biometrics include the prediction of speaker's age or gender. Here...
The human voice is comprised of sound made by a human being using the vocal cord for talking,singing...
This paper presents a description of the INESC-ID Spoken Language Systems Laboratory (L2F) Age and G...
This paper presents a comparative study of four different ap-proaches to automatic age and gender cl...
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion...
Automatic age and gender recognition for speech applications is very important for a number of reaso...
This paper compares two approaches of automatic age and gen-der classification with 7 classes. The f...
Extracting speaker-dependent paralinguistic information out of a person's voice, provides an opportu...
This work deals with speaker´s age and gender recognition. At the beginning it introduces the practi...
We develop an acoustic feature set for the estimation of a per-son’s age from a recorded speech sign...
In this paper, we propose to use Gaussian mixture model (GMM) supervectors in a feed-forward deep ne...
A recent trend in speech processing is embedding generation models, which can encode much of the rel...
The most successful systems in previous comparative studies on speaker age recognition used short-te...
Speech signals carry important information about a speaker such as age, gender, language, accent and...
Automatic recognition of paralinguistic information from speech is important. Speaker identity, gend...
Important problems in speech soft biometrics include the prediction of speaker's age or gender. Here...
The human voice is comprised of sound made by a human being using the vocal cord for talking,singing...