An important research direction in speech technology is robust cross-corpus and cross-language emotion recognition. In this paper, we propose computationally efficient and performance effective feature normalization strategies for the challenging task of cross-corpus acoustic emotion recognition. We particularly deploy a cascaded normalization approach, combining linear speaker level, nonlinear value level and feature vector level normalization to minimize speaker-and corpus-related effects as well as to maximize class separability with linear kernel classifiers. We use extreme learning machine classifiers on five corpora representing five languages from different families, namely Danish, English, German, Russian and Turkish. Using a standa...
Pattern recognition tasks often face the situation that training data are not fully representative o...
In the age of information and automation, where the robotics sphere is increasing each day, and peop...
In this paper, we focus on a challenging, but interesting, task in speech emotion recognition (SER),...
13th International Symposium on Neural Networks (ISNN) -- JUL 06-08, 2016 -- Saint Petersburg, RUSSI...
Contending with signal variability due to source and channel effects is a critical problem in automa...
Abstract The performance of speech recognition systems trained with neutral utterances degrades sign...
The majority of existing speech emotion recognition research focuses on automatic emotion detection ...
In this thesis, we describe extensive experiments on the classification of emotions from speech usin...
To date, several methods have been explored for the challenging task of cross-language speech emotio...
To solve the problem of feature distribution discrepancy in cross-corpus speech emotion recognition ...
Affective computing is becoming increasingly significant in the interaction between humans and machi...
We use four speech databases with realistic, non-prompted emotions, and a large state-of-the-art aco...
Machine Learning (ML) algorithms within a human–computer framework are the leading force in speech e...
19th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2018) -- A...
This paper reports on mono- and cross-lingual performance of different acoustic and/or prosodic feat...
Pattern recognition tasks often face the situation that training data are not fully representative o...
In the age of information and automation, where the robotics sphere is increasing each day, and peop...
In this paper, we focus on a challenging, but interesting, task in speech emotion recognition (SER),...
13th International Symposium on Neural Networks (ISNN) -- JUL 06-08, 2016 -- Saint Petersburg, RUSSI...
Contending with signal variability due to source and channel effects is a critical problem in automa...
Abstract The performance of speech recognition systems trained with neutral utterances degrades sign...
The majority of existing speech emotion recognition research focuses on automatic emotion detection ...
In this thesis, we describe extensive experiments on the classification of emotions from speech usin...
To date, several methods have been explored for the challenging task of cross-language speech emotio...
To solve the problem of feature distribution discrepancy in cross-corpus speech emotion recognition ...
Affective computing is becoming increasingly significant in the interaction between humans and machi...
We use four speech databases with realistic, non-prompted emotions, and a large state-of-the-art aco...
Machine Learning (ML) algorithms within a human–computer framework are the leading force in speech e...
19th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2018) -- A...
This paper reports on mono- and cross-lingual performance of different acoustic and/or prosodic feat...
Pattern recognition tasks often face the situation that training data are not fully representative o...
In the age of information and automation, where the robotics sphere is increasing each day, and peop...
In this paper, we focus on a challenging, but interesting, task in speech emotion recognition (SER),...