Abstract Speech emotion recognition (SER) is a hot topic in speech signal processing. When the training data and the test data come from different corpus, their feature distributions are different, which leads to the degradation of the recognition performance. Therefore, in order to solve this problem, a cross-corpus speech emotion recognition method is proposed based on subspace learning and domain adaptation in this paper. Specifically, training set data and the test set data are used to form the source domain and target domain, respectively. Then, the Hessian matrix is introduced to obtain the subspace for the extracted features in both source and target domains. In addition, an information entropy-based domain adaption method is introdu...
Obtaining large, human labelled speech datasets to train models for emotion recognition is a notorio...
For speech emotion datasets, it has been difficult to acquire large quantities of reliable data and ...
Obtaining large, human labelled speech datasets to train models for emotion recognition is a notorio...
AbstractIn this paper, we investigate an interesting problem, i.e., unsupervised cross-corpus speech...
In many practical applications, a speech emotion recognition model learned on a source (training) do...
In this paper, we focus on a challenging, but interesting, task in speech emotion recognition (SER),...
To solve the problem of feature distribution discrepancy in cross-corpus speech emotion recognition ...
In this paper, we investigate the problem of cross-corpus speech emotion recognition (SER), in which...
The majority of existing speech emotion recognition research focuses on automatic emotion detection ...
Despite the recent advancement in speech emotion recognition (SER) within a single corpus setting, t...
To date, several methods have been explored for the challenging task of cross-language speech emotio...
An important research direction in speech technology is robust cross-corpus and cross-language emoti...
One of the serious obstacles to the applications of speech emotion recognition systems in real-life ...
The absence of labeled samples limits the development of speech emotion recognition (SER). Data augm...
Abstract—In speech emotion recognition, training and test data used for system development usually t...
Obtaining large, human labelled speech datasets to train models for emotion recognition is a notorio...
For speech emotion datasets, it has been difficult to acquire large quantities of reliable data and ...
Obtaining large, human labelled speech datasets to train models for emotion recognition is a notorio...
AbstractIn this paper, we investigate an interesting problem, i.e., unsupervised cross-corpus speech...
In many practical applications, a speech emotion recognition model learned on a source (training) do...
In this paper, we focus on a challenging, but interesting, task in speech emotion recognition (SER),...
To solve the problem of feature distribution discrepancy in cross-corpus speech emotion recognition ...
In this paper, we investigate the problem of cross-corpus speech emotion recognition (SER), in which...
The majority of existing speech emotion recognition research focuses on automatic emotion detection ...
Despite the recent advancement in speech emotion recognition (SER) within a single corpus setting, t...
To date, several methods have been explored for the challenging task of cross-language speech emotio...
An important research direction in speech technology is robust cross-corpus and cross-language emoti...
One of the serious obstacles to the applications of speech emotion recognition systems in real-life ...
The absence of labeled samples limits the development of speech emotion recognition (SER). Data augm...
Abstract—In speech emotion recognition, training and test data used for system development usually t...
Obtaining large, human labelled speech datasets to train models for emotion recognition is a notorio...
For speech emotion datasets, it has been difficult to acquire large quantities of reliable data and ...
Obtaining large, human labelled speech datasets to train models for emotion recognition is a notorio...