Computational Paralinguistics has several unresolved issues, one of which is coping with large variability due to speakers, spoken content and corpora. In this paper, we address the variability compensation issue by proposing a novel method composed of i) Fisher vector encoding of low level descriptors extracted from the signal, ii) speaker z-normalization applied after speaker clustering, iii) non-linear normalization of features and iv) classification based on Kernel Extreme Learning Machines and Partial Least Squares regression. For experimental validation, we apply the proposed method to the INTERSPEECH 2015 Computational Paralinguistics Challenge (ComParE 2015), Eating Condition sub-challenge, which is a seven-class classification task...
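Step ii) of the method above, speaker z-normalization applied after speaker clustering, can be illustrated with a minimal sketch. It assumes that each utterance is already represented by a fixed-length feature vector and that a clustering step has assigned it a cluster label. The function name `speaker_z_normalize`, the `eps` guard, and the toy data are hypothetical and not taken from the paper, which may compute the statistics differently.

```python
from collections import defaultdict
from statistics import mean, pstdev


def speaker_z_normalize(features, cluster_ids, eps=1e-8):
    """Z-normalize each feature dimension within each speaker cluster.

    features: list of equal-length lists of floats, one per utterance.
    cluster_ids: one hashable label per utterance (assumed to come from
    a separate speaker clustering step, which is not shown here).
    """
    # Group utterance indices by their assigned speaker cluster.
    groups = defaultdict(list)
    for idx, cid in enumerate(cluster_ids):
        groups[cid].append(idx)

    normalized = [None] * len(features)
    for indices in groups.values():
        # Transpose the cluster's rows to obtain per-dimension columns.
        columns = list(zip(*(features[i] for i in indices)))
        mus = [mean(col) for col in columns]
        sigmas = [pstdev(col) for col in columns]
        for i in indices:
            # eps avoids division by zero for constant dimensions.
            normalized[i] = [
                (x - mu) / (sigma + eps)
                for x, mu, sigma in zip(features[i], mus, sigmas)
            ]
    return normalized


# Toy data: two clusters whose features live on very different scales.
feats = [[1.0, 10.0], [3.0, 30.0], [100.0, 5.0], [300.0, 15.0]]
clusters = ["spk_a", "spk_a", "spk_b", "spk_b"]
normed = speaker_z_normalize(feats, clusters)
```

After normalization, both toy clusters have zero mean and roughly unit variance in every dimension, which removes the per-speaker offset and scale that would otherwise dominate the features passed to the classifier.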
A new front-end normalization algorithm that uses a parametric non-linear transformation is proposed...
Expressive richness in natural languages presents a significant challenge for statistical language m...
An important research direction in speech technology is robust cross-corpus and cross-language emoti...
17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016) -- S...
18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017) -- A...
Speech classifiers of paralinguistic traits traditionally learn from diverse h...
In this paper, we present a computational paralinguistic method for assessing whether a person has a...
This paper extends the within-class covariance normalization (WCCN) technique described in [1, 2] fo...
The burgeoning field of computational paralinguistics deals with the ways in which spoken words are ...
2018-12-13. Regularization is crucial to the success of many practical deep learning models, in partic...
The INTERSPEECH 2018 Computational Paralinguistics Challenge addresses four different prob...
In this position paper we present the FP7 ERC starting grant project iHEARu (Intelligent systems' H...
Proceedings of Interspeech 2008, Brisbane (Australia). This paper presents improvements in text-depend...
This work tests several classification techniques and acoustic features and further combines them us...
The goal in this work is to automatically classify speakers' level of cognitive load (low, medium, ...