The curse of dimensionality is a well-established phenomenon. However, the properties of high dimensional data are often poorly understood and overlooked during the process of data modelling and analysis. Similarly, how to optimally fuse different modalities is still a big research question. In this paper, we addressed these challenges by proposing a novel two level brain-inspired compression based optimised multimodal fusion framework for emotion recognition. In the first level, the framework extracts the compressed and optimised multimodal features by applying a deep convolutional neural network (CNN) based compression on each modality (i.e. audio, text, and visuals). The second level simply concatenates the extracted optimised and compre...
Automatic emotion recognition is a challenging task since emotion is communicated through different ...
To interact naturally and achieve mutual sympathy between humans and machines, emotion recognition i...
In this contribution, we investigate the effectiveness of deep fusion of text and audio features for...
The curse of dimensionality is a well-established phenomenon. However, the properties of high dimens...
Nowadays, human emotion recognition is a mandatory task for many human machine interaction fields. T...
We present our system description of input-levelmultimodal fusion of audio, video, and text forrecog...
Emotion recognition has become one of the most researched subjects in the scientific community, espe...
Humans express their emotions in a variety of ways, which inspires research on multimodal fusion-bas...
Emotion recognition is an increasingly important sub-field in artificial intelligence (AI). Advances...
In order to overcome the limitation of single mode emotion recognition. This paper describes a novel...
Emotion understanding represents a core aspect of human communication. Our social behaviours are clo...
The understanding of a psychological phenomena such as emotion is of paramount importance for psycho...
Multimodal emotion recognition has attracted great interest recently and numerous methodologies have...
Abstract Automatic affect recognition is a challenging task due to the various modalities emotions ...
Automatic speech emotion recognition (SER) by a computer is a critical component for more natural hu...
Automatic emotion recognition is a challenging task since emotion is communicated through different ...
To interact naturally and achieve mutual sympathy between humans and machines, emotion recognition i...
In this contribution, we investigate the effectiveness of deep fusion of text and audio features for...
The curse of dimensionality is a well-established phenomenon. However, the properties of high dimens...
Nowadays, human emotion recognition is a mandatory task for many human machine interaction fields. T...
We present our system description of input-levelmultimodal fusion of audio, video, and text forrecog...
Emotion recognition has become one of the most researched subjects in the scientific community, espe...
Humans express their emotions in a variety of ways, which inspires research on multimodal fusion-bas...
Emotion recognition is an increasingly important sub-field in artificial intelligence (AI). Advances...
In order to overcome the limitation of single mode emotion recognition. This paper describes a novel...
Emotion understanding represents a core aspect of human communication. Our social behaviours are clo...
The understanding of a psychological phenomena such as emotion is of paramount importance for psycho...
Multimodal emotion recognition has attracted great interest recently and numerous methodologies have...
Abstract Automatic affect recognition is a challenging task due to the various modalities emotions ...
Automatic speech emotion recognition (SER) by a computer is a critical component for more natural hu...
Automatic emotion recognition is a challenging task since emotion is communicated through different ...
To interact naturally and achieve mutual sympathy between humans and machines, emotion recognition i...
In this contribution, we investigate the effectiveness of deep fusion of text and audio features for...