In this paper we explore the synthesis of sound effects using conditional generative adversarial networks (cGANs). We commissioned Foley artist Ulf Olausson to record a dataset of knocking sound effects with different emotions and trained a cGAN on it. We analysed the resulting synthesised sound effects by comparing their temporal acoustic features to the original dataset and by performing an online listening test. Results show that the acoustic features of the synthesised sounds are similar to those of the recorded dataset. Additionally, the listening test results show that the synthesised sounds can be identified by people with experience in sound design, but the model is not far from fooling non-experts. Moreover, on average most emotion...
This paper adapts a StyleGAN model for speech generation with minimal or no conditioning on text. St...
Audio-visual emotion recognition is the research of identifying human emotional states by combining ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The dataset was recorded by the professional foley artist Ulf Olausson at the FoleyWorks (http://fol...
In this study, we investigate the usage of generative adversarial networks for modelling a collectio...
Over recent years generative models utilizing deep neural networks have demonstrated outstanding cap...
Knocking sounds are highly meaningful everyday sounds. There exist many ways of knocking, expressing...
Knocking sounds are highly meaningful everyday sounds. There exist many ways of knocking, expressing...
Several attempts have been made to synthesize speech from text. However, existing methods tend to ge...
Recent advancements in generative audio synthesis have allowed for the development of creative tools...
Single-image generative adversarial networks learn from the internal distribution of a single traini...
Synthetic creation of drum sounds (e.g., in drum machines) is commonly performed using analog or dig...
Generative Adversarial Networks (GANs) have achieved excellent audio synthesis quality in the last y...
To improve the diversity and quality of sound mimicry of electric automobile engines, a generative a...
In this paper we introduce StyleWaveGAN, a style-based drum sound generator that is a variation of S...
This paper adapts a StyleGAN model for speech generation with minimal or no conditioning on text. St...
Audio-visual emotion recognition is the research of identifying human emotional states by combining ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The dataset was recorded by the professional foley artist Ulf Olausson at the FoleyWorks (http://fol...
In this study, we investigate the usage of generative adversarial networks for modelling a collectio...
Over recent years generative models utilizing deep neural networks have demonstrated outstanding cap...
Knocking sounds are highly meaningful everyday sounds. There exist many ways of knocking, expressing...
Knocking sounds are highly meaningful everyday sounds. There exist many ways of knocking, expressing...
Several attempts have been made to synthesize speech from text. However, existing methods tend to ge...
Recent advancements in generative audio synthesis have allowed for the development of creative tools...
Single-image generative adversarial networks learn from the internal distribution of a single traini...
Synthetic creation of drum sounds (e.g., in drum machines) is commonly performed using analog or dig...
Generative Adversarial Networks (GANs) have achieved excellent audio synthesis quality in the last y...
To improve the diversity and quality of sound mimicry of electric automobile engines, a generative a...
In this paper we introduce StyleWaveGAN, a style-based drum sound generator that is a variation of S...
This paper adapts a StyleGAN model for speech generation with minimal or no conditioning on text. St...
Audio-visual emotion recognition is the research of identifying human emotional states by combining ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...