Over recent years generative models utilizing deep neural networks have demonstrated outstanding capacity in synthesizing high-quality and plausible human speech and music. The majority of research in neural audio synthesis (NAS) targets speech or music, whereas general sound effects such as environmental sounds or Foley sounds have received less attention. In this work, we study the generative performance of NAS models for sound effects with a conditional Wasserstein GAN (WGAN) model. We train our models conditioned on different classes of sound effects and report on their performances in terms of quality and diversity. Many existing GAN models use magnitude spectrograms which require audio reconstruction using phase estimation after train...
Creating variations of sound effects for video games is a time-consuming task that grows with the si...
Emotion recognition through audio is a rather challenging task that entails proper feature extractio...
To improve the diversity and quality of sound mimicry of electric automobile engines, a generative a...
Generative Adversarial Networks (GANs) currently achieve the state-of-the-art sound synthesis qualit...
In this study, we investigate the usage of generative adversarial networks for modelling a collectio...
In this paper we explore the synthesis of sound effects using conditional generative adversarial net...
Single-image generative adversarial networks learn from the internal distribution of a single traini...
Deep generative models have recently achieved impressive performance in speech and music synthesis. ...
© 2019 Association for Computing Machinery. Generative audio models based on neural networks have le...
Listening in noise is a challenging problem that affects the hearing capability of not only normal h...
This file was last viewed in Adobe Acrobat Pro.Training neural networks require sizeable datasets fo...
Recent advancements in generative audio synthesis have allowed for the development of creative tools...
While generative adversarial networks (GANs) have been widely used in research on audio generation, ...
Recent advances in neural network -based text-to-speech have reached human level naturalness in synt...
Acoustic scene generation (ASG) is a task to generate waveforms for acoustic scenes. ASG can be used...
Creating variations of sound effects for video games is a time-consuming task that grows with the si...
Emotion recognition through audio is a rather challenging task that entails proper feature extractio...
To improve the diversity and quality of sound mimicry of electric automobile engines, a generative a...
Generative Adversarial Networks (GANs) currently achieve the state-of-the-art sound synthesis qualit...
In this study, we investigate the usage of generative adversarial networks for modelling a collectio...
In this paper we explore the synthesis of sound effects using conditional generative adversarial net...
Single-image generative adversarial networks learn from the internal distribution of a single traini...
Deep generative models have recently achieved impressive performance in speech and music synthesis. ...
© 2019 Association for Computing Machinery. Generative audio models based on neural networks have le...
Listening in noise is a challenging problem that affects the hearing capability of not only normal h...
This file was last viewed in Adobe Acrobat Pro.Training neural networks require sizeable datasets fo...
Recent advancements in generative audio synthesis have allowed for the development of creative tools...
While generative adversarial networks (GANs) have been widely used in research on audio generation, ...
Recent advances in neural network -based text-to-speech have reached human level naturalness in synt...
Acoustic scene generation (ASG) is a task to generate waveforms for acoustic scenes. ASG can be used...
Creating variations of sound effects for video games is a time-consuming task that grows with the si...
Emotion recognition through audio is a rather challenging task that entails proper feature extractio...
To improve the diversity and quality of sound mimicry of electric automobile engines, a generative a...