In this paper we introduce StyleWaveGAN, a style-based drum sound generator that is a variation of StyleGAN, a state-of-the-art image generator. By conditioning StyleWaveGAN on both the type of drum and several audio descriptors, we are able to synthesize waveforms faster than real-time on a GPU directly in CD quality up to a duration of 1.5s while retaining a considerable amount of control over the generation. We also introduce an alternative to the progressive growing of GANs and experimented on the effect of dataset balancing for generative tasks. The experiments are carried out on an augmented subset of a publicly available dataset comprised of different drums and cymbals. We evaluate against two recent drum generators, WaveGAN and Neur...
Can we generate drum synthesizers automatically? We present an approach for the automatic generation...
Fast and user-controllable music generation could enable novel ways of composing or performing music...
In this work, we apply the CycleGAN image-to-image translation framework to Mel-scaled log-amplitude...
In this paper we introduce StyleWaveGAN, a style-based drum sound generator that is a variation of S...
Synthetic creation of drum sounds (e.g., in drum machines) is commonly performed using analog or dig...
Recent advancements in generative audio synthesis have allowed for the development of creative tools...
Synthetic creation of drum sounds (e.g., in drum machines) is commonly performed using analog or dig...
Neural audio synthesizers exploit deep learning as an alternative to traditional synthesizers that g...
Many recent approaches to creative transformations of musical audio have been motivated by the succe...
This paper proposes a new benchmark task for generat-ing musical passages in the audio domain by usi...
International audienceIn this paper we investigate into perceptual properties of Style-WaveGAN, a dr...
While generative adversarial networks (GANs) have been widely used in research on audio generation, ...
This paper proposes a new benchmark task for generating musical passages in the audio domain by usin...
In this study, we investigate the usage of generative adversarial networks for modelling a collectio...
Creative rhythmic transformations of musical audio refer to automated methods for manipulation of te...
Can we generate drum synthesizers automatically? We present an approach for the automatic generation...
Fast and user-controllable music generation could enable novel ways of composing or performing music...
In this work, we apply the CycleGAN image-to-image translation framework to Mel-scaled log-amplitude...
In this paper we introduce StyleWaveGAN, a style-based drum sound generator that is a variation of S...
Synthetic creation of drum sounds (e.g., in drum machines) is commonly performed using analog or dig...
Recent advancements in generative audio synthesis have allowed for the development of creative tools...
Synthetic creation of drum sounds (e.g., in drum machines) is commonly performed using analog or dig...
Neural audio synthesizers exploit deep learning as an alternative to traditional synthesizers that g...
Many recent approaches to creative transformations of musical audio have been motivated by the succe...
This paper proposes a new benchmark task for generat-ing musical passages in the audio domain by usi...
International audienceIn this paper we investigate into perceptual properties of Style-WaveGAN, a dr...
While generative adversarial networks (GANs) have been widely used in research on audio generation, ...
This paper proposes a new benchmark task for generating musical passages in the audio domain by usin...
In this study, we investigate the usage of generative adversarial networks for modelling a collectio...
Creative rhythmic transformations of musical audio refer to automated methods for manipulation of te...
Can we generate drum synthesizers automatically? We present an approach for the automatic generation...
Fast and user-controllable music generation could enable novel ways of composing or performing music...
In this work, we apply the CycleGAN image-to-image translation framework to Mel-scaled log-amplitude...