We present AERO, a audio super-resolution model that processes speech and music signals in the spectral domain. AERO is based on an encoder-decoder architecture with U-Net like skip connections. We optimize the model using both time and frequency domain loss functions. Specifically, we consider a set of reconstruction losses together with perceptual ones in the form of adversarial and feature discriminator loss functions. To better handle phase information the proposed method operates over the complex-valued spectrogram using two separate channels. Unlike prior work which mainly considers low and high frequency concatenation for audio super-resolution, the proposed method directly predicts the full frequency range. We demonstrate high perfo...
Audio classification, as a set of important and challenging tasks, groups speech signals according t...
This paper introduces a Pure Data abstraction that implements the method of cepstral deconvolution b...
Speech generation and enhancement have seen recent breakthroughs in quality thanks to deep learning....
This thesis reports various attempts at applying generative deep neural networks to audio for the ta...
Audio super-resolution is a fundamental task that predicts high-frequency components for low-resolut...
To maintain a reasonable perceived quality and to reduce degradations, classical audio or speech sou...
The scope of this work is to introduce a conceptually simple yet effective algorithm for blind high-...
Models for audio source separation usually operate on the magnitude spectrum, which ignores phase in...
The use of perceptually based (lossy) audio codecs, like MPEG 1 - layer 3 (`mp3'), has become very p...
In audio processing applications, the generation of expressive sounds based on high-level representa...
International audienceMany interactive applications, such as video games, require processing a large...
Speech enhancement and source separation are related tasks that aim to extract and/or improve a sign...
Speech enhancement (SE) aims to improve speech quality and intelligibility by removing acoustic corr...
In this paper we present first experimental results with a novel audio coding technique based on app...
Audio compression is usually achieved with algorithms that exploit spectral properties of the given ...
Audio classification, as a set of important and challenging tasks, groups speech signals according t...
This paper introduces a Pure Data abstraction that implements the method of cepstral deconvolution b...
Speech generation and enhancement have seen recent breakthroughs in quality thanks to deep learning....
This thesis reports various attempts at applying generative deep neural networks to audio for the ta...
Audio super-resolution is a fundamental task that predicts high-frequency components for low-resolut...
To maintain a reasonable perceived quality and to reduce degradations, classical audio or speech sou...
The scope of this work is to introduce a conceptually simple yet effective algorithm for blind high-...
Models for audio source separation usually operate on the magnitude spectrum, which ignores phase in...
The use of perceptually based (lossy) audio codecs, like MPEG 1 - layer 3 (`mp3'), has become very p...
In audio processing applications, the generation of expressive sounds based on high-level representa...
International audienceMany interactive applications, such as video games, require processing a large...
Speech enhancement and source separation are related tasks that aim to extract and/or improve a sign...
Speech enhancement (SE) aims to improve speech quality and intelligibility by removing acoustic corr...
In this paper we present first experimental results with a novel audio coding technique based on app...
Audio compression is usually achieved with algorithms that exploit spectral properties of the given ...
Audio classification, as a set of important and challenging tasks, groups speech signals according t...
This paper introduces a Pure Data abstraction that implements the method of cepstral deconvolution b...
Speech generation and enhancement have seen recent breakthroughs in quality thanks to deep learning....