© 2020 IEEE. A growing need for on-device machine learning has led to an increased interest in light-weight neural networks that lower model complexity while retaining performance. While a variety of general-purpose techniques exist in this context, very few approaches exploit domain-specific properties to further improve upon the capacity-performance trade-off. In this paper, extending our prior work [1], we train a network to emulate the behaviour of an audio codec and use this network to construct a loss. By approximating the psychoacoustic model underlying the codec, our approach enables light-weight neural networks to focus on perceptually relevant properties without wasting their limited capacity on imperceptible signal components. We...
Audio codecs generate notable artifacts when operating at low bitrates, which degrade the quality of...
OBJECTIVE: A hearing aid's noise reduction algorithm cannot infer to which speaker the user intends ...
The high levels goals of this thesis are to: understand the neural representation of sound, produce ...
A growing need for on-device machine learning has led to an increased interest in light-weight neura...
© 2019 Association for Computing Machinery. Generative audio models based on neural networks have le...
Engineers have pushed the boundaries of audio compression and designed numerous lossy audio compress...
Increasing digital storage and transmission of speech and audio necessitates the use of codecs that ...
This work investigates alternate pre-emphasis filters used as part of the loss function during neura...
Despite that L1 and L2 loss functions do not represent any perceptually-related information besides ...
Neural networks are used for the problem of music source separation from recordings. One such networ...
Estimating time-frequency domain masks for single-channel speech enhancement using deep learning met...
This dissertation explores compression techniques for neural networks to enable control of resource ...
AbstractWe train neural networks of varying depth with a loss function which imposes the output repr...
A method based on Deep Neural Networks (DNNs) and time-frequency masking has been recently developed...
This paper discusses if using Neural Networks we can develop model which emulates audio effects and ...
Audio codecs generate notable artifacts when operating at low bitrates, which degrade the quality of...
OBJECTIVE: A hearing aid's noise reduction algorithm cannot infer to which speaker the user intends ...
The high levels goals of this thesis are to: understand the neural representation of sound, produce ...
A growing need for on-device machine learning has led to an increased interest in light-weight neura...
© 2019 Association for Computing Machinery. Generative audio models based on neural networks have le...
Engineers have pushed the boundaries of audio compression and designed numerous lossy audio compress...
Increasing digital storage and transmission of speech and audio necessitates the use of codecs that ...
This work investigates alternate pre-emphasis filters used as part of the loss function during neura...
Despite that L1 and L2 loss functions do not represent any perceptually-related information besides ...
Neural networks are used for the problem of music source separation from recordings. One such networ...
Estimating time-frequency domain masks for single-channel speech enhancement using deep learning met...
This dissertation explores compression techniques for neural networks to enable control of resource ...
AbstractWe train neural networks of varying depth with a loss function which imposes the output repr...
A method based on Deep Neural Networks (DNNs) and time-frequency masking has been recently developed...
This paper discusses if using Neural Networks we can develop model which emulates audio effects and ...
Audio codecs generate notable artifacts when operating at low bitrates, which degrade the quality of...
OBJECTIVE: A hearing aid's noise reduction algorithm cannot infer to which speaker the user intends ...
The high levels goals of this thesis are to: understand the neural representation of sound, produce ...