Feature representations derived from models pre-trained on large-scale datasets have shown their generalizability on a variety of audio analysis tasks. Despite this generalizability, however, task-specific features can outperform if sufficient training data is available, as specific task-relevant properties can be learned. Furthermore, the complex pre-trained models bring considerable computational burdens during inference. We propose to leverage both detailed task-specific features from spectrogram input and generic pre-trained features by introducing two regularization methods that integrate the information of both feature classes. The workload is kept low during inference as the pre-trained features are only necessary for training. In ex...
We propose a technique of training models for feature extraction using prior expectation of regions ...
Most machine learning models for audio tasks are dealing with a handcrafted feature, the spectrogram...
Methods for extracting audio and speech features have been studied since pioneering work on spectrum...
The integration of additional side information to improve music source separation has been investiga...
Pre-trained models are essential as feature extractors in modern machine learning systems in various...
In this work, we provide a broad comparative analysis of strategies for pre-training audio understan...
Labeling and maintaining a commercial sound effects library is a time-consuming task exacerbated by ...
Deep learning models have recently led to significant improvements in a wide variety of tasks. Known...
Generative models aim to understand the properties of data, through the construction of latent space...
Content-based music information retrieval tasks are typi-cally solved with a two-stage approach: fea...
We present a feature generation system designed to create audio features for supervised classificat...
International audienceThis paper investigates the use of supervised feature learning approaches for ...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
In many domains, such as artificial intelligence, computer vision, speech, and bioinformatics, featu...
Two important categories of machine learning method-ologies have recently attracted much interest in...
We propose a technique of training models for feature extraction using prior expectation of regions ...
Most machine learning models for audio tasks are dealing with a handcrafted feature, the spectrogram...
Methods for extracting audio and speech features have been studied since pioneering work on spectrum...
The integration of additional side information to improve music source separation has been investiga...
Pre-trained models are essential as feature extractors in modern machine learning systems in various...
In this work, we provide a broad comparative analysis of strategies for pre-training audio understan...
Labeling and maintaining a commercial sound effects library is a time-consuming task exacerbated by ...
Deep learning models have recently led to significant improvements in a wide variety of tasks. Known...
Generative models aim to understand the properties of data, through the construction of latent space...
Content-based music information retrieval tasks are typi-cally solved with a two-stage approach: fea...
We present a feature generation system designed to create audio features for supervised classificat...
International audienceThis paper investigates the use of supervised feature learning approaches for ...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
In many domains, such as artificial intelligence, computer vision, speech, and bioinformatics, featu...
Two important categories of machine learning method-ologies have recently attracted much interest in...
We propose a technique of training models for feature extraction using prior expectation of regions ...
Most machine learning models for audio tasks are dealing with a handcrafted feature, the spectrogram...
Methods for extracting audio and speech features have been studied since pioneering work on spectrum...