Accepted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022)International audienceThe success of deep learning comes from its ability to capture the hierarchical structure of data by learning high-level representations defined in terms of low-level ones. In this paper we explore self-supervised learning of hierarchical representations of speech by applying multiple levels of Contrastive Predictive Coding (CPC). We observe that simply stacking two CPC models does not yield significant improvements over single-level architectures. Inspired by the fact that speech is often described as a sequence of discrete units unevenly distributed in time, we propose a model in which the output of a low-level CPC module is non-unifor...
Continuous monitoring with an ever-increasing number of sensors has become ubiquitous across many ap...
In this paper we consider speech coding as a problem of speech modelling. In particular, prediction ...
The current monaural state of the art tools for speech separation relies on supervised learning. Thi...
To extract robust deep representations from long sequential modeling of speech data, we propose a se...
The auditory pathway consists of multiple stages, from the cochlear nucleus to the auditory cortex. ...
The auditory pathway consists of multiple stages, from the cochlear nucleus to the auditory cortex. ...
Deep neural networks trained with supervised learning algorithms on large amounts of labeled speech ...
For a language with no transcribed speech available (the zero-resource scenario), conventional acous...
.<F3.733e+05> It is demonstrated that a componential code emerges when a self-organising neura...
Transformer has recently become one of the most popular deep learning models often utilized for proc...
International audienceSelf-supervised models for speech processing form representational spaces with...
International audienceSeveral deep neural networks have recently been shown to generate activations ...
Learning in the brain is poorly understood and learning rules that respect biological constraints, y...
International audienceConsiderable progress has recently been made in natural language processing: d...
Although supervised deep learning has revolutionized speech and audio processing, it has necessitate...
Continuous monitoring with an ever-increasing number of sensors has become ubiquitous across many ap...
In this paper we consider speech coding as a problem of speech modelling. In particular, prediction ...
The current monaural state of the art tools for speech separation relies on supervised learning. Thi...
To extract robust deep representations from long sequential modeling of speech data, we propose a se...
The auditory pathway consists of multiple stages, from the cochlear nucleus to the auditory cortex. ...
The auditory pathway consists of multiple stages, from the cochlear nucleus to the auditory cortex. ...
Deep neural networks trained with supervised learning algorithms on large amounts of labeled speech ...
For a language with no transcribed speech available (the zero-resource scenario), conventional acous...
.<F3.733e+05> It is demonstrated that a componential code emerges when a self-organising neura...
Transformer has recently become one of the most popular deep learning models often utilized for proc...
International audienceSelf-supervised models for speech processing form representational spaces with...
International audienceSeveral deep neural networks have recently been shown to generate activations ...
Learning in the brain is poorly understood and learning rules that respect biological constraints, y...
International audienceConsiderable progress has recently been made in natural language processing: d...
Although supervised deep learning has revolutionized speech and audio processing, it has necessitate...
Continuous monitoring with an ever-increasing number of sensors has become ubiquitous across many ap...
In this paper we consider speech coding as a problem of speech modelling. In particular, prediction ...
The current monaural state of the art tools for speech separation relies on supervised learning. Thi...