This thesis takes the classical signal processing problem of separating the speech of a target speaker from a real-world audio recording containing noise, background interference — from competing speech or other non-speech sources —, and reverberation, and seeks data-driven solutions based on supervised learning methods, particularly recurrent neural networks (RNNs). Such speech separation methods can inject robustness in automatic speech recognition (ASR) systems and have been an active area of research for the past two decades. We particularly focus on applications where multi-channel recordings are available. Stand-alone beamformers cannot simultaneously suppress diffuse-noise and protect the desired signal from any distortions. Post-...
Separation of speech embedded in non-stationary interference is a challenging problem that has recen...
Abstract—This paper describes an in-depth investigation of training criteria, network architectures ...
Separation of speech embedded in non-stationary interference is a challenging problem that has recen...
This thesis takes the classical signal processing problem of separating the speech of a target speak...
International audienceWe evaluate some recent developments in recurrent neural network (RNN) based s...
In real world environments, the speech signals received by our ears are usually a combination of dif...
In this paper, we compare different deep neural networks (DNN) in extracting speech signals from com...
Speech separation algorithms are faced with a difficult task of producing high degree of separation ...
Many speech technologies, such as automatic speech recognition and speaker identification, are conve...
Separation of speech mixtures in noisy and reverberant environments remains a challenging task for s...
Despite the recent progress of automatic speech recognition (ASR) driven by deep learning, conversat...
International audienceWe present a source separation system for high-order ambisonics (HOA) contents...
We propose TF-GridNet for speech separation. The model is a novel multi-path deep neural network (DN...
This paper proposes a neural network based system for multi-channel speech enhancement and dereverbe...
Ph. D. Thesis.Monaural speech separation and enhancement aim to remove noise interference from the n...
Separation of speech embedded in non-stationary interference is a challenging problem that has recen...
Abstract—This paper describes an in-depth investigation of training criteria, network architectures ...
Separation of speech embedded in non-stationary interference is a challenging problem that has recen...
This thesis takes the classical signal processing problem of separating the speech of a target speak...
International audienceWe evaluate some recent developments in recurrent neural network (RNN) based s...
In real world environments, the speech signals received by our ears are usually a combination of dif...
In this paper, we compare different deep neural networks (DNN) in extracting speech signals from com...
Speech separation algorithms are faced with a difficult task of producing high degree of separation ...
Many speech technologies, such as automatic speech recognition and speaker identification, are conve...
Separation of speech mixtures in noisy and reverberant environments remains a challenging task for s...
Despite the recent progress of automatic speech recognition (ASR) driven by deep learning, conversat...
International audienceWe present a source separation system for high-order ambisonics (HOA) contents...
We propose TF-GridNet for speech separation. The model is a novel multi-path deep neural network (DN...
This paper proposes a neural network based system for multi-channel speech enhancement and dereverbe...
Ph. D. Thesis.Monaural speech separation and enhancement aim to remove noise interference from the n...
Separation of speech embedded in non-stationary interference is a challenging problem that has recen...
Abstract—This paper describes an in-depth investigation of training criteria, network architectures ...
Separation of speech embedded in non-stationary interference is a challenging problem that has recen...