International audienceWe present a source separation system for high-order ambisonics (HOA) contents. We derive a multichannel spatial filter from a mask estimated by a long short-term memory (LSTM) recurrent neural network. We combine one channel of the mixture with the outputs of basic HOA beamformers as inputs to the LSTM, assuming that we know the directions of arrival of the directional sources. In our experiments, the speech of interest can be corrupted either by diffuse noise or by an equally loud competing speaker. We show that adding as input the output of the beamformer steered toward the competing speech in addition to that of the beamformer steered toward the target speech brings significant improvements in terms of word error r...
International audienceSpeech separation with several speakers is a challenging task because of the n...
International audienceIn this article we investigate how the local Gaussian model (LGM) can be appli...
Separation of speech embedded in non-stationary interference is a challenging problem that has recen...
This thesis takes the classical signal processing problem of separating the speech of a target speak...
Speech separation algorithms are faced with a difficult task of producing high degree of separation ...
This thesis takes the classical signal processing problem of separating the speech of a target speak...
The problems of speech separation and enhancement concern the extraction of the speech emitted by a ...
The problems of speech separation and enhancement concern the extraction of the speech emitted by a ...
In recent research, deep neural network (DNN) has been used to solve the monaural source separation ...
Most deep-learning-based multi-channel speech enhancement methods focus on designing a set of beamfo...
Comunicació presentada a la 12th ITG Conference on Speech Communication, celebrada els dies 5 a 7 d'...
Comunicació presentada a la 12th ITG Conference on Speech Communication, celebrada els dies 5 a 7 d'...
Despite the recent progress of automatic speech recognition (ASR) driven by deep learning, conversat...
International audienceIn this article we investigate how the local Gaussian model (LGM) can be appli...
International audienceIn this article we investigate how the local Gaussian model (LGM) can be appli...
International audienceSpeech separation with several speakers is a challenging task because of the n...
International audienceIn this article we investigate how the local Gaussian model (LGM) can be appli...
Separation of speech embedded in non-stationary interference is a challenging problem that has recen...
This thesis takes the classical signal processing problem of separating the speech of a target speak...
Speech separation algorithms are faced with a difficult task of producing high degree of separation ...
This thesis takes the classical signal processing problem of separating the speech of a target speak...
The problems of speech separation and enhancement concern the extraction of the speech emitted by a ...
The problems of speech separation and enhancement concern the extraction of the speech emitted by a ...
In recent research, deep neural network (DNN) has been used to solve the monaural source separation ...
Most deep-learning-based multi-channel speech enhancement methods focus on designing a set of beamfo...
Comunicació presentada a la 12th ITG Conference on Speech Communication, celebrada els dies 5 a 7 d'...
Comunicació presentada a la 12th ITG Conference on Speech Communication, celebrada els dies 5 a 7 d'...
Despite the recent progress of automatic speech recognition (ASR) driven by deep learning, conversat...
International audienceIn this article we investigate how the local Gaussian model (LGM) can be appli...
International audienceIn this article we investigate how the local Gaussian model (LGM) can be appli...
International audienceSpeech separation with several speakers is a challenging task because of the n...
International audienceIn this article we investigate how the local Gaussian model (LGM) can be appli...
Separation of speech embedded in non-stationary interference is a challenging problem that has recen...