International audienceWe propose a semi-supervised multichannel speech enhancement system based on a probabilistic model which assumes that both speech and noise follow the heavy-tailed multi-variate complex Cauchy distribution. As we advocate, this allows handling strong and adverse noisy conditions. Consequently, the model is parameterized by the source magnitude spectrograms and the source spatial scatter matrices. To deal with the non-additivity of scatter matrices, our first contribution is to perform the enhancement on a projected space. Then, our second contribution is to combine a latent variable model for speech, which is trained by following the variational autoencoder framework, with a low-rank model for the noise source. At test...
We present a novel structured variational inference algorithm for probabilistic speech separation. T...
International audienceThis work builds on a previous work on unsupervised speech enhancement using a...
Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explici...
International audienceWe propose a semi-supervised multichannel speech enhancement system based on a...
International audiencehis paper focuses on single-channel semi-supervised speech en-hancement...
We present a parametric model-based multichannel approach for speech enhancement. By employing an au...
The goal of this work is to generalize speech enhancement methods from single channel microphones, d...
International audienceThis paper introduces a new method for multichannel speech enhancement based o...
A new speech enhancement method based on Maximum A-Posteriori (MAP) estimation on Gaussian Mixture M...
International audienceThis paper presents a generative approach to speech enhancement based on a rec...
International audienceVariational auto-encoders (VAEs) are deep generative latent variable models th...
International audienceToday's smart devices using speaker verification are getting equipped with mul...
Submitted to IEEE/ACM Transactions on Audio, Speech, and Language ProcessingVariational auto-encoder...
Speech is a fundamental means of human communication. In the last several decades, much effort has b...
Statistical signal processing has been very successful. We proposed novel probabilistic models and d...
We present a novel structured variational inference algorithm for probabilistic speech separation. T...
International audienceThis work builds on a previous work on unsupervised speech enhancement using a...
Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explici...
International audienceWe propose a semi-supervised multichannel speech enhancement system based on a...
International audiencehis paper focuses on single-channel semi-supervised speech en-hancement...
We present a parametric model-based multichannel approach for speech enhancement. By employing an au...
The goal of this work is to generalize speech enhancement methods from single channel microphones, d...
International audienceThis paper introduces a new method for multichannel speech enhancement based o...
A new speech enhancement method based on Maximum A-Posteriori (MAP) estimation on Gaussian Mixture M...
International audienceThis paper presents a generative approach to speech enhancement based on a rec...
International audienceVariational auto-encoders (VAEs) are deep generative latent variable models th...
International audienceToday's smart devices using speaker verification are getting equipped with mul...
Submitted to IEEE/ACM Transactions on Audio, Speech, and Language ProcessingVariational auto-encoder...
Speech is a fundamental means of human communication. In the last several decades, much effort has b...
Statistical signal processing has been very successful. We proposed novel probabilistic models and d...
We present a novel structured variational inference algorithm for probabilistic speech separation. T...
International audienceThis work builds on a previous work on unsupervised speech enhancement using a...
Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explici...