Spatial filters can exploit deep-learning-based speech enhancement models to increase their reliability in scenarios with multiple speech sources scenarios. To further improve speech quality, it is common to perform postfiltering on the estimated target speech obtained with spatial filtering. In this work, Minimum Variance Distortionless Response (MVDR) is employed to provide the interference estimation, along with the estimation of the target speech, to be later used for postfiltering. This improves the enhancement performance over a single-input baseline in a far more significant way than by increasing the model's complexity. Results suggest that less computing resources are required for postfiltering when provided with both target and in...
This paper investigates the problem of enhancing a single desired speech source from a mixture of si...
In conventional multichannel audio signal enhancement, spatial and spectral filtering are often perf...
With the advancements in deep learning approaches, the performance of speech enhancing systems in th...
Multi-frame approaches for single-microphone speech enhancement, e.g., the multi-frame minimum-varia...
This paper describes multichannel speech enhancement for improving automatic speech recognition (ASR...
Compared with single-channel speech enhancement methods, multichannel methods can utilize spatial in...
This paper describes the practical response- and performance-aware development of online speech enha...
This paper proposes a neural network based system for multi-channel speech enhancement and dereverbe...
Speech enhancement has an increasing demand in mobile communications and faces a great challenge in ...
In a scenario with multiple persons talking simultaneously, the spatial characteristics of the signa...
In speech communication systems, the microphone signals are degraded by reverberation and ambient no...
This paper describes a practical dual-process speech enhancement system that adapts environment-sens...
Distortion resulting from acoustic echo suppression (AES) is a common issue in full-duplex communica...
For single-microphone noise reduction, a minimum variance distortionless response (MVDR) filter has ...
Most deep-learning-based multi-channel speech enhancement methods focus on designing a set of beamfo...
This paper investigates the problem of enhancing a single desired speech source from a mixture of si...
In conventional multichannel audio signal enhancement, spatial and spectral filtering are often perf...
With the advancements in deep learning approaches, the performance of speech enhancing systems in th...
Multi-frame approaches for single-microphone speech enhancement, e.g., the multi-frame minimum-varia...
This paper describes multichannel speech enhancement for improving automatic speech recognition (ASR...
Compared with single-channel speech enhancement methods, multichannel methods can utilize spatial in...
This paper describes the practical response- and performance-aware development of online speech enha...
This paper proposes a neural network based system for multi-channel speech enhancement and dereverbe...
Speech enhancement has an increasing demand in mobile communications and faces a great challenge in ...
In a scenario with multiple persons talking simultaneously, the spatial characteristics of the signa...
In speech communication systems, the microphone signals are degraded by reverberation and ambient no...
This paper describes a practical dual-process speech enhancement system that adapts environment-sens...
Distortion resulting from acoustic echo suppression (AES) is a common issue in full-duplex communica...
For single-microphone noise reduction, a minimum variance distortionless response (MVDR) filter has ...
Most deep-learning-based multi-channel speech enhancement methods focus on designing a set of beamfo...
This paper investigates the problem of enhancing a single desired speech source from a mixture of si...
In conventional multichannel audio signal enhancement, spatial and spectral filtering are often perf...
With the advancements in deep learning approaches, the performance of speech enhancing systems in th...