Over the years, countless algorithms have been proposed to solve the problem of speech enhancement from a noisy mixture. Many have succeeded in improving at least parts of the signal, while often deteriorating others. Based on the assumption that different algo-rithms are likely to enjoy different qualities and suffer from differ-ent flaws, we investigate the possibility of combining the strengths of multiple speech enhancement algorithms, formulating the prob-lem in an ensemble learning framework. As a first example of such a system, we consider the prediction of a time-frequency mask ob-tained from the clean speech, based on the outputs of various al-gorithms applied on the noisy mixture. We consider several ap-proaches involving various ...
Abstract- Environmental audio classification has been the focus in the field of speech recognition. ...
It is known that applying a time-frequency binary mask to very noisy speech can improve its intellig...
Statistical signal processing has been very successful. We proposed novel probabilistic models and d...
Information theoretical concepts have been used in the analysis of human hearing and for the definit...
The time-frequency mask and the magnitude spectrum are two common targets for deep learning-based sp...
This paper investigates a number of ensemble methods for improv-ing the performance of phoneme class...
Abstract—We present a spectral domain, speech enhancement algorithm. The new algorithm is based on a...
Speech enhancement in stationary noise is addressed using the ideal channel selection framework. In ...
We address the question of whether and how boosting and bagging can be used for speech recognition. ...
Self-supervised learning (SSL) achieves great success in monaural speech enhancement, while the accu...
For many tasks in machine learning, performance gains can often be obtained by combining together an...
We address the question of whether and how boosting and bagging can be used for speech recognition....
Automatic speech recognition (ASR) is a technology that allows a computer and mobile device to recog...
This dissertation is about classification methods and class probability prediction. It can be roughl...
An optimal approach for enhancing a speech signal degraded by uncorrelated stationary additive noise...
Abstract- Environmental audio classification has been the focus in the field of speech recognition. ...
It is known that applying a time-frequency binary mask to very noisy speech can improve its intellig...
Statistical signal processing has been very successful. We proposed novel probabilistic models and d...
Information theoretical concepts have been used in the analysis of human hearing and for the definit...
The time-frequency mask and the magnitude spectrum are two common targets for deep learning-based sp...
This paper investigates a number of ensemble methods for improv-ing the performance of phoneme class...
Abstract—We present a spectral domain, speech enhancement algorithm. The new algorithm is based on a...
Speech enhancement in stationary noise is addressed using the ideal channel selection framework. In ...
We address the question of whether and how boosting and bagging can be used for speech recognition. ...
Self-supervised learning (SSL) achieves great success in monaural speech enhancement, while the accu...
For many tasks in machine learning, performance gains can often be obtained by combining together an...
We address the question of whether and how boosting and bagging can be used for speech recognition....
Automatic speech recognition (ASR) is a technology that allows a computer and mobile device to recog...
This dissertation is about classification methods and class probability prediction. It can be roughl...
An optimal approach for enhancing a speech signal degraded by uncorrelated stationary additive noise...
Abstract- Environmental audio classification has been the focus in the field of speech recognition. ...
It is known that applying a time-frequency binary mask to very noisy speech can improve its intellig...
Statistical signal processing has been very successful. We proposed novel probabilistic models and d...