In speaker recognition, deep neural networks deliver state-of-the-art performance due to their large capacities and powerful feature extraction abilities. However, this performance can be highly affected by interference from background noise and other speakers. This thesis focuses on new neural network architectures that are designed to overcome such interference and thereby improve the robustness of the speaker recognition system. In order to improve the noise robustness of the speaker recognition model, two novel network architectures are proposed. The first is the hierarchical attention network, which is able to capture both local and global features in order to improve the robustness of the network. The experimental results show i...
Despite the remarkable progress recently made in distant speech recognition, state-of-the-art techno...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
Speech enhancement, aiming at improving the intelligibility and overall perceptual quality of a cont...
In this paper, a novel architecture for speaker recognition is proposed by cascading speech enhancem...
Representation learning is a fundamental ingredient of deep learning. However, learning a good repre...
While the use of deep neural networks has significantly boosted speaker recognition performance, it ...
Recently, automatic speech recognition (ASR) systems that use deep neural networks (DNNs) for acoust...
The recent development of embedded platforms along with spectacular growth in communication networki...
Deep learning is an emerging technology that is considered one of the most promising directions for ...
In Automatic Speech Recognition (ASR) the non-linear data projection provided by a one hidden layer ...
In recent years, deep neural network models gained popularity as a modeling approach for many speech...
Deep learning and neural network research has grown significantly in the fields of automatic speech ...
The performance of automatic speech recognition (ASR) system can be enhanced by adaptation of the AS...
Extensive Research has been conducted on speech recognition and Speaker Recognition over the past fe...
Ph. D. Thesis.Monaural speech separation and enhancement aim to remove noise interference from the n...
Despite the remarkable progress recently made in distant speech recognition, state-of-the-art techno...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
Speech enhancement, aiming at improving the intelligibility and overall perceptual quality of a cont...
In this paper, a novel architecture for speaker recognition is proposed by cascading speech enhancem...
Representation learning is a fundamental ingredient of deep learning. However, learning a good repre...
While the use of deep neural networks has significantly boosted speaker recognition performance, it ...
Recently, automatic speech recognition (ASR) systems that use deep neural networks (DNNs) for acoust...
The recent development of embedded platforms along with spectacular growth in communication networki...
Deep learning is an emerging technology that is considered one of the most promising directions for ...
In Automatic Speech Recognition (ASR) the non-linear data projection provided by a one hidden layer ...
In recent years, deep neural network models gained popularity as a modeling approach for many speech...
Deep learning and neural network research has grown significantly in the fields of automatic speech ...
The performance of automatic speech recognition (ASR) system can be enhanced by adaptation of the AS...
Extensive Research has been conducted on speech recognition and Speaker Recognition over the past fe...
Ph. D. Thesis.Monaural speech separation and enhancement aim to remove noise interference from the n...
Despite the remarkable progress recently made in distant speech recognition, state-of-the-art techno...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
Speech enhancement, aiming at improving the intelligibility and overall perceptual quality of a cont...