Machine learning algorithms have achieved the state-of-the-art results by utilizing deep neural networks (DNNs) across different tasks in recent years. However, the performance of DNNs suffers from mismatched conditions between training and test datasets. This is a general machine learning problem in all applications such as machine vision, natural language processing, and audio processing. For instance, the usage of different recording devices and ambient noises can be referred to as some of the causing factors for mismatched conditions between training and test datasets in audio classification tasks. Due to mismatched conditions, a well-performed DNN model in training phase encounters a decrease in performance when evaluated on unseen dat...
Adaptation to speaker variations is an essential component of speech recognition systems. One common...
Despite the recent success of deep neural network-based approaches in sound source localization, the...
Existing acoustic scene classification (ASC) systems often fail to generalize across different recor...
Machine learning algorithms have achieved the state-of-the-art results by utilizing deep neural netw...
Distribution mismatches between the data seen at training and at application time remain a major cha...
In classification tasks, the classification accuracy diminishes when the data is gathered in differe...
International audienceAcoustic scene classification systems face performance degradation when traine...
Electronic ISBN:978-1-7281-1123-0International audienceA challenging problem in deep learning-based ...
A speaker embeddings framework achieves state-of-the-art speaker recognition performance by modeling...
This paper presents a domain adaptation model for sound event detection. A common challenge for soun...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
Speech enhancement directly using deep neural network (DNN) is of major interest due to the capabili...
Abstract—In acoustic modeling, speaker adaptive training (SAT) has been a long-standing technique fo...
In recent years deep learning has become one of the most popular machine learning techniques for a ...
Deep Neural Networks (DNNs) trained on one dataset (source domain) do not perform well on another se...
Adaptation to speaker variations is an essential component of speech recognition systems. One common...
Despite the recent success of deep neural network-based approaches in sound source localization, the...
Existing acoustic scene classification (ASC) systems often fail to generalize across different recor...
Machine learning algorithms have achieved the state-of-the-art results by utilizing deep neural netw...
Distribution mismatches between the data seen at training and at application time remain a major cha...
In classification tasks, the classification accuracy diminishes when the data is gathered in differe...
International audienceAcoustic scene classification systems face performance degradation when traine...
Electronic ISBN:978-1-7281-1123-0International audienceA challenging problem in deep learning-based ...
A speaker embeddings framework achieves state-of-the-art speaker recognition performance by modeling...
This paper presents a domain adaptation model for sound event detection. A common challenge for soun...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
Speech enhancement directly using deep neural network (DNN) is of major interest due to the capabili...
Abstract—In acoustic modeling, speaker adaptive training (SAT) has been a long-standing technique fo...
In recent years deep learning has become one of the most popular machine learning techniques for a ...
Deep Neural Networks (DNNs) trained on one dataset (source domain) do not perform well on another se...
Adaptation to speaker variations is an essential component of speech recognition systems. One common...
Despite the recent success of deep neural network-based approaches in sound source localization, the...
Existing acoustic scene classification (ASC) systems often fail to generalize across different recor...