Speech technology has developed to levels equivalent with human parity through the use of deep neural networks. However, it is unclear how the learned dependencies within these networks can be attributed to metrics such as recognition performance. This research focuses on strategies to interpret and exploit these learned context dependencies to improve speech recognition models. Context dependency analysis had not yet been explored for speech recognition networks. In order to highlight and observe dependent representations within speech recognition models, a novel analysis framework is proposed. This analysis framework uses statistical correlation indexes to compute the coefficiency between neural representations. By comparing the coeffi...
End-to-end automatic speech recognition (ASR) models aim to learn a generalised speech representatio...
This paper demonstrates the significance of using contextual information in machine learning and spe...
Abstract. This paper argues that neural networks are good vehicles for automatic speech recognition ...
Language models are a critical component of an automatic speech recognition (ASR) system. Neural net...
Speech is at the core of human communication. Speaking and listing comes so natural to us that we do...
End-to-end automatic speech recognition (ASR) models aim to learn a generalised speech representatio...
Representation learning is a fundamental ingredient of deep learning. However, learning a good repre...
Although the importance of contextual information in speech recognition has been acknowledged for a ...
Introduction: In recent years, machines powered by deep learning have achieved near-human levels of ...
abstract: Audio signals, such as speech and ambient sounds convey rich information pertaining to a u...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
This paper argues that neural networks are good vehicles for automatic speech recognition not simply...
Emotion recognition in conversations is essential for ensuring advanced human-machine interactions. ...
Over these last few years, the use of Artificial Neural Networks (ANNs), now often referred to as de...
Automatic speech recognition system (ASR) contains three main parts: an acoustic model, a lexicon a...
End-to-end automatic speech recognition (ASR) models aim to learn a generalised speech representatio...
This paper demonstrates the significance of using contextual information in machine learning and spe...
Abstract. This paper argues that neural networks are good vehicles for automatic speech recognition ...
Language models are a critical component of an automatic speech recognition (ASR) system. Neural net...
Speech is at the core of human communication. Speaking and listing comes so natural to us that we do...
End-to-end automatic speech recognition (ASR) models aim to learn a generalised speech representatio...
Representation learning is a fundamental ingredient of deep learning. However, learning a good repre...
Although the importance of contextual information in speech recognition has been acknowledged for a ...
Introduction: In recent years, machines powered by deep learning have achieved near-human levels of ...
abstract: Audio signals, such as speech and ambient sounds convey rich information pertaining to a u...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
This paper argues that neural networks are good vehicles for automatic speech recognition not simply...
Emotion recognition in conversations is essential for ensuring advanced human-machine interactions. ...
Over these last few years, the use of Artificial Neural Networks (ANNs), now often referred to as de...
Automatic speech recognition system (ASR) contains three main parts: an acoustic model, a lexicon a...
End-to-end automatic speech recognition (ASR) models aim to learn a generalised speech representatio...
This paper demonstrates the significance of using contextual information in machine learning and spe...
Abstract. This paper argues that neural networks are good vehicles for automatic speech recognition ...