In this paper, we propose a framework for joint normalization of spectral and temporal statistics of speech features for robust speech recognition. Current feature normalization approaches normalize the spectral and temporal aspects of feature statistics separately to overcome noise and reverberation. As a result, the interaction between the spectral normalization (e.g. mean and variance normalization, MVN) and temporal normalization (e.g. temporal structure normalization, TSN) is ignored. We propose a joint spectral and temporal normalization (JSTN) framework to simultaneously normalize these two aspects of feature statistics. In JSTN, feature trajectories are filtered by linear filters and the filters' coefficients are optimized by maximi...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
Automatic speech recognition (ASR) decodes speech signals into text. While ASR can produce accurate ...
Abstract: The changing on peaks structure of the speech spectrum is perhaps the most important cause...
In this paper, we propose a framework for joint normalization of spectral and temporal statistics of...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
This thesis examines techniques to improve the robustness of automatic speech recogni-tion (ASR) sys...
[[abstract]]Data-driven temporal filtering approaches based on a specific optimization technique hav...
[[abstract]]Data-driven temporal filtering approaches based on a specific optimization technique hav...
[[abstract]]Data-driven temporal filtering approaches based on a specific optimization technique hav...
This paper presents an improved version of a spectral normalisation based method for extraction of s...
[[abstract]]This letter proposes a novel scheme that applies feature statistics normalization techni...
[[abstract]]This letter proposes a novel scheme that applies feature statistics normalization techni...
[[abstract]]This letter proposes a novel scheme that applies feature statistics normalization techni...
[[abstract]]This letter proposes a novel scheme that applies feature statistics normalization techni...
Automatic speech recognition (ASR) decodes speech signals into text. While ASR can produce accurate ...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
Automatic speech recognition (ASR) decodes speech signals into text. While ASR can produce accurate ...
Abstract: The changing on peaks structure of the speech spectrum is perhaps the most important cause...
In this paper, we propose a framework for joint normalization of spectral and temporal statistics of...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
This thesis examines techniques to improve the robustness of automatic speech recogni-tion (ASR) sys...
[[abstract]]Data-driven temporal filtering approaches based on a specific optimization technique hav...
[[abstract]]Data-driven temporal filtering approaches based on a specific optimization technique hav...
[[abstract]]Data-driven temporal filtering approaches based on a specific optimization technique hav...
This paper presents an improved version of a spectral normalisation based method for extraction of s...
[[abstract]]This letter proposes a novel scheme that applies feature statistics normalization techni...
[[abstract]]This letter proposes a novel scheme that applies feature statistics normalization techni...
[[abstract]]This letter proposes a novel scheme that applies feature statistics normalization techni...
[[abstract]]This letter proposes a novel scheme that applies feature statistics normalization techni...
Automatic speech recognition (ASR) decodes speech signals into text. While ASR can produce accurate ...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
Automatic speech recognition (ASR) decodes speech signals into text. While ASR can produce accurate ...
Abstract: The changing on peaks structure of the speech spectrum is perhaps the most important cause...