In this work, we show how speaker-independent CDHMM word recognition performance can be significantly improved for clean speech by filtering the time sequence of spectral parameters to enhance its time dynamics. Experimental results with the standard TI connected digits database show that the filter can reduce the string recognition error by more than 30%. As shown in this paper, that improvement is partially due to the reduction in speaker variability obtained by attenuating the very low modulation frequencies. The widely used cepstral mean subtraction technique also improves the recognition rate, but it cannot achieve as noticeable an improvement as the parameter filter. In fact, the best results are obtained when the peak of the long-te...
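To make the contrast in the abstract above concrete, the following is a minimal sketch of the two operations it compares: cepstral mean subtraction, which removes only the constant (zero modulation frequency) component of each parameter trajectory, and a filter along time that attenuates very low modulation frequencies more broadly. The excerpt does not specify the filter the authors used, so the first-order high-pass form, the `pole` value of 0.9, and both function names are assumptions made purely for illustration.

```python
def cepstral_mean_subtraction(frames):
    """Subtract each coefficient's mean over time from every frame.

    frames: list of frames, each a list of spectral parameters.
    Only the constant part of each time trajectory is removed.
    """
    n = len(frames)
    dims = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dims)]
    return [[f[d] - means[d] for d in range(dims)] for f in frames]


def highpass_trajectories(frames, pole=0.9):
    """Filter each coefficient's time trajectory with a first-order high-pass:

        y[t] = x[t] - x[t-1] + pole * y[t-1]

    Illustrative filter only (assumed, not taken from the paper). The zero
    at modulation frequency 0 removes constant offsets; the pole controls
    how strongly slow variations are attenuated.
    """
    dims = len(frames[0])
    prev_x = list(frames[0])   # start from the first frame so y[0] is zero
    prev_y = [0.0] * dims
    out = []
    for f in frames:
        y = [f[d] - prev_x[d] + pole * prev_y[d] for d in range(dims)]
        out.append(y)
        prev_x, prev_y = list(f), y
    return out


if __name__ == "__main__":
    frames = [[1.0, 2.0], [3.0, 2.0], [5.0, 2.0]]
    print(cepstral_mean_subtraction(frames))
    print(highpass_trajectories(frames))
```

Both functions give the same output when a fixed offset is added to every frame, which is the sense in which they suppress slowly varying speaker or channel effects. The filter additionally reshapes the remaining low modulation frequencies, whereas mean subtraction leaves them untouched.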
In this paper, we investigate the performance of modulation related features and normalized spectral...
In a distant-talking environment, the length of channel impulse response is longer than the short-te...
Cepstral coefficients are widely used in speech recognition. In this paper, we claim that they are n...
Recently, the set of spectral parameters of every speech frame that result from filtering the freque...
In automatic speech recognition, the signal is usually represented by a set of time sequences of spe...
In this paper, two ways of obtaining more robust spectral parameters are explored. Firstly, a hybri...
The time sequences of speech parameters resulting from current short-time spectral estimators show a...
Speech dynamic features are routinely used in current speech recognition systems in combination with ...
Every speech recognition system requires a signal representation that parametrically models the tempo...
Speech recognition systems extract the textual data from the speech signal. The research in speech re...
Simple IIR or FIR filters have been widely used in isolated or connected word recognition tasks to f...
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-b...