Classification of speech phonemes is challenging, especially under noisy environments, and hence traditional speech recognition systems do not perform well in the presence of noise. Unlike traditional methods in which features are mostly extracted from the properties of the acoustic signal, this study proposes a new feature for phoneme classification using neural responses from a physiologically based computational model of the auditory periphery. The two-dimensional neurogram was constructed from the simulated responses of auditory-nerve fibres to speech phonemes. Features of neurogram images were extracted using the Discrete Radon Transform, and the dimensionality of features was reduced using an efficient feature selection technique. A s...
ObjectiveThe superior temporal gyrus (STG) and neighboring brain regions play a key role in human la...
<p>Schematic of the decoding of neural responses. For each auditory center, a decoder was trained to...
Shortcomings of automatic speech recognition (ASR) applications are becoming more evident as they ar...
Speaker identification under noisy conditions is one of the challenging topics in the field of speec...
Several filterbank-based metrics have been proposed to predict speech intelligibility (SI). However,...
In this work, a first approach to a robust phoneme recognition task by means of a biologically inspi...
A novel feature based on the simulated neural response of the auditory periphery was proposed in thi...
Behavioural and psychophysical measurements in audiology are currently a challenging and resource co...
Behavioural and psychophysical measurements in audiology are frequently a challenging and resource c...
Speaker identification is the mechanism of determining a person among a set of speakers to certify ...
Human-like performance by machines in tasks of speech and audio processing has remained an elusive g...
Automatic Speaker Identification (SID) is growing for the current demands of human-machine interacti...
This dataset comprises neural responses to a set of speech sounds from several brain regions. Audito...
It is well known that machines perform far worse than humans in recognizing speech and audio, especi...
This study proposes a new non-intrusive measure of speech quality, the neurogram speech quality meas...
ObjectiveThe superior temporal gyrus (STG) and neighboring brain regions play a key role in human la...
<p>Schematic of the decoding of neural responses. For each auditory center, a decoder was trained to...
Shortcomings of automatic speech recognition (ASR) applications are becoming more evident as they ar...
Speaker identification under noisy conditions is one of the challenging topics in the field of speec...
Several filterbank-based metrics have been proposed to predict speech intelligibility (SI). However,...
In this work, a first approach to a robust phoneme recognition task by means of a biologically inspi...
A novel feature based on the simulated neural response of the auditory periphery was proposed in thi...
Behavioural and psychophysical measurements in audiology are currently a challenging and resource co...
Behavioural and psychophysical measurements in audiology are frequently a challenging and resource c...
Speaker identification is the mechanism of determining a person among a set of speakers to certify ...
Human-like performance by machines in tasks of speech and audio processing has remained an elusive g...
Automatic Speaker Identification (SID) is growing for the current demands of human-machine interacti...
This dataset comprises neural responses to a set of speech sounds from several brain regions. Audito...
It is well known that machines perform far worse than humans in recognizing speech and audio, especi...
This study proposes a new non-intrusive measure of speech quality, the neurogram speech quality meas...
ObjectiveThe superior temporal gyrus (STG) and neighboring brain regions play a key role in human la...
<p>Schematic of the decoding of neural responses. For each auditory center, a decoder was trained to...
Shortcomings of automatic speech recognition (ASR) applications are becoming more evident as they ar...