The objective of this work is to study the suitability of existing spectral mapping methods for enhancement of throat microphone (TM) speech, and propose a more elegant method for spectral mapping. Gaussian mixture models (GMM) and neural networks (NN) have been used for spectral mapping. Though GMM-based mapping captures the variability among speech sounds through multiple mixtures, it can only provide a linear map between the source and the target. On the other hand, NN-based mapping is capable of providing a nonlinear map but a single mapping scheme may not handle variability across different speech sounds. Incorporating the advantages from these approaches, we propose a spectral mapping method using multiple neural networks. Speech data...
Neural network (NN) applications have recently been employed to extract the parameters of an artic-u...
International audienceThe NAM-to-speech conversion proposed by Toda and colleagues which converts No...
The time-frequency mask and the magnitude spectrum are two common targets for deep learning-based sp...
The objective of this work is to study the suitability of existing spectral mapping methods for enha...
Throat microphone is robust to the surrounding noise and can even pick up whispers; however, speech ...
Speech recorded from a throat microphone is robust to the surrounding noise, but sounds unnatural un...
In this paper, we use artificial neural networks (ANNs) for voice conversion and exploit the mapping...
In this paper, we propose a new statistical enhancement system for throat microphone recordings thro...
The objective of this work is to represent the information in the speech signal picked up by a throa...
Part of the Communications in Computer and Information Science book series (CCIS, volume 1087).In th...
Statistical speech reconstruction for larynx-related dysphonia has achieved good performance using G...
International audienceThe NAM-to-speech conversion proposed by Toda and colleagues which converts No...
International audienceIn this paper, we present a statistical method based on GMM modeling to map th...
In this paper, a voice conversion approach that combines two distinct ideas is pro-posed to improve ...
This paper presents two connectionist approaches to spectral mapping for speaker normalization. The ...
Neural network (NN) applications have recently been employed to extract the parameters of an artic-u...
International audienceThe NAM-to-speech conversion proposed by Toda and colleagues which converts No...
The time-frequency mask and the magnitude spectrum are two common targets for deep learning-based sp...
The objective of this work is to study the suitability of existing spectral mapping methods for enha...
Throat microphone is robust to the surrounding noise and can even pick up whispers; however, speech ...
Speech recorded from a throat microphone is robust to the surrounding noise, but sounds unnatural un...
In this paper, we use artificial neural networks (ANNs) for voice conversion and exploit the mapping...
In this paper, we propose a new statistical enhancement system for throat microphone recordings thro...
The objective of this work is to represent the information in the speech signal picked up by a throa...
Part of the Communications in Computer and Information Science book series (CCIS, volume 1087).In th...
Statistical speech reconstruction for larynx-related dysphonia has achieved good performance using G...
International audienceThe NAM-to-speech conversion proposed by Toda and colleagues which converts No...
International audienceIn this paper, we present a statistical method based on GMM modeling to map th...
In this paper, a voice conversion approach that combines two distinct ideas is pro-posed to improve ...
This paper presents two connectionist approaches to spectral mapping for speaker normalization. The ...
Neural network (NN) applications have recently been employed to extract the parameters of an artic-u...
International audienceThe NAM-to-speech conversion proposed by Toda and colleagues which converts No...
The time-frequency mask and the magnitude spectrum are two common targets for deep learning-based sp...