In this chapter, we introduce hybrid postfilters into speech synthesis, with the objective of enhancing the quality of the synthesized speech. Our approach combines a Wiener filter with deep neural networks. Several attempts to enhance synthetic speech have contemplated single-stage deep-learning-based postfilters, which learn to perform a mapping of the synthetic speech parameters to the natural ones. In the synthetic speech produced by statistical methods, we have measured low-level noise components, so the common single-stage postfilters must achieve the reduction of that component, as well as the complex relationship between the parameters of the synthetic and the natural speech. That is why we consider a two-stage approach: In th...
This work proposes a method of model-based speech enhancement that uses a network of HMMs to first ...
Traditional speech enhancement systems reduce noise by modifying the noisy signal to make it more li...
This work proposes a method of speech enhancement that uses a network of HMMs to first decode noisy ...
Statistical parametric speech synthesis based on Hidden Markov Models has been an important techniq...
Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-b...
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9703).Recent developments in...
Recent developments in speech synthesis have produced systems capable of producing speech which clos...
In this paper we propose a deep neural network to model the conditional probability of the spectral ...
Several researchers have contemplated deep learning-based post-filters to increase the quality of st...
Several researchers have contemplated deep learning-based post-filters to increase the quality of st...
Several attempts to enhance statistical parametric speech synthesis have contemplated deep-learning-...
Over the past several decades, numerous speech enhancement techniques have been proposed to improve ...
This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based spe...
We address the problem of speech enhancement in real-world noisy scenarios. We propose to solve the ...
Although Hidden Markov Model based speech synthesis has been proved to have good performance, there ...
This work proposes a method of model-based speech enhancement that uses a network of HMMs to first ...
Traditional speech enhancement systems reduce noise by modifying the noisy signal to make it more li...
This work proposes a method of speech enhancement that uses a network of HMMs to first decode noisy ...
Statistical parametric speech synthesis based on Hidden Markov Models has been an important techniq...
Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-b...
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9703).Recent developments in...
Recent developments in speech synthesis have produced systems capable of producing speech which clos...
In this paper we propose a deep neural network to model the conditional probability of the spectral ...
Several researchers have contemplated deep learning-based post-filters to increase the quality of st...
Several researchers have contemplated deep learning-based post-filters to increase the quality of st...
Several attempts to enhance statistical parametric speech synthesis have contemplated deep-learning-...
Over the past several decades, numerous speech enhancement techniques have been proposed to improve ...
This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based spe...
We address the problem of speech enhancement in real-world noisy scenarios. We propose to solve the ...
Although Hidden Markov Model based speech synthesis has been proved to have good performance, there ...
This work proposes a method of model-based speech enhancement that uses a network of HMMs to first ...
Traditional speech enhancement systems reduce noise by modifying the noisy signal to make it more li...
This work proposes a method of speech enhancement that uses a network of HMMs to first decode noisy ...