The aim of voice transformation is to change the style of a given utterance. Most voice transformation methods process speech signals in the time-frequency domain. When processing spectral information, conventional methods do not consider the relations between neighboring frames in the time domain. If unexpected modifications occur, discontinuities arise between frames, which degrade the quality of the transformed speech. This paper proposes a new model of the temporal structure of speech that ensures the smoothness of the transformed speech and thereby improves its quality. In our work, we propose an improvement of the temporal decomposition (TD) technique, which decomposes a speech signal into ev...
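As a rough illustration of why a TD-style representation avoids frame-to-frame discontinuities, the sketch below rebuilds a parameter trajectory as a weighted sum of a few target vectors, with weights given by overlapping interpolation functions. It assumes the commonly used TD form y(n) ≈ Σ_k a_k φ_k(n). The piecewise-linear shape of the functions, the names `event_functions` and `reconstruct`, and the toy values are all assumptions made for illustration and are not taken from the paper.

```python
def event_functions(centers, n_frames):
    """Piecewise-linear interpolation functions (an illustrative assumption).

    Row k peaks at frame centers[k] and overlaps only with its neighbours,
    so at every frame the weights sum to 1.
    """
    phi = [[0.0] * n_frames for _ in centers]
    for n in range(n_frames):
        if n <= centers[0]:
            phi[0][n] = 1.0
        elif n >= centers[-1]:
            phi[-1][n] = 1.0
        else:
            for k in range(len(centers) - 1):
                left, right = centers[k], centers[k + 1]
                if left <= n < right:
                    w = (n - left) / (right - left)
                    phi[k][n] = 1.0 - w      # fading out of target k
                    phi[k + 1][n] = w        # fading in to target k + 1
                    break
    return phi


def reconstruct(targets, phi):
    """Compute y_hat[n][i] = sum_k targets[k][i] * phi[k][n]."""
    n_frames = len(phi[0])
    dim = len(targets[0])
    return [
        [sum(targets[k][i] * phi[k][n] for k in range(len(targets)))
         for i in range(dim)]
        for n in range(n_frames)
    ]


# Toy data (hypothetical): three 2-dimensional targets over nine frames.
centers = [0, 4, 8]
targets = [[1.0, 2.0], [3.0, 0.0], [2.0, 4.0]]
phi = event_functions(centers, 9)
frames = reconstruct(targets, phi)
```

Because every frame is a blend of at most two neighbouring targets, changing a target moves the whole trajectory around it gradually rather than introducing a jump at a single frame.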
Temporal decomposition (TD) is an effective technique to compress the spectral information of speech...
This paper proposes a novel algorithm for temporal decomposition (TD) of speech, called `Limited Err...
Temporal decomposition of a speech utterance results in a description of speech parameters in terms ...
The challenge of speech modification is to flexibly modify the speech without degrading speech quali...
The challenge of speech modification is to flexibly modify the speech without degrading speech quali...
In state-of-the-art voice conversion systems, GMM-based voice conversion methods are regarded as som...
Manipulating spectral structure often leads to degradation of speech quality, which is mainly due to...
In most state-of-the-art voice gender conversion systems, the converted speech still sounds unnatura...
In this paper a new approach to Temporal Decomposition (TD) of speech, called Spectral Stability Bas...
A method for decomposing speech into a small number of temporally overlapping events has been propos...
In the paper, a novel approach that efficiently extracts the temporal information of speech has been ...
This paper presents a method of temporal decomposition (TD) for line spectral frequency (LSF) parame...
Recently, the speaker normalization technique VTLN (vocal tract length normalization), known from s...
This paper presents methods for independently modifying the time and pitch scale of acoustic signals...
Time-scale modification (TSM) is a process whereby signals are compressed or expanded in time in a m...