Voice Transformation (VT) aims at modifying some components of a voice signal while retaining other components. This thesis proposes the use of a novel, neural encoder-decoder (NED) framework for VT, which offers maximum advantage of preserving invariant content information, while a bank of encoders is used to extract modifiable facets of a voice signal. The decoder aggregates both invariant and modified components of speech to regenerate the signal and in doing so accomplishes the objective of VT. This framework is shown to be the first unified framework able to conduct various types of VT tasks and differs from previous approaches in offering the following advantages: (1) it is highly flexible and extensible in system structure; (2) it is...
Ph.D.Recent studies on the deep neural network reveal its ability to extract abstract semantics from...
Ph.D.Over the past a few years, the computer vision community has witnessed great success achieved i...
[[abstract]] 台灣的廟會是許多民眾藉以聚集交際的重要活動,目的不僅是為熱鬧氣氛,其中民俗藝陣的內容也可以同時謝神和娛人;然而,隨著現代人的娛樂方式及生活型態之轉變,在講求熱鬧及節省經費為...
Ph.D.Natural Language Understanding (NLU) is focusing on enabling machine to understand and process ...
Despite the rapid progress of automatic speech recognition (ASR) technologies in the past few decade...
Internet-of-things (IoT) devices powered by acoustic processing systems currently have a large share...
Powerful video representations serve as the foundation for many video understanding tasks, such as a...
"Brief introduction to the natural world language(Predicate meaning language)" The natural world la...
Acoustic scene classification (ASC) aims to identify the type of scene (environment) in which a give...
Image restoration is an extensively studied topic that aims at estimating the clear image from a cor...
Ph.D.Speaker verification, which uses the speaker's unique voice to verify the identify, is an impor...
Ph.D.This thesis mainly investigates the use of posteriorgram-to-acoustic modeling forunconstrained ...
Ph.D.Sequence learning aims to process sequential data such as text, speech, and video, and discover...
Deep learning in visual understanding and editing tasks has witnessed great success in recent years,...
The field of photonic integrated circuits is gaining significant momentum because it allows cost-eff...
Ph.D.Recent studies on the deep neural network reveal its ability to extract abstract semantics from...
Ph.D.Over the past a few years, the computer vision community has witnessed great success achieved i...
[[abstract]] 台灣的廟會是許多民眾藉以聚集交際的重要活動,目的不僅是為熱鬧氣氛,其中民俗藝陣的內容也可以同時謝神和娛人;然而,隨著現代人的娛樂方式及生活型態之轉變,在講求熱鬧及節省經費為...
Ph.D.Natural Language Understanding (NLU) is focusing on enabling machine to understand and process ...
Despite the rapid progress of automatic speech recognition (ASR) technologies in the past few decade...
Internet-of-things (IoT) devices powered by acoustic processing systems currently have a large share...
Powerful video representations serve as the foundation for many video understanding tasks, such as a...
"Brief introduction to the natural world language(Predicate meaning language)" The natural world la...
Acoustic scene classification (ASC) aims to identify the type of scene (environment) in which a give...
Image restoration is an extensively studied topic that aims at estimating the clear image from a cor...
Ph.D.Speaker verification, which uses the speaker's unique voice to verify the identify, is an impor...
Ph.D.This thesis mainly investigates the use of posteriorgram-to-acoustic modeling forunconstrained ...
Ph.D.Sequence learning aims to process sequential data such as text, speech, and video, and discover...
Deep learning in visual understanding and editing tasks has witnessed great success in recent years,...
The field of photonic integrated circuits is gaining significant momentum because it allows cost-eff...
Ph.D.Recent studies on the deep neural network reveal its ability to extract abstract semantics from...
Ph.D.Over the past a few years, the computer vision community has witnessed great success achieved i...
[[abstract]] 台灣的廟會是許多民眾藉以聚集交際的重要活動,目的不僅是為熱鬧氣氛,其中民俗藝陣的內容也可以同時謝神和娛人;然而,隨著現代人的娛樂方式及生活型態之轉變,在講求熱鬧及節省經費為...