Ph.D. This thesis mainly investigates the use of posteriorgram-to-acoustic modeling for unconstrained Voice Conversion (VC) with deep learning. Many existing systems are limited by constraints such as the requirement of parallel data and the requirement of large amounts of language-dependent training data, which also affect speech quality and speaker similarity. Therefore, this thesis concentrates on two goals that alleviate these constraints: 1) improving speech naturalness and speaker similarity of the converted speech; 2) improving practicality and flexibility of VC systems. For VC with parallel data, many previous efforts (e.g. GMMs, DNNs) model the relationship between the paired non-target speech and target speech at the frame-based level direc...
Ph.D. 3D point clouds are standard outputs of 3D scanning devices and depth sensors. Due to the popul...
Novel motor task learning by one hand unilaterally results in an auto-gain of performance in the unt...
Deep learning in visual understanding and editing tasks has witnessed great success in recent years,...
Understanding of single-modality media, including audio, visual, and language, has achieved great succ...
Ph.D. In this thesis, DNN acoustic model adaptation is investigated. Performance of automatic speech ...
Voice Transformation (VT) aims at modifying some components of a voice signal while retaining other ...
M.Phil. Human action understanding from videos has been an important task in computer vision. Compare...
Ph.D. Interference suppression and time control/estimation are two critical problems in wireless comm...
M.Phil. Evidence shows that the systems of speech perception and production are intrinsically linked ...
Massive MIMO has drawn significant attention from researchers in recent years. Due to the huge number ...
Ph.D. Over the past few years, the computer vision community has witnessed great success achieved i...
Ph.D. This thesis proposes an end-to-end neural framework for expressive text-to-speech (E-TTS) synth...
Ph.D. This dissertation investigates the morphosyntax-prosody interactions in Fuzhou. It looks into t...
Object-level video understanding is an important task in computer vision that provides pixel-level u...
Image registration, the process of finding meaningful correspondences between two or multiple ima...