This work addresses the problem of mapping fixed representations (embeddings) of a speech signal to face embeddings and then generating a face from the mapped embedding using a generative adversarial network (GAN) that was trained for face generation. GANs are a type of neural networks that can generate data similar to the data they were trained on. The architecture of the proposed system is based on four components: a face embedding extractor, a voice embedding extractor, an algorithm on top of a GAN that can generate a face from a face embedding, and my mapping network used to map a voice embedding to a face embedding. The pre-trained neural networks FaceNet and SpeechBrain are adopted as embedding extractors. A model that uses a pre-trai...
International audienceRecently, Deep Neural Networks (DNNs) have become a central subject of discuss...
Face recognition has become a widely adopted biometric in forensics, security and law enforcement th...
The main goal of this thesis is to implement and compare models based on various architectures of co...
Speech is a rich biometric signal that contains information about the identity, gender and emotional...
Speech is a rich biometric signal that contains information about the identity, gender and emotional...
Tato práce řeší problém mapování fixních reprezentací (embeddingů) řečového signálu na embeddingy ob...
Speech is a rich biometric signal that contains information about the identity, gender and emotional...
Generative adversarial networks (GANs) synthesize realistic samples (image, audio, video, etc.) from...
In this paper, we present FaceTuneGAN, a new 3D face model representation decomposing and encoding s...
Creating visuals from words may appear to be a complex process, but it is achievable with today’s te...
Synthesis of face images by translating facial attributes is an important problem in computer vision...
StarGAN realizes image conversion among multiple domain image, but the combined and coordinated acti...
Generating synthesized images, being able to animate or transform them somehow, has lately been expe...
The objective of this paper is a neural network model that controls the pose and expression of a giv...
There is high demand of realistic facial expression in current computer graphics and multimedia rese...
International audienceRecently, Deep Neural Networks (DNNs) have become a central subject of discuss...
Face recognition has become a widely adopted biometric in forensics, security and law enforcement th...
The main goal of this thesis is to implement and compare models based on various architectures of co...
Speech is a rich biometric signal that contains information about the identity, gender and emotional...
Speech is a rich biometric signal that contains information about the identity, gender and emotional...
Tato práce řeší problém mapování fixních reprezentací (embeddingů) řečového signálu na embeddingy ob...
Speech is a rich biometric signal that contains information about the identity, gender and emotional...
Generative adversarial networks (GANs) synthesize realistic samples (image, audio, video, etc.) from...
In this paper, we present FaceTuneGAN, a new 3D face model representation decomposing and encoding s...
Creating visuals from words may appear to be a complex process, but it is achievable with today’s te...
Synthesis of face images by translating facial attributes is an important problem in computer vision...
StarGAN realizes image conversion among multiple domain image, but the combined and coordinated acti...
Generating synthesized images, being able to animate or transform them somehow, has lately been expe...
The objective of this paper is a neural network model that controls the pose and expression of a giv...
There is high demand of realistic facial expression in current computer graphics and multimedia rese...
International audienceRecently, Deep Neural Networks (DNNs) have become a central subject of discuss...
Face recognition has become a widely adopted biometric in forensics, security and law enforcement th...
The main goal of this thesis is to implement and compare models based on various architectures of co...