While language-guided image manipulation has made remarkable progress, the challenge of how to instruct the manipulation process faithfully reflecting human intentions persists. An accurate and comprehensive description of a manipulation task using natural language is laborious and sometimes even impossible, primarily due to the inherent uncertainty and ambiguity present in linguistic expressions. Is it feasible to accomplish image manipulation without resorting to external cross-modal language information? If this possibility exists, the inherent modality gap would be effortlessly eliminated. In this paper, we propose a novel manipulation methodology, dubbed ImageBrush, that learns visual instructions for more accurate image editing. Our k...
In this work, we propose TediGAN, a novel framework for multi-modal image generation and manipulati...
This paper studies whether a perceptual visual system can simulate human-like cognitive capabilities...
We introduce DialogPaint, a novel framework that bridges conversational interactions with image edit...
Recently, text-guided image manipulation has received increasing attention in the research field of ...
Procedural tasks such as following a recipe or editing an image are very common. They require a pers...
Generic image inpainting aims to complete a corrupted image by borrowing surrounding information, wh...
Common types of image editing methods focus on low-level characteristics. In this thesis, I leverage...
Recent large-scale text-driven synthesis models have attracted much attention thanks to their remark...
At present, text-guided image manipulation is a notable subject of study in the vision and language ...
Image manipulation has attracted a lot of interest due to its wide range of applications. Prior work...
Text-guided image manipulation tasks have recently gained attention in the vision-and-language commu...
Humans are avid consumers of visual content. Every day, people watch videos, play digital games and ...
Language is such a powerful representation for capturing the knowledge and information about our wor...
Based on an ongoing attempt to integrate Natural Language instructions with human figure animation, ...
Text-driven image editing aims to manipulate images with the guidance of natural language descriptio...
In this work, we propose TediGAN, a novel framework for multi-modal image generation and manipulati...
This paper studies whether a perceptual visual system can simulate human-like cognitive capabilities...
We introduce DialogPaint, a novel framework that bridges conversational interactions with image edit...
Recently, text-guided image manipulation has received increasing attention in the research field of ...
Procedural tasks such as following a recipe or editing an image are very common. They require a pers...
Generic image inpainting aims to complete a corrupted image by borrowing surrounding information, wh...
Common types of image editing methods focus on low-level characteristics. In this thesis, I leverage...
Recent large-scale text-driven synthesis models have attracted much attention thanks to their remark...
At present, text-guided image manipulation is a notable subject of study in the vision and language ...
Image manipulation has attracted a lot of interest due to its wide range of applications. Prior work...
Text-guided image manipulation tasks have recently gained attention in the vision-and-language commu...
Humans are avid consumers of visual content. Every day, people watch videos, play digital games and ...
Language is such a powerful representation for capturing the knowledge and information about our wor...
Based on an ongoing attempt to integrate Natural Language instructions with human figure animation, ...
Text-driven image editing aims to manipulate images with the guidance of natural language descriptio...
In this work, we propose TediGAN, a novel framework for multi-modal image generation and manipulati...
This paper studies whether a perceptual visual system can simulate human-like cognitive capabilities...
We introduce DialogPaint, a novel framework that bridges conversational interactions with image edit...