We offer a method for one-shot mask-guided image synthesis that allows controlling manipulations of a single image by inverting a quasi-robust classifier equipped with strong regularizers. Our proposed method, entitled MAGIC, leverages structured gradients from a pre-trained quasi-robust classifier to better preserve the input semantics while preserving its classification accuracy, thereby guaranteeing credibility in the synthesis. Unlike current methods that use complex primitives to supervise the process or use attention maps as a weak supervisory signal, MAGIC aggregates gradients over the input, driven by a guide binary mask that enforces a strong, spatial prior. MAGIC implements a series of manipulations with a single framework achievi...
Recent advances in Generative Adversarial Networks (GANs) have shown great progress on a large varie...
Text-to-image is a process of generating an image from the input text. It has a variety of applicati...
Humans are avid consumers of visual content. Every day, people watch videos, play digital games and ...
© 2019 Neural information processing systems foundation. All rights reserved. We show that the basic...
The interest of the machine learning community in image synthesis has grown significantly in recent ...
Image synthesis aims to generate realistic and high-fidelity images automatically. It has attracted ...
Controllable image synthesis models allow creation of diverse images based on text instructions or g...
Patch-based synthesis is a powerful framework for nu-merous image and video editing applications suc...
The tremendous success of neural networks is clouded by the existence of adversarial examples: malic...
It has been witnessed that masked image modeling (MIM) has shown a huge potential in self-supervised...
Deep learning has made great progress in solving many computer vision tasks for which labeled data i...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Despite the fact that complex visual scenes contain multiple, overlapping objects, people perform ob...
Image synthesis is an important problem in computer vision and has many applications, such as comput...
Recent advances in deep generative models have enabled computers to imagine and generate fictional i...
Recent advances in Generative Adversarial Networks (GANs) have shown great progress on a large varie...
Text-to-image is a process of generating an image from the input text. It has a variety of applicati...
Humans are avid consumers of visual content. Every day, people watch videos, play digital games and ...
© 2019 Neural information processing systems foundation. All rights reserved. We show that the basic...
The interest of the machine learning community in image synthesis has grown significantly in recent ...
Image synthesis aims to generate realistic and high-fidelity images automatically. It has attracted ...
Controllable image synthesis models allow creation of diverse images based on text instructions or g...
Patch-based synthesis is a powerful framework for nu-merous image and video editing applications suc...
The tremendous success of neural networks is clouded by the existence of adversarial examples: malic...
It has been witnessed that masked image modeling (MIM) has shown a huge potential in self-supervised...
Deep learning has made great progress in solving many computer vision tasks for which labeled data i...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Despite the fact that complex visual scenes contain multiple, overlapping objects, people perform ob...
Image synthesis is an important problem in computer vision and has many applications, such as comput...
Recent advances in deep generative models have enabled computers to imagine and generate fictional i...
Recent advances in Generative Adversarial Networks (GANs) have shown great progress on a large varie...
Text-to-image is a process of generating an image from the input text. It has a variety of applicati...
Humans are avid consumers of visual content. Every day, people watch videos, play digital games and ...