Driven by successes in deep learning, computer vision research has begun to move beyond object detection and image classification to more sophisticated tasks like image captioning or visual question answering. Motivating such endeavors is the desire for models to capture not only objects present in an image, but more fine-grained aspects of a scene such as relationships between objects and their attributes. Scene graphs provide a formal construct for capturing these aspects of an image. Despite this, there have been only a few recent efforts to generate scene graphs from imagery. Previous works limit themselves to settings where bounding box information is available at train time and do not attempt to generate scene graphs with attributes. ...
We live in a world made up of different objects, people, and environments interacting with each othe...
Generating images from semantic visual knowledge is a challenging task, that can be useful to condit...
Recently, several deep learning models are proposed that operate on graph-structured data. These mod...
Deep learning techniques have led to remarkable breakthroughs in the field of generic object detecti...
Scene graph generation has received growing attention with the advancements in image understanding t...
An image contains a lot of information, and that information can be used in high-level complex syste...
We propose a new algorithm, called Deep Generative Probabilistic Graph Neural Networks (DG-PGNN), to...
Scene graph parsing aims at understanding an image as a graph where vertices are visual objects (pot...
In this paper, we study graph-to-image generation conditioned exclusively on scene graphs, in which ...
Many top-performing image captioning models rely solely on object features computed with an object d...
The visual world we sense, interpret and interact everyday is a complex composition of interleaved p...
In this paper, we address the task of semantic-guided scene generation. One open challenge widely ob...
Advancements on text-to-image synthesis generate remarkable images from textual descriptions. Howeve...
In this thesis, we propose novel deep learning algorithms for the vision and language tasks, includi...
Conceptual representations of images involving descriptions of entities and their relations are ofte...
We live in a world made up of different objects, people, and environments interacting with each othe...
Generating images from semantic visual knowledge is a challenging task, that can be useful to condit...
Recently, several deep learning models are proposed that operate on graph-structured data. These mod...
Deep learning techniques have led to remarkable breakthroughs in the field of generic object detecti...
Scene graph generation has received growing attention with the advancements in image understanding t...
An image contains a lot of information, and that information can be used in high-level complex syste...
We propose a new algorithm, called Deep Generative Probabilistic Graph Neural Networks (DG-PGNN), to...
Scene graph parsing aims at understanding an image as a graph where vertices are visual objects (pot...
In this paper, we study graph-to-image generation conditioned exclusively on scene graphs, in which ...
Many top-performing image captioning models rely solely on object features computed with an object d...
The visual world we sense, interpret and interact everyday is a complex composition of interleaved p...
In this paper, we address the task of semantic-guided scene generation. One open challenge widely ob...
Advancements on text-to-image synthesis generate remarkable images from textual descriptions. Howeve...
In this thesis, we propose novel deep learning algorithms for the vision and language tasks, includi...
Conceptual representations of images involving descriptions of entities and their relations are ofte...
We live in a world made up of different objects, people, and environments interacting with each othe...
Generating images from semantic visual knowledge is a challenging task, that can be useful to condit...
Recently, several deep learning models are proposed that operate on graph-structured data. These mod...