Structured representations of images that model visual relationships are beneficial for many vision and vision-language applications. However, current human-annotated visual relationship datasets suffer from a long-tailed predicate distribution, which limits the potential of visual relationship models. In this work, we introduce a self-supervised method that implicitly learns visual relationships without relying on any ground-truth visual relationship annotations. Our method relies on 1) intra- and inter-modality encodings, which model relationships within each modality and across modalities, respectively, and 2) relationship probing, which seeks to discover the graph structure within each modality. By leveraging masked language mo...
Inferring the interactions between objects, a.k.a. visual relationship detection, is a crucial point ...
A thorough comprehension of image content demands a complex grasp of the inter...
The aim of visual relation detection is to provide a comprehensive understanding of an image by desc...
Visual Relationship Detection (VRD) aims to understand real-world objects' interactions by grounding...
In the research area of computer vision and artificial intelligence, learning the relationships of o...
Visual relationship detection is fundamental for holistic image understanding. However, the localiza...
Visual relationship detection is fundamental for holistic image understanding. However, the localiza...
Visual relationship detection aims to describe the interactions between pairs of objects. Different ...
Reasoning about the relationships between object pairs in images is a crucial task for holistic scen...
This paper introduces a novel approach for modeling visual relations between p...
Significant effort has been recently devoted to modelling visual relations. This has mostly addresse...
Visual Relationship Detection (VRD) is a relatively young research area, where the goal is to develo...
Understanding an image goes beyond recognizing and locating the objects in it, the relationships bet...
Inferring the interactions between objects, a.k.a. visual relationship detection, is a crucial point ...
Visual relationship detection aims to completely understand visual scenes and has recently received ...