Significant effort has been recently devoted to modelling visual relations. This has mostly addressed the design of architectures, typically by adding parameters and increasing model complexity. However, visual relation learning is a long-tailed problem, due to the combinatorial nature of joint reasoning about groups of objects. Increasing model complexity is, in general, ill-suited for long-tailed problems due to their tendency to over-fit. In this thesis, we explore an alternative hypothesis, denoted the Devil is in the Tails. Under this hypothesis, better performance is achieved by keeping the model simple but improving its ability to cope with long-tailed distributions. To test this hypothesis, we devise a new approach for training vi...
In the research area of computer vision and artificial intelligence, learning the relationships of o...
Visual relationship detection aims to capture interactions between pairs of objects in images. Relat...
Visual relationship detection aims to capture interactions between pairs of objects in images. Relat...
Significant effort has been recently devoted to modelling visual relations. This has mostly addresse...
The aim of visual relation detection is to provide a comprehensive understanding of an image by desc...
The aim of visual relation detection is to provide a comprehensive understanding of an image by desc...
Structured representations of images that model visual relationships are beneficial for many vision ...
Visual relationship detection aims to describe the interactions between pairs of objects. Different ...
International audienceThis paper introduces a novel approach for modeling visual relations between p...
Visual relationship detection is fundamental for holistic image understanding. However, the localiza...
Visual relationship detection is fundamental for holistic image understanding. However, the localiza...
Visual Relationship Detection (VRD) aims to understand real-world objects' interactions by grounding...
Large scale visual understanding is challenging, as it requires a model to handle the widely-spread ...
International audienceA thorough comprehension of image content demands a complex grasp of the inter...
Long-tailed relation classification is a challenging problem as the head classes may dominate the tr...
In the research area of computer vision and artificial intelligence, learning the relationships of o...
Visual relationship detection aims to capture interactions between pairs of objects in images. Relat...
Visual relationship detection aims to capture interactions between pairs of objects in images. Relat...
Significant effort has been recently devoted to modelling visual relations. This has mostly addresse...
The aim of visual relation detection is to provide a comprehensive understanding of an image by desc...
The aim of visual relation detection is to provide a comprehensive understanding of an image by desc...
Structured representations of images that model visual relationships are beneficial for many vision ...
Visual relationship detection aims to describe the interactions between pairs of objects. Different ...
International audienceThis paper introduces a novel approach for modeling visual relations between p...
Visual relationship detection is fundamental for holistic image understanding. However, the localiza...
Visual relationship detection is fundamental for holistic image understanding. However, the localiza...
Visual Relationship Detection (VRD) aims to understand real-world objects' interactions by grounding...
Large scale visual understanding is challenging, as it requires a model to handle the widely-spread ...
International audienceA thorough comprehension of image content demands a complex grasp of the inter...
Long-tailed relation classification is a challenging problem as the head classes may dominate the tr...
In the research area of computer vision and artificial intelligence, learning the relationships of o...
Visual relationship detection aims to capture interactions between pairs of objects in images. Relat...
Visual relationship detection aims to capture interactions between pairs of objects in images. Relat...