Humans continue to outperform modern AI systems in their ability to parse and understand complex visual scenes flexibly. Attention and memory are two systems known to play a critical role in our ability to selectively maintain and manipulate behaviorally-relevant visual information to solve some of the most challenging visual reasoning tasks. Here, we present a novel architecture for visual reasoning inspired by the cognitive-science literature on visual reasoning, the Memory- and Attention-based (visual) REasOning (MAREO) architecture. MAREO instantiates an active-vision theory, which posits that the brain solves complex visual reasoning problems compositionally by learning to combine previously-learned elementary visual operations to form...
In this paper, we describe the attention mechanisms in CHREST, a computational architecture of human...
Recent brain imaging studies have provided evidence that the parietal cortex plays a key role in rea...
Deep neural networks learn representations of data to facilitate problem-solving in their respective...
Humans continue to outperform modern AI systems in their ability to parse and understand complex vis...
Visual understanding requires comprehending complex visual relations between objects within a scene....
International audienceA fundamental component of human vision is our ability to parse complex visual...
International audienceAchieving artificial visual reasoning — the ability to answer image-related qu...
Many tasks that are easy for humans are difficult for machines. Particularly, while humans excel at ...
Advances in machine learning have generated increasing enthusiasm for tasks that require high-level ...
Recognising actions and objects from video material has attracted growing research attention and giv...
Raven’s Progressive Matrices (RPMs) have been widely used to evaluate the visual reasoning ability o...
AbstractVisual cognition, high-level vision, mid-level vision and top-down processing all refer to d...
Understanding images requires rich background knowledge that is not often written down and hard for ...
Bauckhage C, Wachsmuth S, Hanheide M, et al. The visual active memory perspective on integrated reco...
This thesis addresses the problem of creating computer vision systems that will facilitate high-leve...
In this paper, we describe the attention mechanisms in CHREST, a computational architecture of human...
Recent brain imaging studies have provided evidence that the parietal cortex plays a key role in rea...
Deep neural networks learn representations of data to facilitate problem-solving in their respective...
Humans continue to outperform modern AI systems in their ability to parse and understand complex vis...
Visual understanding requires comprehending complex visual relations between objects within a scene....
International audienceA fundamental component of human vision is our ability to parse complex visual...
International audienceAchieving artificial visual reasoning — the ability to answer image-related qu...
Many tasks that are easy for humans are difficult for machines. Particularly, while humans excel at ...
Advances in machine learning have generated increasing enthusiasm for tasks that require high-level ...
Recognising actions and objects from video material has attracted growing research attention and giv...
Raven’s Progressive Matrices (RPMs) have been widely used to evaluate the visual reasoning ability o...
AbstractVisual cognition, high-level vision, mid-level vision and top-down processing all refer to d...
Understanding images requires rich background knowledge that is not often written down and hard for ...
Bauckhage C, Wachsmuth S, Hanheide M, et al. The visual active memory perspective on integrated reco...
This thesis addresses the problem of creating computer vision systems that will facilitate high-leve...
In this paper, we describe the attention mechanisms in CHREST, a computational architecture of human...
Recent brain imaging studies have provided evidence that the parietal cortex plays a key role in rea...
Deep neural networks learn representations of data to facilitate problem-solving in their respective...