Referring expressions comprehension is the task of locating the image region described by a natural language expression, which refer to the properties of the region or the relationships with other regions. Most previous work handles this problem by selecting the most relevant regions from a set of candidate regions, when there are many candidate regions in the set these methods are inefficient. Inspired by recent success of image captioning by using deep learning methods, in this paper we proposed a framework to understand the referring expressions by multiple steps of reasoning. We present a model for referring expressions comprehension by selecting the most relevant region directly from the image. The core of our model is a recurrent atte...
Almost all natural language generation (NLG) systems are faced with the problem of the generation of...
Referring Expression Comprehension (REC) is one of the most important tasks in visual reasoning that...
We present a neural network model of referent identification in a preferential looking task. The inp...
Different from universal object detection, referring expression comprehension (REC) aims to locate s...
We introduce GroundNet, a neural network for referring expression recognition---the task of localizi...
International audienceWe propose ``Areas of Attention'', a novel attention-based model for automatic...
Visual Relational Reasoning is crucial for many vision-and-language based tasks, such as Visual Ques...
Recently, attention mechanism has been successfully applied in image captioning, but the existing at...
Zarrieß S, Schlangen D. Decoding Strategies for Neural Referring Expression Generation. In: Proceed...
Referring expression comprehension aims at grounding the object in an image referred to by the expre...
Visual attention mechanism has been widely used by image captioning model in order to dynamically at...
Referring expression is a kind of language expression being used for referring to particular objects...
Traditional computational approaches to referring expression generation operate in a deliberate mann...
Referring generation expression is a natural language processing task that involves creating noun ph...
International audienceThe generation of referring expressions is one of the most extensively explore...
Almost all natural language generation (NLG) systems are faced with the problem of the generation of...
Referring Expression Comprehension (REC) is one of the most important tasks in visual reasoning that...
We present a neural network model of referent identification in a preferential looking task. The inp...
Different from universal object detection, referring expression comprehension (REC) aims to locate s...
We introduce GroundNet, a neural network for referring expression recognition---the task of localizi...
International audienceWe propose ``Areas of Attention'', a novel attention-based model for automatic...
Visual Relational Reasoning is crucial for many vision-and-language based tasks, such as Visual Ques...
Recently, attention mechanism has been successfully applied in image captioning, but the existing at...
Zarrieß S, Schlangen D. Decoding Strategies for Neural Referring Expression Generation. In: Proceed...
Referring expression comprehension aims at grounding the object in an image referred to by the expre...
Visual attention mechanism has been widely used by image captioning model in order to dynamically at...
Referring expression is a kind of language expression being used for referring to particular objects...
Traditional computational approaches to referring expression generation operate in a deliberate mann...
Referring generation expression is a natural language processing task that involves creating noun ph...
International audienceThe generation of referring expressions is one of the most extensively explore...
Almost all natural language generation (NLG) systems are faced with the problem of the generation of...
Referring Expression Comprehension (REC) is one of the most important tasks in visual reasoning that...
We present a neural network model of referent identification in a preferential looking task. The inp...