Recently, the Visual Question Answering (VQA) task has gained increasing attention in artificial intelligence. Existing VQA methods mainly adopt the visual attention mechanism to associate the input question with corresponding image regions for effective question answering. The free-form region based and the detection-based visual attention mechanisms are mostly investigated, with the former ones attending free-form image regions and the latter ones attending pre-specified detection-box regions. We argue that the two attention mechanisms are able to provide complementary information and should be effectively integrated to better solve the VQA problem. In this paper, we propose a novel deep neural network for VQA that integrates both attenti...
The alignment of information between the image and the question is of great significance in the visu...
The quantity of images that populate the Internet is dramatically increasing. It becomes of critical...
Visual Question Answering~(VQA) requires a simultaneous understanding of images and questions. Exist...
We propose a novel attention based deep learning ar-chitecture for visual question answering task (V...
© 2018 IEEE. Visual question answering (VQA) is challenging, because it requires a simultaneous unde...
© 2017 IEEE. Visual question answering (VQA) is challenging because it requires a simultaneous under...
Visual Question Answering (VQA) is a recently proposed multimodal task in the general area of machin...
This paper proposes to improve visual question answering (VQA) with structured representations of bo...
Visual Question Answering (VQA) is a task for evaluating image scene understanding abilities and sho...
CVPR2019 accepted paperInternational audienceMultimodal attentional networks are currently state-of-...
We propose a method for visual question answering which combines an internal representation of the c...
This paper focuses on answering fill-in-the-blank style multiple choice questions from the Visual M...
The task of Visual Question Answering (VQA) has emerged in recent years for its potential applicatio...
Computer Vision is a scientific discipline which involves the development of an algorithmic basis fo...
Visual Question Answering (VQA) raises a great challenge for computer vision and natural language pr...
The alignment of information between the image and the question is of great significance in the visu...
The quantity of images that populate the Internet is dramatically increasing. It becomes of critical...
Visual Question Answering~(VQA) requires a simultaneous understanding of images and questions. Exist...
We propose a novel attention based deep learning ar-chitecture for visual question answering task (V...
© 2018 IEEE. Visual question answering (VQA) is challenging, because it requires a simultaneous unde...
© 2017 IEEE. Visual question answering (VQA) is challenging because it requires a simultaneous under...
Visual Question Answering (VQA) is a recently proposed multimodal task in the general area of machin...
This paper proposes to improve visual question answering (VQA) with structured representations of bo...
Visual Question Answering (VQA) is a task for evaluating image scene understanding abilities and sho...
CVPR2019 accepted paperInternational audienceMultimodal attentional networks are currently state-of-...
We propose a method for visual question answering which combines an internal representation of the c...
This paper focuses on answering fill-in-the-blank style multiple choice questions from the Visual M...
The task of Visual Question Answering (VQA) has emerged in recent years for its potential applicatio...
Computer Vision is a scientific discipline which involves the development of an algorithmic basis fo...
Visual Question Answering (VQA) raises a great challenge for computer vision and natural language pr...
The alignment of information between the image and the question is of great significance in the visu...
The quantity of images that populate the Internet is dramatically increasing. It becomes of critical...
Visual Question Answering~(VQA) requires a simultaneous understanding of images and questions. Exist...