Image captioning generates written descriptions of an image. In recent image captioning research, attention regions seldom cover all objects, and generated captions may lack the details of objects and may remain far from reality. In this paper, we propose a word guided attention (WGA) method for image captioning. First, WGA extracts word information using the embedded word and memory cell by applying transformation and multiplication. Then, WGA applies word information to the attention results and obtains the attended feature vectors via elementwise multiplication. Finally, we apply WGA with the words from different time steps to obtain previous word guided attention (PW) and current word attention (CW) in the decoder. Experiments on the MS...
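The three steps in the abstract can be illustrated with a small NumPy sketch. The abstract does not give the exact equations, so everything concrete here is an assumption: the sigmoid and tanh nonlinearities, the additive soft attention used to produce the "attention results", the parameter names (`W_e`, `W_m`, `W_v`, `W_h`, `w_a`), and all dimensions are hypothetical choices made only to get a runnable example, not the authors' formulation.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()


def word_guided_attention(regions, hidden, word_emb, memory, params):
    """Hypothetical WGA step.

    regions:  (k, d) image region features
    hidden:   (h,)   decoder hidden state
    word_emb: (e,)   embedded word (previous word for PW, current word for CW)
    memory:   (h,)   decoder memory cell
    """
    # Step 1 (assumed form): transform the embedded word and the memory cell,
    # then multiply them to obtain the word information vector, shape (d,).
    word_info = sigmoid(params["W_e"] @ word_emb) * np.tanh(params["W_m"] @ memory)

    # Assumed standard additive soft attention over the k regions.
    scores = np.tanh(regions @ params["W_v"] + hidden @ params["W_h"]) @ params["w_a"]
    weights = softmax(scores)      # (k,), sums to 1
    attended = weights @ regions   # (d,)

    # Step 2: apply the word information to the attention result
    # via elementwise multiplication.
    return word_info * attended, weights


# Toy dimensions and random parameters, purely for illustration.
rng = np.random.default_rng(0)
k, d, e, h, a = 5, 8, 6, 7, 4
params = {
    "W_e": rng.normal(size=(d, e)),
    "W_m": rng.normal(size=(d, h)),
    "W_v": rng.normal(size=(d, a)),
    "W_h": rng.normal(size=(h, a)),
    "w_a": rng.normal(size=(a,)),
}
regions = rng.normal(size=(k, d))
hidden = rng.normal(size=(h,))
memory = rng.normal(size=(h,))
prev_word = rng.normal(size=(e,))   # embedding of the word from step t-1
curr_word = rng.normal(size=(e,))   # embedding of the word at step t

# Step 3: the same module with words from different time steps gives PW and CW.
pw, pw_weights = word_guided_attention(regions, hidden, prev_word, memory, params)
cw, cw_weights = word_guided_attention(regions, hidden, curr_word, memory, params)
```

In this sketch the word only gates the attended feature vector and does not change the attention weights, so `pw` and `cw` differ only through the word information term. Whether the original model also feeds the word into the attention scores is not stated in the abstract.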
In daily life, deliberation is a common behavior for humans to improve or refine their work (e.g., wr...
In recent years, with the rapid development of artificial intelligence, image caption has gradually ...
The result of a deep learning-based image captioning system with encoder-decoder framework relies he...
Visual attention plays an important role in understanding images and demonstrates its effectiveness in ...
With the maturity of computer vision and natural language processing technology, we are becoming mor...
We propose "Areas of Attention", a novel attention-based model for automatic...
© 2017 Association for Computing Machinery. Visual content description has been attracting broad res...
Automatic generation of captions for a given image is an active research area in Artificial Intel...
Image captioning and visual language grounding are two important tasks for image understanding, but ...
Inspired by recent work in machine translation and object detection, we introduce an attention based...
We present an attention-based image captioning method using DenseNet features. Conventional image ca...
This paper presents a novel approach for automatically generating image descriptions: visual detecto...
Generating natural and accurate descriptions in image captioning has always been a challenge. In thi...