Generating natural and accurate descriptions in image captioning has always been a challenge. In this paper, we propose a novel recall mechanism to imitate the way humans conduct captioning. Our recall mechanism has three parts: a recall unit, a semantic guide (SG), and a recalled-word slot (RWS). The recall unit is a text-retrieval module designed to retrieve recalled words for images. SG and RWS are designed to make the best use of the recalled words. The SG branch generates a recalled context, which guides the process of generating the caption. The RWS branch is responsible for copying recalled words into the caption. Inspired by the pointing mechanism in text summarization, we adopt a soft switch to balance the generated-word probabilities between SG and RWS...
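The soft switch described above can be illustrated with a minimal sketch. The abstract does not give the exact formula, so this assumes a pointer-generator-style mixture from text summarization: a scalar switch in [0, 1] weights the generation distribution (standing in for the SG branch) against a copy distribution over recalled words (standing in for the RWS branch). The function name `soft_switch_mix`, its parameters, and the toy numbers are all hypothetical and are not taken from the paper.

```python
def soft_switch_mix(p_generate, recalled_words, p_copy, vocab, switch):
    """Blend a generation distribution with a copy distribution.

    Assumed form (not confirmed by the abstract):
        p_final(w) = switch * p_generate(w) + (1 - switch) * p_copy(w)
    where p_copy is non-zero only for recalled words.
    """
    if not 0.0 <= switch <= 1.0:
        raise ValueError("switch must lie in [0, 1]")
    if len(p_generate) != len(vocab):
        raise ValueError("p_generate must have one entry per vocabulary word")
    if len(recalled_words) != len(p_copy):
        raise ValueError("p_copy must have one entry per recalled word")

    index = {word: i for i, word in enumerate(vocab)}
    # Start from the down-weighted generation probabilities.
    mixed = [switch * p for p in p_generate]
    # Add the copy mass onto the vocabulary slots of the recalled words.
    # Assumption: every recalled word is in the vocabulary.
    for word, weight in zip(recalled_words, p_copy):
        mixed[index[word]] += (1.0 - switch) * weight
    return mixed


# Toy example with made-up numbers.
vocab = ["a", "dog", "frisbee", "catches"]
p_generate = [0.4, 0.3, 0.1, 0.2]      # hypothetical SG-side distribution
recalled_words = ["dog", "frisbee"]    # hypothetical recall-unit output
p_copy = [0.25, 0.75]                  # hypothetical RWS-side weights
mixed = soft_switch_mix(p_generate, recalled_words, p_copy, vocab, switch=0.6)
```

Because both inputs are probability distributions and the switch is a convex weight, the blended result is again a distribution; lowering the switch moves probability mass toward the recalled words.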
In this paper we explore the bi-directional mapping between images and their sentence-based descrip...
With the maturity of computer vision and natural language processing technology, we are becoming mor...
Image captioning is the task of automatically generating a description of an image. Traditional imag...
Image captioning, which aims to automatically generate text descriptions of given images, has receive...
Recently, great progress in automatic image captioning has been achieved by using semantic concept...
Image captioning generates written descriptions of an image. In recent image captioning research, at...
This paper presents a novel approach for automatically generating image descriptions: visual detecto...
In daily life, deliberation is a common behavior for humans to improve or refine their work (e.g., wr...
Visual attention plays an important role in understanding images and demonstrates its effectiveness in ...
Generating stylized captions for images is a challenging task since it requires not only describing ...
The existing image captioning approaches typically train a one-stage sentence decoder, which is diff...
Automatic generation of natural language description for individual images (a.k.a. image captioning)...