Generating Image Captions Using Bahdanau Attention Mechanism and Transfer Learning

Shahnawaz Ayoub
Yonis Gulzar
Faheem Ahmad Reegu
Sherzod Turaev

Publication date

December 2022

DOI

10.3390/sym14122681

Publisher

MDPI AG

Journal

Symmetry

Abstract

Automatic image caption prediction is a challenging task in natural language processing. Most researchers have used a convolutional neural network as the encoder and decoder. However, accurate image caption prediction requires a model to understand the semantic relationships among the various objects present in an image. The attention mechanism performs a linear combination of encoder and decoder states. It emphasizes the semantic information present in the caption together with the visual information present in the image. In this paper, we incorporated the Bahdanau attention mechanism with two pre-trained convolutional neural networks, Visual Geometry Group (VGG) and InceptionV3, to predict the captions of a given image. The two pre-...
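The abstract's description of attention as a combination of encoder and decoder states can be illustrated with a minimal sketch of Bahdanau-style (additive) attention. This is not the authors' implementation: the function names, the weight matrices W1, W2 and v, the random initialization, and all dimensions are assumptions chosen for illustration, and pure Python is used so the example runs without a deep learning framework. In a captioning model the encoder features would come from a pre-trained network such as VGG or InceptionV3, and the weights would be learned.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def additive_attention(features, hidden, W1, W2, v):
    """Additive attention over image region features.

    features: encoder outputs, one feature vector per image region
    hidden:   current decoder hidden state
    W1, W2:   projection matrices (hypothetical, randomly set below)
    v:        scoring vector (hypothetical, randomly set below)
    """
    projected_hidden = matvec(W2, hidden)
    scores = []
    for feature in features:
        projected_feature = matvec(W1, feature)
        # score_i = v . tanh(W1 * feature_i + W2 * hidden)
        combined = [math.tanh(a + b)
                    for a, b in zip(projected_feature, projected_hidden)]
        scores.append(sum(vi * ci for vi, ci in zip(v, combined)))
    weights = softmax(scores)
    # Context vector: attention-weighted sum of the region features.
    dim = len(features[0])
    context = [sum(w * f[d] for w, f in zip(weights, features))
               for d in range(dim)]
    return context, weights


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


# Toy sizes, chosen only for illustration.
rng = random.Random(0)
num_regions, feature_dim, hidden_dim, attn_dim = 4, 6, 5, 3
features = random_matrix(num_regions, feature_dim, rng)
hidden = [rng.uniform(-0.5, 0.5) for _ in range(hidden_dim)]
W1 = random_matrix(attn_dim, feature_dim, rng)
W2 = random_matrix(attn_dim, hidden_dim, rng)
v = [rng.uniform(-0.5, 0.5) for _ in range(attn_dim)]

context, weights = additive_attention(features, hidden, W1, W2, v)
```

Here `weights` holds one non-negative value per image region and sums to 1, and `context` has the same length as a single region feature vector. A decoder would typically combine this context vector with the previous word embedding before predicting the next caption token.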
