We deal with the problem of generating textual captions from optical remote sensing (RS) images using the notion of deep reinforcement learning. Due to the high inter-class similarity in reference sentences describing remote sensing data, jointly encoding the sentences and images encourages prediction of captions that are semantically more precise than the ground truth in many cases. To this end, we introduce an Actor Dual-Critic training strategy where a second critic model is deployed in the form of an encoder-decoder RNN to encode the latent information corresponding to the original and generated captions. While all actor-critic methods use an actor to predict sentences for an image and a critic to provide rewards, our proposed encoder-d...
The existing methods for image captioning usually train the language model under the cross entropy l...
Image captioning is the process of automatically generating a description of an image in natural lan...
Theoretical thesis.Bibliography: pages 66-72.1. Introduction -- 2. Background and literature review ...
Remote sensing images, and the unique properties that characterize them, are attracting increased at...
High-resolution remote sensing images are now available with the progress of remote sensing technolo...
The performance of remote sensing image retrieval (RSIR) systems depends on the capability of the ex...
Methods to describe an image or video with natural language, namely image and video captioning, have...
Image captioning is a crucial technology with numerous applications, including enhancing accessibili...
Image classification, as the core task in the computer vision field, has proceeded at a breakneck p...
Large-scale caption-labeled remote sensing image samples are expensive to acquire, and the training ...
The domain of Deep Learning that is related to generation of textual description of images is called...
The domain of Deep Learning that is related to generation of textual description of images is cal...
The emergence of large-scale large language models, with GPT-4 as a prominent example, has significa...
Image captioning generates a semantic description of an image. It deals with image understanding and...
In the recent years, remote sensing has faced a huge evolution. The constantly growing availability ...
The existing methods for image captioning usually train the language model under the cross entropy l...
Image captioning is the process of automatically generating a description of an image in natural lan...
Theoretical thesis.Bibliography: pages 66-72.1. Introduction -- 2. Background and literature review ...
Remote sensing images, and the unique properties that characterize them, are attracting increased at...
High-resolution remote sensing images are now available with the progress of remote sensing technolo...
The performance of remote sensing image retrieval (RSIR) systems depends on the capability of the ex...
Methods to describe an image or video with natural language, namely image and video captioning, have...
Image captioning is a crucial technology with numerous applications, including enhancing accessibili...
Image classification, as the core task in the computer vision field, has proceeded at a breakneck p...
Large-scale caption-labeled remote sensing image samples are expensive to acquire, and the training ...
The domain of Deep Learning that is related to generation of textual description of images is called...
The domain of Deep Learning that is related to generation of textual description of images is cal...
The emergence of large-scale large language models, with GPT-4 as a prominent example, has significa...
Image captioning generates a semantic description of an image. It deals with image understanding and...
In the recent years, remote sensing has faced a huge evolution. The constantly growing availability ...
The existing methods for image captioning usually train the language model under the cross entropy l...
Image captioning is the process of automatically generating a description of an image in natural lan...
Theoretical thesis.Bibliography: pages 66-72.1. Introduction -- 2. Background and literature review ...