Reinforcement learning (RL) has shown its advantages in image captioning by optimizing the non-differentiable metric directly in the reward learning process. However, due to the reward hacking problem in RL, maximizing reward may not lead to better quality of the caption, especially from the aspects of propositional content and distinctiveness. In this work, we propose to use a new learning method, meta learning, to utilize supervision from the ground truth whilst optimizing the reward function in RL. To improve the propositional content and the distinctiveness of the generated captions, the proposed model provides the global optimal solution by taking different gradient steps towards the supervision task and the reinforcement task, simulta...
Meta-learning strives to learn about and improve a student's machine learning algorithm. However, ex...
Human ratings are currently the most accurate way to assess the quality of an image captioning model...
In this paper, we propose a novel conditional-generativeadversarial-nets-based image captioning fram...
Theoretical thesis.Bibliography: pages 66-72.1. Introduction -- 2. Background and literature review ...
The existing methods for image captioning usually train the language model under the cross entropy l...
Context, the embedding of previous collected trajectories, is a powerful construct for Meta-Reinforc...
The existing image captioning approaches typically train a one-stage sentence decoder, which is diff...
International audienceThis paper addresses a cornerstone of Automated Machine Learning: the problem ...
Image captioning is a crucial technology with numerous applications, including enhancing accessibili...
The existing methods for image captioning usually train the language model under the cross entropy l...
In this thesis, we discuss meta learning for control:policy learning algorithms that can themselves ...
Reward shaping is one of the most effective methods to tackle the crucial yet challenging problem of...
Intelligent agents should have the ability to leverage knowledge from previously learned tasks in or...
Despite the remarkable progress of image captioning, existing captioners typically lack the controll...
While artificial learning agents have demonstrated impressive capabilities, these successes are typi...
Meta-learning strives to learn about and improve a student's machine learning algorithm. However, ex...
Human ratings are currently the most accurate way to assess the quality of an image captioning model...
In this paper, we propose a novel conditional-generativeadversarial-nets-based image captioning fram...
Theoretical thesis.Bibliography: pages 66-72.1. Introduction -- 2. Background and literature review ...
The existing methods for image captioning usually train the language model under the cross entropy l...
Context, the embedding of previous collected trajectories, is a powerful construct for Meta-Reinforc...
The existing image captioning approaches typically train a one-stage sentence decoder, which is diff...
International audienceThis paper addresses a cornerstone of Automated Machine Learning: the problem ...
Image captioning is a crucial technology with numerous applications, including enhancing accessibili...
The existing methods for image captioning usually train the language model under the cross entropy l...
In this thesis, we discuss meta learning for control:policy learning algorithms that can themselves ...
Reward shaping is one of the most effective methods to tackle the crucial yet challenging problem of...
Intelligent agents should have the ability to leverage knowledge from previously learned tasks in or...
Despite the remarkable progress of image captioning, existing captioners typically lack the controll...
While artificial learning agents have demonstrated impressive capabilities, these successes are typi...
Meta-learning strives to learn about and improve a student's machine learning algorithm. However, ex...
Human ratings are currently the most accurate way to assess the quality of an image captioning model...
In this paper, we propose a novel conditional-generativeadversarial-nets-based image captioning fram...