Text-based environments enable RL agents to learn to converse and perform interactive tasks through natural language. However, previous RL approaches applied to text-based environments show poor performance when evaluated on unseen games. This paper investigates the improvement of generalisation performance through the simple switch from a value-based update method to a policy-based one, within text-based environments. We show that by replacing commonly used value-based methods with REINFORCE with baseline, a far more general agent is produced. The policy-based agent is evaluated on Coin Collector and Question Answering with interactive text (QAit), two text-based environments designed to test zero-shot performance. We see substantial imp...
The lack of data efficiency and stability is one of the main challenges in end-to-end model free rei...
textReinforcement Learning (RL) offers a promising approach towards achieving the dream of autonomou...
PhD Theses.In complex environments the learning of robust and general policies often requires expos...
Deep reinforcement learning provides a promising approach for text-based games in studying natural l...
Obtaining policies that can generalise to new environments in reinforcement learning is challenging....
The choice of state and action representation in Reinforcement Learning (RL) has a significant effec...
Text-based games are complex, interactive simulations where a player is asked to process the text de...
To be successful in real-world tasks, Reinforcement Learning (RL) needs to exploit the compositional...
Text-based games can be used to develop task-oriented text agents for accomplishing tasks with high-...
Text-based games have emerged as an important test-bed for Reinforcement Learning (RL) research, req...
Text-based games present a unique class of sequential decision making problem in which agents intera...
It has been a long-standing goal in Artificial Intelligence (AI) to build machines that can solve ta...
Different from classic Supervised Learning, Reinforcement Learning (RL), is fundamentally interactiv...
The ability to learn optimal control policies in systems where action space is defined by sentences ...
© 2018 AI Access Foundation. All rights reserved. In this paper, we explore the utilization of natur...
The lack of data efficiency and stability is one of the main challenges in end-to-end model free rei...
textReinforcement Learning (RL) offers a promising approach towards achieving the dream of autonomou...
PhD Theses.In complex environments the learning of robust and general policies often requires expos...
Deep reinforcement learning provides a promising approach for text-based games in studying natural l...
Obtaining policies that can generalise to new environments in reinforcement learning is challenging....
The choice of state and action representation in Reinforcement Learning (RL) has a significant effec...
Text-based games are complex, interactive simulations where a player is asked to process the text de...
To be successful in real-world tasks, Reinforcement Learning (RL) needs to exploit the compositional...
Text-based games can be used to develop task-oriented text agents for accomplishing tasks with high-...
Text-based games have emerged as an important test-bed for Reinforcement Learning (RL) research, req...
Text-based games present a unique class of sequential decision making problem in which agents intera...
It has been a long-standing goal in Artificial Intelligence (AI) to build machines that can solve ta...
Different from classic Supervised Learning, Reinforcement Learning (RL), is fundamentally interactiv...
The ability to learn optimal control policies in systems where action space is defined by sentences ...
© 2018 AI Access Foundation. All rights reserved. In this paper, we explore the utilization of natur...
The lack of data efficiency and stability is one of the main challenges in end-to-end model free rei...
textReinforcement Learning (RL) offers a promising approach towards achieving the dream of autonomou...
PhD Theses.In complex environments the learning of robust and general policies often requires expos...