International audienceThis paper investigates the impact of reward shaping on a reinforcement learning-based spoken dialogue system's learning. A diffuse reward function gives a reward after each transition between two dialogue states. A sparse function only gives a reward at the end of the dialogue. Reward shaping consists of learning a diffuse function without modifying the optimal policy compared to a sparse one. Two reward shaping methods are applied to a corpus of dialogues evaluated with numerical performance scores. Learning with these functions is compared to the sparse case and it is shown, on simulated dialogues, that the policies learnt after reward shaping lead to higher performance
Shaping can be an effective method for improving the learning rate in reinforcement systems. Previou...
This document proposes to learn the behaviour of the dialogue manager of a spoken dialogue system fr...
International audienceSpoken Dialogue Systems are man-machine interfaces which use spoken language a...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of s...
Reinforcement learning is widely used for dialogue policy optimization where the reward function oft...
Statistical spoken dialogue systems have the attractive property of being able to be optimised from ...
Statistical spoken dialogue systems have the attractive property of being able to be optimised from ...
Adapting Spoken Dialogue Systems to the user is supposed to result in more efficient and successful ...
In continuing tasks, average-reward reinforcement learning may be a more appropriate problem formula...
In a spoken dialogue system, the function of a dialogue manager is to select actions based on observ...
Shaping can be an effective method for improving the learning rate in reinforcement systems. Previou...
This document proposes to learn the behaviour of the dialogue manager of a spoken dialogue system fr...
International audienceSpoken Dialogue Systems are man-machine interfaces which use spoken language a...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of s...
Reinforcement learning is widely used for dialogue policy optimization where the reward function oft...
Statistical spoken dialogue systems have the attractive property of being able to be optimised from ...
Statistical spoken dialogue systems have the attractive property of being able to be optimised from ...
Adapting Spoken Dialogue Systems to the user is supposed to result in more efficient and successful ...
In continuing tasks, average-reward reinforcement learning may be a more appropriate problem formula...
In a spoken dialogue system, the function of a dialogue manager is to select actions based on observ...
Shaping can be an effective method for improving the learning rate in reinforcement systems. Previou...
This document proposes to learn the behaviour of the dialogue manager of a spoken dialogue system fr...
International audienceSpoken Dialogue Systems are man-machine interfaces which use spoken language a...