Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has been in the focus of research for many years. While most work that is based on reinforcement learning employs an objective measure like task success for modelling the reward signal, we propose to use a reward signal based on user satisfaction. We propose a novel estimator and show that it outperforms all previous estimators while learning temporal dependencies implicitly. We show in simulated experiments that a live user satisfaction estimation model may be applied resulting in higher estimated satisfaction whilst achieving similar success rates. Moreover, we show that a satisfaction estimation model trained on one domain may be applied in ma...
International audienceReinforcement learning-based spoken dialogue systems aimto compute an optimal ...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
International audienceReinforcement learning-based spoken dialogue systems aimto compute an optimal ...
Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has ...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for meas...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of s...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
Reinforcement learning-based spoken dialogue systems aim to compute an optimal strategy for dialogue...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
International audienceReinforcement learning-based spoken dialogue systems aimto compute an optimal ...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
International audienceReinforcement learning-based spoken dialogue systems aimto compute an optimal ...
Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has ...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for meas...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of s...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
Reinforcement learning-based spoken dialogue systems aim to compute an optimal strategy for dialogue...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
International audienceReinforcement learning-based spoken dialogue systems aimto compute an optimal ...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
International audienceReinforcement learning-based spoken dialogue systems aimto compute an optimal ...