Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has been in the focus of research for many years. While most work which is based on reinforcement learning employs an objective measure like task success for modelling the reward signal, we propose to use a reward based on user satisfaction. We will show in simulated experiments that a live user satisfaction estimation model may be applied resulting in higher estimated satisfaction whilst achieving similar success rates. Moreover, we will show that one satisfaction estimation model which has been trained on one domain may be applied in many other domains which cover a similar task. We will verify our findings by employing the model to one of the ...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
Reinforcement learning-based spoken dialogue systems aim to compute an optimal strategy for dialogue...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has ...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for meas...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of s...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
We report evaluation results for real users of a learnt dialogue management policy versus a hand-cod...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
Reinforcement learning-based spoken dialogue systems aim to compute an optimal strategy for dialogue...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has ...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
International audienceThis paper addresses the problem of defining, from data, a reward function in ...
To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for meas...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of s...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
We report evaluation results for real users of a learnt dialogue management policy versus a hand-cod...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
Reinforcement learning-based spoken dialogue systems aim to compute an optimal strategy for dialogue...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...