Statistical spoken dialogue systems have the attractive property of being able to be optimised from data via interactions with real users. However in the rein-forcement learning paradigm the dialogue manager (agent) often requires significant time to explore the state-action space to learn to behave in a desirable manner. This is a critical issue when the system is trained on-line with real users where learn-ing costs are expensive. Reward shaping is one promising technique for addressing these concerns. Here we examine three re-current neural network (RNN) approaches for providing reward shaping information in addition to the primary (task-orientated) environmental feedback. These RNNs are trained on returns from dialogues gener-ated by a ...
Reinforcement learning methods have emerged as a popular choice for training an efficient and effect...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Reinforcement Learning approaches are commonly used for dialog policy learning. Reward function is a...
Statistical spoken dialogue systems have the attractive property of being able to be optimised from ...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
The ability to compute an accurate reward function is essential for optimising a dialogue policy via...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for meas...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
We describe a two-step approach for dialogue management in task-oriented spoken dialogue systems. A ...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Reinforcement learning methods have emerged as a popular choice for training an efficient and effect...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Reinforcement Learning approaches are commonly used for dialog policy learning. Reward function is a...
Statistical spoken dialogue systems have the attractive property of being able to be optimised from ...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
The ability to compute an accurate reward function is essential for optimising a dialogue policy via...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for meas...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
We describe a two-step approach for dialogue management in task-oriented spoken dialogue systems. A ...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Reinforcement learning methods have emerged as a popular choice for training an efficient and effect...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Reinforcement Learning approaches are commonly used for dialog policy learning. Reward function is a...