Gaussian processes reinforcement learning provides an appealing framework for training the dialogue policy as it takes into account correlations of the objective function given different dialogue belief states, which can significantly speed up the learning. These correlations are modelled by the kernel function which may depend on hyper-parameters. So far, for real-world dialogue systems the hyperparameters have been hand-tuned, relying on the designer to adjust the correlations, or simple non-parametrised kernel functions have been used instead. Here, we examine different kernel structures and show that it is possible to optimise the hyperparameters from data yielding improved performance of the resulting dialogue policy. We confirm this i...
Abstract—Reinforcement learning is now an acknowledged ap-proach for optimising the interaction stra...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learn...
Spoken dialogue systems allow humans to interact with machines using natural speech. As such, they h...
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of s...
Modelling dialogue as a Partially Observable Markov Decision Process (POMDP) enables a dialogue poli...
The optimization of dialogue policies using reinforcement learning (RL) is now an accepted part of t...
Statistical dialogue systems offer the potential to reduce costs by learning policies automatically ...
This paper presents a novel algorithm for learning parameters in statistical dialogue systems which ...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
With the increase of machine learning usage by industries and scientific communities in a variety of...
Adapting a Spoken Dialogue System to the user's satisfaction is supposed to result in more successfu...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
A partially observable Markov decision process (POMDP) has been proposed as a dialog model that enab...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
Abstract—Reinforcement learning is now an acknowledged ap-proach for optimising the interaction stra...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learn...
Spoken dialogue systems allow humans to interact with machines using natural speech. As such, they h...
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of s...
Modelling dialogue as a Partially Observable Markov Decision Process (POMDP) enables a dialogue poli...
The optimization of dialogue policies using reinforcement learning (RL) is now an accepted part of t...
Statistical dialogue systems offer the potential to reduce costs by learning policies automatically ...
This paper presents a novel algorithm for learning parameters in statistical dialogue systems which ...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
With the increase of machine learning usage by industries and scientific communities in a variety of...
Adapting a Spoken Dialogue System to the user's satisfaction is supposed to result in more successfu...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
A partially observable Markov decision process (POMDP) has been proposed as a dialog model that enab...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
Abstract—Reinforcement learning is now an acknowledged ap-proach for optimising the interaction stra...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learn...