Reinforcement learning techniques have been used successfully to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estimate the parameters of a dialogue policy, which selects the system's responses based on the inferred dialogue state. However, the inference of the dialogue state itself depends on a dialogue model, which describes the expected behaviour of a user interacting with the system. Ideally, the parameters of this dialogue model should also be optimised to maximise the expected cumulative reward. This article presents two novel reinforcement learning algorithms for learning the parameters of a dialogue model. First, the Natural Belief Critic algorithm is designed to optimise the m...
This paper investigates the impact of reward shaping on a reinforcement learni...
This paper describes a novel method by which a spoken dialogue system can learn to choose an optimal...
This paper presents a novel algorithm for learning parameters in statistical dialogue systems which ...
In a spoken dialogue system, the function of a dialogue manager is to select actions based on observ...
Spoken Dialogue Systems (SDS) are systems which have the ability to interact w...
Spoken dialogue systems allow humans to interact with machines using natural speech. As such, they h...
This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...