The performance of adversarial dialogue generation models relies on the quality of the reward signal produced by the discriminator. The reward signal from a poor discriminator can be very sparse and unstable, which may lead the generator to fall into a local optimum or to produce nonsense replies. To alleviate the first problem, we first extend a recently proposed adversarial dialogue generation method to an adversarial imitation learning solution. Then, in the framework of adversarial inverse reinforcement learning, we propose a new reward model for dialogue generation that can provide a more accurate and precise reward signal for generator training. We evaluate the performance of the resulting model with automatic metrics and human evalua...
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue age...
International audienceHuman machine interaction is a field where machine learning is present at almo...
As robots and other autonomous agents enter our homes, hospitals, schools, and workplaces, it is imp...
The performance of adversarial dialogue generation models relies on the quality of the reward signal...
Reinforcement learning methods have emerged as a popular choice for training an efficient and effect...
Despite the recent success of large-scale language models on various downstream NLP tasks, the repet...
This paper introduces an adversarial method to stress-test trained metrics for the evaluation of con...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
Reinforcement learning (RL) provides a powerful framework for decision-making, but its application i...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
Building a controllable neural conversation model (NCM) is an important task. In this paper, we focu...
International audienceHuman machine interaction is a field where machine learning is present at almo...
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of s...
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue age...
International audienceHuman machine interaction is a field where machine learning is present at almo...
As robots and other autonomous agents enter our homes, hospitals, schools, and workplaces, it is imp...
The performance of adversarial dialogue generation models relies on the quality of the reward signal...
Reinforcement learning methods have emerged as a popular choice for training an efficient and effect...
Despite the recent success of large-scale language models on various downstream NLP tasks, the repet...
This paper introduces an adversarial method to stress-test trained metrics for the evaluation of con...
Abstract. This paper investigates the impact of reward shaping on a reinforcement learning-based spo...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
Reinforcement learning (RL) provides a powerful framework for decision-making, but its application i...
International audienceThis paper investigates the impact of reward shaping on a reinforcement learni...
Building a controllable neural conversation model (NCM) is an important task. In this paper, we focu...
International audienceHuman machine interaction is a field where machine learning is present at almo...
Reinforcement techniques have been successfully used to maximise the expected cumulative reward of s...
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue age...
International audienceHuman machine interaction is a field where machine learning is present at almo...
As robots and other autonomous agents enter our homes, hospitals, schools, and workplaces, it is imp...