Recent works usually address Dialog policy learning DPL by training a reinforcement learning (RL) agent to determine the best dialog action. However, existing works on deep RL require a large volume of agent-user interactions to achieve acceptable performance. In this paper, we propose to make full use of the plain text knowledge from the pre-trained language model to accelerate the RL agent's learning speed. Specifically, we design a dialog action-aware transformer encoder (DaTrans), which integrates a new fine-tuning procedure named masked last action task to encourage DaTrans to be dialog-aware and distils action-specific features. Then, DaTrans is further optimized in an RL setting with ongoing interactions and evolves through explorati...
Multi-action dialog policy (MADP), which generates multiple atomic dialog actions per turn, has been...
Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, cu...
International audienceDesigning dialog policies for voice-enabled interfaces is a tailoring job that...
Dialogue policy learning (DPL) is a crucial component of dialogue modelling. Its primary role is to ...
Goal-oriented dialog policy learning algorithms aim to learn a dialog policy for selecting language ...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...
Learning task-oriented dialog policies via reinforcement learning typically requires large amounts o...
Reinforcement Learning approaches are commonly used for dialog policy learning. Reward function is a...
Recently, reinforcement learning (RL) has been applied to task-oriented dialogue systems by using la...
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue age...
Dialogue act annotations are important to improve response generation quality in task-oriented dialo...
International audienceOne major drawback of Reinforcement Learning (RL) Spoken Dialogue Systems is t...
Deep reinforcement learning dialogue systems are attractive because they can jointly learn their fea...
One of the main challenges for conversational agents is to select the optimal dialogue policy based ...
Multi-action dialog policy (MADP), which generates multiple atomic dialog actions per turn, has been...
Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, cu...
International audienceDesigning dialog policies for voice-enabled interfaces is a tailoring job that...
Dialogue policy learning (DPL) is a crucial component of dialogue modelling. Its primary role is to ...
Goal-oriented dialog policy learning algorithms aim to learn a dialog policy for selecting language ...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...
Learning task-oriented dialog policies via reinforcement learning typically requires large amounts o...
Reinforcement Learning approaches are commonly used for dialog policy learning. Reward function is a...
Recently, reinforcement learning (RL) has been applied to task-oriented dialogue systems by using la...
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue age...
Dialogue act annotations are important to improve response generation quality in task-oriented dialo...
International audienceOne major drawback of Reinforcement Learning (RL) Spoken Dialogue Systems is t...
Deep reinforcement learning dialogue systems are attractive because they can jointly learn their fea...
One of the main challenges for conversational agents is to select the optimal dialogue policy based ...
Multi-action dialog policy (MADP), which generates multiple atomic dialog actions per turn, has been...
Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, cu...
International audienceDesigning dialog policies for voice-enabled interfaces is a tailoring job that...