Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, current pre-training methods mainly focus on enhancing dialog understanding and generation tasks while neglecting the exploitation of dialog policy. In this paper, we propose GALAXY, a novel pre-trained dialog model that explicitly learns dialog policy from limited labeled dialogs and large-scale unlabeled dialog corpora via semi-supervised learning. Specifically, we introduce a dialog act prediction task for policy optimization during pre-training and employ a consistency regularization term to refine the learned representation with the help of unlabeled dialogs. We also implement a gating mechanism to weigh suitable unlabeled dialog samples. ...
Response generation for task-oriented dialogues involves two basic components: dialogue planning and...
Although many pretrained models exist for text or images, there have been relatively fewer attempts ...
A partially observable Markov decision process (POMDP) has been proposed as a dialog model that enab...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
This paper presents our task-oriented dialog system UBAR which models task-oriented dialogs on a dia...
Goal-oriented dialog policy learning algorithms aim to learn a dialog policy for selecting language ...
In this paper we describe a new approach for learning dialog act processing. In this approach we int...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...
Dialog system is class of intelligent system that interacts with human via natural languageinterface...
Learning task-oriented dialog policies via reinforcement learning typically requires large amounts o...
Building a universal conversational agent has been a long-standing goal of the dialogue research com...
Numerous new dialog domains are being created every day while collecting data for these domains is e...
Recently, reinforcement learning (RL) has been applied to task-oriented dialogue systems by using la...
DoctorThis paper presents a new hybrid dialog management framework that integrates a statistical ran...
Incorporating external knowledge into the response generation process is essential to building more ...
Response generation for task-oriented dialogues involves two basic components: dialogue planning and...
Although many pretrained models exist for text or images, there have been relatively fewer attempts ...
A partially observable Markov decision process (POMDP) has been proposed as a dialog model that enab...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
This paper presents our task-oriented dialog system UBAR which models task-oriented dialogs on a dia...
Goal-oriented dialog policy learning algorithms aim to learn a dialog policy for selecting language ...
In this paper we describe a new approach for learning dialog act processing. In this approach we int...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...
Dialog system is class of intelligent system that interacts with human via natural languageinterface...
Learning task-oriented dialog policies via reinforcement learning typically requires large amounts o...
Building a universal conversational agent has been a long-standing goal of the dialogue research com...
Numerous new dialog domains are being created every day while collecting data for these domains is e...
Recently, reinforcement learning (RL) has been applied to task-oriented dialogue systems by using la...
DoctorThis paper presents a new hybrid dialog management framework that integrates a statistical ran...
Incorporating external knowledge into the response generation process is essential to building more ...
Response generation for task-oriented dialogues involves two basic components: dialogue planning and...
Although many pretrained models exist for text or images, there have been relatively fewer attempts ...
A partially observable Markov decision process (POMDP) has been proposed as a dialog model that enab...