We propose a method for constructing dialogue success classifiers that are capable of making accurate predictions in domains unseen during training. Pooling and adaptation are also investigated for constructing multi-domain models when data is available in the new domain. This is achieved by reformulating the features input to the recurrent neural network models introduced in [1]. Importantly, on our task of main interest, this enables policy training in a new domain without the dialogue success classifier (which forms the reinforcement learning reward function) ever having seen data from that domain before. This occurs whilst incurring only a small reduction in performance relative to developing and using an in-domain dialogue success clas...
Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has ...
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domai...
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domai...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for meas...
Reinforcement learning methods have emerged as a popular choice for training an efficient and effect...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
Policy optimization is the core part of statistical dialogue management. Deep reinforcement learning...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
Statistical dialogue systems offer the potential to reduce costs by learning policies automatically ...
Learning task-oriented dialog policies via reinforcement learning typically requires large amounts o...
Human conversation is inherently complex, often spanning many different topics/domains. This makes p...
Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has ...
Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has ...
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domai...
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domai...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for meas...
Reinforcement learning methods have emerged as a popular choice for training an efficient and effect...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
To train a statistical spoken dialogue system (SDS) it is essen-tial that an accurate method for mea...
Policy optimization is the core part of statistical dialogue management. Deep reinforcement learning...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
Statistical dialogue systems offer the potential to reduce costs by learning policies automatically ...
Learning task-oriented dialog policies via reinforcement learning typically requires large amounts o...
Human conversation is inherently complex, often spanning many different topics/domains. This makes p...
Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has ...
Learning suitable and well-performing dialogue behaviour in statistical spoken dialogue systems has ...
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domai...
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domai...