Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domains) face scalability problems due to large search spaces. This paper proposes a three-stage method for multi-domain dialogue policy learning—termed NDQN, and applies it to an information-seeking spoken dialogue system in the domains of restaurants and hotels. In this method, the first stage does multi-policy learning via a network of DQN agents; the second makes use of compact state representations by compressing raw inputs; and the third stage applies a pre-training phase for bootstraping the behaviour of agents in the network. Experimental results comparing DQN (baseline) versus NDQN (proposed) using simulations report that the proposed me...
Task-oriented dialogue systems provide a convenient interface to help users complete tasks. An impor...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domai...
Deep reinforcement learning dialogue systems are attractive because they can jointly learn their fea...
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue age...
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue age...
Reinforcement learning (RL) is a promising approach to solve dialogue policy optimisation. Tradition...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
Deep reinforcement learning has shown great potential in training dialogue policies. However, its fa...
Deep reinforcement learning has shown great potential in training dialogue policies. However, its fa...
We present a new algorithm that significantly improves the efficiency of exploration for deep Q-lear...
Deep reinforcement learning for interactive multimodal robots is attractive for endowing machines wi...
Policy optimization is the core part of statistical dialogue management. Deep reinforcement learning...
Human conversation is inherently complex, often spanning many different topics/domains. This makes p...
Task-oriented dialogue systems provide a convenient interface to help users complete tasks. An impor...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...
Standard deep reinforcement learning methods such as Deep Q-Networks (DQN) for multiple tasks (domai...
Deep reinforcement learning dialogue systems are attractive because they can jointly learn their fea...
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue age...
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue age...
Reinforcement learning (RL) is a promising approach to solve dialogue policy optimisation. Tradition...
© 2018 Chuandong YinTask-oriented dialogue systems such as Apple Siri and Microsoft Cortana are beco...
Deep reinforcement learning has shown great potential in training dialogue policies. However, its fa...
Deep reinforcement learning has shown great potential in training dialogue policies. However, its fa...
We present a new algorithm that significantly improves the efficiency of exploration for deep Q-lear...
Deep reinforcement learning for interactive multimodal robots is attractive for endowing machines wi...
Policy optimization is the core part of statistical dialogue management. Deep reinforcement learning...
Human conversation is inherently complex, often spanning many different topics/domains. This makes p...
Task-oriented dialogue systems provide a convenient interface to help users complete tasks. An impor...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...