During the recent Dialog State Tracking Challenge (DSTC), a fundamental question was raised: “Would better performance in dialog state tracking translate to better performance of the optimized policy by reinforcement learning? ” Also, during the challenge system evaluation, another non-trivial question arose: “Which evaluation metric and schedule would best predict improvement in overall dialog performance?” This paper aims to answer these questions by applying an off-policy reinforcement learning method to the output of each challenge system. The results give a positive answer to the first question. Thus the effort to separately improve the performance of dialog state tracking as carried out in the DSTC may be justified. The answer to the ...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...
International audienceDesigning dialog policies for voice-enabled interfaces is a tailoring job that...
Abstract—Reinforcement learning is now an acknowledged approach for optimizing the interaction strat...
In spoken dialog systems, dialog state tracking refers to the task of correctly inferring the user's...
In a spoken dialogue system, the function of a dialogue manager is to select actions based on observ...
A spoken dialog system, while commu-nicating with a user, must keep track of what the user wants fro...
A spoken dialog system, while commu-nicating with a user, must keep track of what the user wants fro...
In a spoken dialog system, dialog state tracking refers to the task of correctly inferring the state...
International audienceReinforcement learning is now an acknowledged approach for optimising the inte...
Dialogue Policy Learning is a key component in a task-oriented dialogue system (TDS) that decides th...
International audienceSpoken Dialogue Systems (SDS) are systems which have the ability to interact w...
Abstract—Reinforcement learning is now an acknowledged ap-proach for optimising the interaction stra...
There is a strong relationship between evaluation and methods for automatically training language pr...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...
International audienceDesigning dialog policies for voice-enabled interfaces is a tailoring job that...
Abstract—Reinforcement learning is now an acknowledged approach for optimizing the interaction strat...
In spoken dialog systems, dialog state tracking refers to the task of correctly inferring the user's...
In a spoken dialogue system, the function of a dialogue manager is to select actions based on observ...
A spoken dialog system, while commu-nicating with a user, must keep track of what the user wants fro...
A spoken dialog system, while commu-nicating with a user, must keep track of what the user wants fro...
In a spoken dialog system, dialog state tracking refers to the task of correctly inferring the state...
International audienceReinforcement learning is now an acknowledged approach for optimising the inte...
Dialogue Policy Learning is a key component in a task-oriented dialogue system (TDS) that decides th...
International audienceSpoken Dialogue Systems (SDS) are systems which have the ability to interact w...
Abstract—Reinforcement learning is now an acknowledged ap-proach for optimising the interaction stra...
There is a strong relationship between evaluation and methods for automatically training language pr...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
Viewing dialogue management as a reinforcement learning task enables a system to learn to act optima...
Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently most...