Spoken dialog systems typically use a limited number of non- understanding recovery strategies and simple heuristic policies to engage them (e.g. first ask user to repeat, then give help, then transfer to an operator). We propose a supervised, online method for learning a non-understanding recovery policy over a large set of recovery strategies. The approach consists of two steps: first, we construct runtime estimates for the likelihood of success of each recovery strategy, and then we use these estimates to construct a policy. An experiment with a publicly available spoken dialog system shows that the learned policy produced a 12.5% relative improvement in the non-understanding recovery rate. </p
We report evaluation results for real users of a learnt dialogue management policy versus a hand-cod...
Reinforcement learning-based spoken dialogue systems aim to compute an optimal strategy for dialogue...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
International audienceDesigning dialog policies for voice-enabled interfaces is a tailoring job that...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
Behavior cloning is a method of automated decision-making that aims to extract meaningful informatio...
We present results from an extensive empiri-cal analysis of non-understanding errors and ten non-und...
Conversational agents (CA) occasionally fail to understand the user's intention or respond inappropr...
We present results from an extensive empirical analysis of non-understanding errors and ten non-und...
We study the problem of alignment be-tween dialogue participants, using the prac-tical example of “t...
This paper addresses the issue of interpretability and auditability of reinforcement-learning agents...
Thesis (Ph.D.), Computer Science, Washington State UniversityReinforcement learning (RL) has had man...
To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for meas...
Intelligent systems need to be able to recover from mistakes, resolve uncertainty, and adapt to nove...
We report evaluation results for real users of a learnt dialogue management policy versus a hand-cod...
Reinforcement learning-based spoken dialogue systems aim to compute an optimal strategy for dialogue...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
International audienceDesigning dialog policies for voice-enabled interfaces is a tailoring job that...
Reinforcement learning (RL) is now part of the state of the art in the domain of spoken dialogue sys...
Behavior cloning is a method of automated decision-making that aims to extract meaningful informatio...
We present results from an extensive empiri-cal analysis of non-understanding errors and ten non-und...
Conversational agents (CA) occasionally fail to understand the user's intention or respond inappropr...
We present results from an extensive empirical analysis of non-understanding errors and ten non-und...
We study the problem of alignment be-tween dialogue participants, using the prac-tical example of “t...
This paper addresses the issue of interpretability and auditability of reinforcement-learning agents...
Thesis (Ph.D.), Computer Science, Washington State UniversityReinforcement learning (RL) has had man...
To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for meas...
Intelligent systems need to be able to recover from mistakes, resolve uncertainty, and adapt to nove...
We report evaluation results for real users of a learnt dialogue management policy versus a hand-cod...
Reinforcement learning-based spoken dialogue systems aim to compute an optimal strategy for dialogue...
We propose a method for constructing dialogue success classifiers that are capable of making accurat...