We introduce Span-ConveRT, a light-weight model for dialog slot-filling which frames the task as a turn-based span extraction task. This formulation allows for a simple integration of conversational knowledge coded in large pretrained conversational models such as ConveRT (Henderson et al., 2019). We show that leveraging such knowledge in Span-ConveRT is especially useful for few-shot learning scenarios: we report consistent gains over 1) a span extractor that trains representations from scratch in the target domain, and 2) a BERT-based span extractor. In order to inspire more work on span extraction for the slot-filling task, we also release RESTAURANTS-8K, a new challenging data set of 8,198 utterances, compiled from actual conversations...
Robust spoken language understanding in large-scale conversational dialog applications is usually pe...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
Understanding user utterances in human-computer spoken dialogue systems involves a multi-level pragm...
Slot filling is a core operation for utterance understanding in task-oriented dialogue systems. Slot...
Large-scale pre-trained language models have contributed significantly to natural language processin...
As the labeling cost for different modules in task-oriented dialog (ToD) systems is high, a major ch...
Training statistical dialog models in spoken dialog systems (SDS) re-quires large amounts of annotat...
A slot value might be provided segment by segment over multiple-turn interactions in a dialog, espec...
Abstract Conversational modeling is an important task in natural language understanding and machine ...
In this paper, we study the task of selecting the optimal response given a user and system utterance...
Pretraining deep neural networks to perform language modeling - that is, to reconstruct missing word...
Finding threads in textual dialogs is emerging as a need to better organize stored knowledge. We cap...
In this paper we describe a new approach for learning dialog act processing. In this approach we int...
This paper reports an ongoing effort to derive linear discourse structures from a corpus of telephon...
This paper addresses zero-shot slot filling, which tries to build a system that can generalize to un...
Robust spoken language understanding in large-scale conversational dialog applications is usually pe...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
Understanding user utterances in human-computer spoken dialogue systems involves a multi-level pragm...
Slot filling is a core operation for utterance understanding in task-oriented dialogue systems. Slot...
Large-scale pre-trained language models have contributed significantly to natural language processin...
As the labeling cost for different modules in task-oriented dialog (ToD) systems is high, a major ch...
Training statistical dialog models in spoken dialog systems (SDS) re-quires large amounts of annotat...
A slot value might be provided segment by segment over multiple-turn interactions in a dialog, espec...
Abstract Conversational modeling is an important task in natural language understanding and machine ...
In this paper, we study the task of selecting the optimal response given a user and system utterance...
Pretraining deep neural networks to perform language modeling - that is, to reconstruct missing word...
Finding threads in textual dialogs is emerging as a need to better organize stored knowledge. We cap...
In this paper we describe a new approach for learning dialog act processing. In this approach we int...
This paper reports an ongoing effort to derive linear discourse structures from a corpus of telephon...
This paper addresses zero-shot slot filling, which tries to build a system that can generalize to un...
Robust spoken language understanding in large-scale conversational dialog applications is usually pe...
This article describes a methodology for collecting text from the Web to match a target sublanguage ...
Understanding user utterances in human-computer spoken dialogue systems involves a multi-level pragm...