Generally speaking, the evaluation of spoken dialog systems is based on objective metrics that are supposed to offer a complete survey of the system's behaviour. Unfortunately, this approach boils down to measuring the overall performance of the system. Despite its indisputable interest, this quantitative approach lacks the predictive power needed for a really informative evaluation. We therefore propose a complementary methodology that aims to satisfy this criterion of predictability. Inspired by earlier NLP work (Rolbert and Sabatier 1996), it is based on the definition of DQR tests that are specific to each linguistic phenomenon, which warrants the qualitative and predictive nature of the evaluation....
Rating scale design and development for testing speaking is generally conducted using one of two app...
The aim of the MEDIA-EVALDA project is to evaluate the understanding capabilit...
The DCR methodology is a framework that proposes a generic and detailed evaluation of spoken dialog ...
This paper presents a new paradigm of “challenge” evaluation of Spoken Language Understanding. This...
Evaluating human-machine dialogue systems is not so efficient, objective, and ...
We present a generic template for spoken dialogue systems integrating speech recognition and synthes...
As spoken language dialogue systems (SLDSs) proliferate in the market place, the issue of SLDS evalu...
The goal of formative evaluation is to provide information during development about where a given sy...
This paper suggests a model and methodology for measuring the breadth and flexibility of a dialog sy...
This paper explores an existing spoken language system evaluation method and presents an a...