In this paper, we investigate whether a dataset derived from a multi-purpose corpus such as the Spoken Dutch Corpus may be considered appropriate for developing a taxonomy of wh-questions, and a model of the way in which these questions are integrated in spoken discourse. We compare the results obtained from the Spoken Dutch Corpus with a similar analysis of a large random collection of FAQs from the internet. We find substantial differences between the questions in spoken discourse and FAQs. Therefore, it may not be trivial to use a general purpose corpus as a starting point for developing models for human-computer interaction
The Spoken Dutch Corpus (CGN) is a major new resource for research into contemporary spoken Dutch. A...
Spoken language corpora— as used in conversation analytic research, language acquisition studies and...
Speech perception and spoken word recognition are not only affected by what is being said, but also ...
Contains fulltext : 61291.pdf (publisher's version ) (Open Access)26 mei 20044 p
Contains fulltext : 61290.pdf (publisher's version ) (Closed access)[Lisbon, Portu...
The Spoken Dutch Corpus that is currently under construction will constitute a 10-million-word corpu...
The Spoken Dutch Corpus project (1998-2003) is aimed at the development of a corpus of 1,000 hours o...
This chapter discusses the main principles that should be taken into account when choosing an existi...
Based on an analysis of 350 questions and their responses in a corpus of ordinary interactions, this...
We present a lexicon of Dutch Discourse Connectives (DisCoDict). Its content was obtained using a tw...
We present a lexicon of Dutch Discourse Connectives (DisCoDict). Its content was obtained using a tw...
This study set out to investigate how accent placement is pragmatically governed in WH-questions. Ce...
This study set out to investigate how accent placement is pragmatically governed in WH-questions. Ce...
The paper discusses the syntactic annotation for the Spoken Dutch Corpus, a Dutch/Flemish cooperatio...
We have compiled a corpus of 80 Dutch texts from expository and persuasive genres, which we annotate...
The Spoken Dutch Corpus (CGN) is a major new resource for research into contemporary spoken Dutch. A...
Spoken language corpora— as used in conversation analytic research, language acquisition studies and...
Speech perception and spoken word recognition are not only affected by what is being said, but also ...
Contains fulltext : 61291.pdf (publisher's version ) (Open Access)26 mei 20044 p
Contains fulltext : 61290.pdf (publisher's version ) (Closed access)[Lisbon, Portu...
The Spoken Dutch Corpus that is currently under construction will constitute a 10-million-word corpu...
The Spoken Dutch Corpus project (1998-2003) is aimed at the development of a corpus of 1,000 hours o...
This chapter discusses the main principles that should be taken into account when choosing an existi...
Based on an analysis of 350 questions and their responses in a corpus of ordinary interactions, this...
We present a lexicon of Dutch Discourse Connectives (DisCoDict). Its content was obtained using a tw...
We present a lexicon of Dutch Discourse Connectives (DisCoDict). Its content was obtained using a tw...
This study set out to investigate how accent placement is pragmatically governed in WH-questions. Ce...
This study set out to investigate how accent placement is pragmatically governed in WH-questions. Ce...
The paper discusses the syntactic annotation for the Spoken Dutch Corpus, a Dutch/Flemish cooperatio...
We have compiled a corpus of 80 Dutch texts from expository and persuasive genres, which we annotate...
The Spoken Dutch Corpus (CGN) is a major new resource for research into contemporary spoken Dutch. A...
Spoken language corpora— as used in conversation analytic research, language acquisition studies and...
Speech perception and spoken word recognition are not only affected by what is being said, but also ...