This article discusses questions concerning the creation, annotation and sharing of spoken language corpora. We use the Hamburg Map Task Corpus (HAMATAC), a small corpus in which advanced learners of German were recorded solving a map task, as an example to illustrate our main points. We first give an overview of the corpus creation and annotation process, including recording, metadata documentation, transcription and semi-automatic annotation of the data. We then discuss the manual annotation of disfluencies as an example case that reveals many of the typical and challenging problems for data reuse, in particular the reliability of interpretative annotations.
Although automatic analysis and computer-aided annotation tools are being deve...
This presentation discusses the methodological issues involved in finding a sa...
This paper discusses the technological and methodological challenges in creating and sharing HAMATAC...
Audio and two video recordings of map tasks with adult L2 users of German and one L1 speaker. The sp...
Decisions at the outset of preparing a learner corpus are of crucial importance for how the corpus c...
This article provides an overview of methodological and technical issues that arise in the collectio...
This paper is about the workflow for construction and dissemination of FOLK (Forschungs - und Lehrko...
The research project “German Today” aims to determine the amount of regional variation in (near-)sta...
This article presents the corpus REDEWIEDERGABE, a German-language historical corpus with detailed a...