In this paper we report on our recent efforts to collect a corpus of spoken lecture material that will enable research directed towards fast, accurate, and easy access to lecture content. Thus far, we have collected a corpus of 270 hours of speech from a variety of undergraduate courses and seminars. We report on an initial analysis of the spontaneous speech phenomena present in these data and the vocabulary usage patterns across three courses. Finally, we examine the perplexities of language models trained on written and spoken materials, and describe an initial recognition experiment on one course.
The aim of this paper is to introduce and describe the EmiBO corpus and present some initial data. E...
In this paper, we describe the RWTH speech recognition system for English lectures developed within...
Lecture listening and note-taking classes are a common component of EAP programmes and the list of l...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
This paper describes the corpus of university lectures that has been recorded in European Portuguese...
With the demand for recorded lectures to be made available as soon as possible, the University of Ca...
This paper explains our developing Corpus of Japanese classroom Lecture speech Contents (henceforth,...
Increasing student and lecturer mobility along with the spread of English as an academic lingua fran...
The lecture is one of the most valuable genres of audiovisual data. However, spoken lectures are dif...
Making and distributing audio recordings of lectures is cheap and technically straightforward, and...
We will demonstrate the MIT Spoken Lecture Processing Server and an accompanying lecture browser tha...
The paper introduces our project on automatic speech recognition (ASR) of Chinese lectures. For a co...
Speech recognition and language analysis of spontaneous speech arising in naturally spoken conversat...