We present progress towards automated Lecture Transcription (LT) in resource scarce environments. Our development has focused on the transcription of lectures in Afrikaans from two faculties at North-West University. A bootstrapping procedure is followed to filter and select well-aligned segments of speech. These segments are then used to train acoustic models. Initial work towards language modeling for LT in a resource-scarce environment is also presented; manual lecture transcriptions are combined with text mined from other sources such as study guides to train language models. Interpolation results indicate that study guides are a useful resource for language modeling, whereas general text (obtained from a publisher of Afrikaans books) i...
Generating accurate word-level transcripts of recorded speech for language documentation is difficul...
South Africa has eleven official languages, ten of which are considered “resource-scarce”. For these...
International audienceWe propose a novel transcription workflow which combines spoken term detection...
Abstract—We present progress towards automated Lecture Transcription (LT) in resource scarce environ...
We present a study where standard semi-supervised training methods are applied in a resource-scarce ...
MSc (Computer Science), North-West University, Vaal Triangle Campus, 2014Classroom note taking is a ...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Recording university lectures through lecture capture systems is increasingly common. However, a sin...
Automatic transcription of lectures is becoming an important task. Possible applications can be foun...
Recording university lectures through lecture capture systems is increasingly common, generating lar...
This paper describes recent efforts at Linguistic Data Consortium at the University of Pennsylvania ...
Contains fulltext : 27415.pdf (publisher's version ) (Open Access)Each time a word...
Webcasts are an emerging technology enabled by the expanding availability and capacity of the World ...
Generating accurate word-level transcripts of recorded speech for language documentation is difficul...
South Africa has eleven official languages, ten of which are considered “resource-scarce”. For these...
International audienceWe propose a novel transcription workflow which combines spoken term detection...
Abstract—We present progress towards automated Lecture Transcription (LT) in resource scarce environ...
We present a study where standard semi-supervised training methods are applied in a resource-scarce ...
MSc (Computer Science), North-West University, Vaal Triangle Campus, 2014Classroom note taking is a ...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Recording university lectures through lecture capture systems is increasingly common. However, a sin...
Automatic transcription of lectures is becoming an important task. Possible applications can be foun...
Recording university lectures through lecture capture systems is increasingly common, generating lar...
This paper describes recent efforts at Linguistic Data Consortium at the University of Pennsylvania ...
Contains fulltext : 27415.pdf (publisher's version ) (Open Access)Each time a word...
Webcasts are an emerging technology enabled by the expanding availability and capacity of the World ...
Generating accurate word-level transcripts of recorded speech for language documentation is difficul...
South Africa has eleven official languages, ten of which are considered “resource-scarce”. For these...
International audienceWe propose a novel transcription workflow which combines spoken term detection...