Abstract—We present progress towards automated Lecture Transcription (LT) in resource scarce environments. Our devel-opment has focused on the transcription of lectures in Afrikaans from two faculties at North-West University. A bootstrapping procedure is followed to filter and select well-aligned segments of speech. These segments are then used to train acoustic models. Initial work towards language modeling for LT in a resource-scarce environment is also presented; manual lecture transcriptions are combined with text mined from other sources such as study guides to train language models. Interpolation results indicate that study guides are a useful resource for language modeling, whereas general text (obtained from a pub-lisher of Afrikaa...
AbstractThe official languages of South Africa can still be classified as under-resourced with respe...
Spoken dialog systems are slowly becoming and integral part of the human experience due to their var...
Generating accurate word-level transcripts of recorded speech for language documentation is difficul...
We present progress towards automated Lecture Transcription (LT) in resource scarce environments. Ou...
We present a study where standard semi-supervised training methods are applied in a resource-scarce ...
MSc (Computer Science), North-West University, Vaal Triangle Campus, 2014Classroom note taking is a ...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Automatic transcription of lectures is becoming an important task. Possible applications can be foun...
Recording university lectures through lecture capture systems is increasingly common. However, a sin...
Recording university lectures through lecture capture systems is increasingly common, generating lar...
This paper describes recent efforts at Linguistic Data Consortium at the University of Pennsylvania ...
Contains fulltext : 27415.pdf (publisher's version ) (Open Access)Each time a word...
South Africa has eleven official languages, ten of which are considered “resource-scarce”. For these...
AbstractThe official languages of South Africa can still be classified as under-resourced with respe...
Spoken dialog systems are slowly becoming and integral part of the human experience due to their var...
Generating accurate word-level transcripts of recorded speech for language documentation is difficul...
We present progress towards automated Lecture Transcription (LT) in resource scarce environments. Ou...
We present a study where standard semi-supervised training methods are applied in a resource-scarce ...
MSc (Computer Science), North-West University, Vaal Triangle Campus, 2014Classroom note taking is a ...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work...
Automatic transcription of lectures is becoming an important task. Possible applications can be foun...
Recording university lectures through lecture capture systems is increasingly common. However, a sin...
Recording university lectures through lecture capture systems is increasingly common, generating lar...
This paper describes recent efforts at Linguistic Data Consortium at the University of Pennsylvania ...
Contains fulltext : 27415.pdf (publisher's version ) (Open Access)Each time a word...
South Africa has eleven official languages, ten of which are considered “resource-scarce”. For these...
AbstractThe official languages of South Africa can still be classified as under-resourced with respe...
Spoken dialog systems are slowly becoming and integral part of the human experience due to their var...
Generating accurate word-level transcripts of recorded speech for language documentation is difficul...