AbstractThe official languages of South Africa can still be classified as under-resourced with respect to the speech resources that are required for technology development. Harvesting speech data from existing sources is one means to create additional resources. The aim of the study reported on in this paper was to improve the harvesting and transcription accuracy of a corpus derived from parliamentary data. This aim was achieved by improving on the text normalisation process and pronunciation modelling as well as by iteratively training more accurate in-domain acoustic models. In this manner, more data could be harvested with higher confidence than using baseline pronunciation dictionaries and out-of-domain speech data
We investigate the impact of recent advances in speech recognition techniques for under-resourced l...
Abstract—We present progress towards automated Lecture Transcription (LT) in resource scarce environ...
We present progress towards automated Lecture Transcription (LT) in resource scarce environments. Ou...
South Africa has eleven official languages, ten of which are considered “resource-scarce”. For these...
Thesis (M. Ing. (Computer and Electronical Engineering))--North-West University, Potchefstroom Campu...
This work was supported by the Department of Arts and Culture.The NCHLT speech corpus contains wide-...
Abstract—Smartphones provide an efficient means for the collection of speech data; however, the qual...
In this contribution, the design, collection, annotation and planned distribution of a new spoken la...
The accent with which words are spoken can have a strong effect on the performance of a speech recog...
© 2017. The Author(s). For purposes of automated speech recognition in under-resourced environments,...
Thesis (M.Ing. (Electrical Engineering))--North-West University, Potchefstroom Campus, 2012.As build...
Thesis (M.Ing. (Computer Engineering))--North-West University, Potchefstroom Campus, 2009.The pronun...
We present the design and development of a South African directory enquiries corpus. It contains aud...
This paper reflects on the recently completed African Speech Technology (AST) Project. The AST Proje...
This paper describes past, ongoing and planned work on the collection and transcription of spoken la...
We investigate the impact of recent advances in speech recognition techniques for under-resourced l...
Abstract—We present progress towards automated Lecture Transcription (LT) in resource scarce environ...
We present progress towards automated Lecture Transcription (LT) in resource scarce environments. Ou...
South Africa has eleven official languages, ten of which are considered “resource-scarce”. For these...
Thesis (M. Ing. (Computer and Electronical Engineering))--North-West University, Potchefstroom Campu...
This work was supported by the Department of Arts and Culture.The NCHLT speech corpus contains wide-...
Abstract—Smartphones provide an efficient means for the collection of speech data; however, the qual...
In this contribution, the design, collection, annotation and planned distribution of a new spoken la...
The accent with which words are spoken can have a strong effect on the performance of a speech recog...
© 2017. The Author(s). For purposes of automated speech recognition in under-resourced environments,...
Thesis (M.Ing. (Electrical Engineering))--North-West University, Potchefstroom Campus, 2012.As build...
Thesis (M.Ing. (Computer Engineering))--North-West University, Potchefstroom Campus, 2009.The pronun...
We present the design and development of a South African directory enquiries corpus. It contains aud...
This paper reflects on the recently completed African Speech Technology (AST) Project. The AST Proje...
This paper describes past, ongoing and planned work on the collection and transcription of spoken la...
We investigate the impact of recent advances in speech recognition techniques for under-resourced l...
Abstract—We present progress towards automated Lecture Transcription (LT) in resource scarce environ...
We present progress towards automated Lecture Transcription (LT) in resource scarce environments. Ou...