Low-resource languages have few digitized texts, which makes it challenging to build a satisfactory audio corpus or information retrieval services for them. Low-resource languages, especially those spoken exclusively in African countries, lack a well-defined, annotated language corpus, which is a major obstacle for experts trying to provide a comprehensive text processing system. In this study, I identified best practices for producing and collecting data for such zero/low-resource languages by means of crowd-sourcing. For the purpose of this study, research articles (n=260) were extracted from Google Scholar, Microsoft Academic, and ScienceDirect. From these articles, only 60 of them, which met the inclusion criteria ...
Thesis (M.Ing. (Electrical Engineering))--North-West University, Potchefstroom Campus, 2012. As build...
We present a survey covering the state of the art in low-resource machine tran...
We describe the use of text data scraped from the web to augment language models for Automatic Speec...
Most speech and language technologies are trained with massive amounts of spee...
For many of the 700 million illiterate people around the world, speech recognition technology could ...
Language resources are important for those working on computational methods to analyse and study lan...
Application domains such as digital humanities and tools like chatbots involve some form of processin...
The paper demonstrates the feasibility and scalability of participatory research, with a case study ...
Oral corpora for linguistic inquiry are frequently built based on the content ...
Languages are fundamental to human communication and serve as a means to express social and cultural...
We describe the integration of several tools to enable the end-to-end development of an Automatic Sp...
Recently there has been interest in the approaches for training speech recognition systems for langu...
PhD (Linguistics and Literary Theory), North-West University, Potchefstroom Campus, 2014. The developm...
Most of the world’s languages are under-resourced, and most under-resourced languages lack a writing...
Scarcity of resources in under resourced languages may leave these languages behind in race ...