There is growing interest in building language technologies (LTs) for low-resource languages (LRLs). However, the planning, data collection, and development phases are often flawed, mostly because of the assumption that LRLs are like high-resource languages (HRLs), only smaller in size. In our paper, we first give examples of failed LTs for LRLs and explain the reasons for these failures. Second, we discuss the problems with the data for LRLs. Finally, we provide recommendations for building better LTs for LRLs through insights from sociolinguistics and multilingualism. Our goal is not to solve all problems around LTs for LRLs but to raise awareness about the existing issues, provide recommendations toward possible solut...
© Dr Long Duong. Natural language processing (NLP) aims, broadly speaking, to teach computers to under...
Enormous progress in speech technologies has been achieved over the last two de...
LTs (language technologies) are necessary instruments for all languages, especially for those aiming...
This paper describes a local effort to bridge the gap between computational and documentary linguist...
Application domains such as digital humanities and tools like chatbots involve some form of processin...
Advances in statistical machine learning encourage language-independent approaches to linguistic tec...
And Language Technology for Minority Languages. Minority or “lesser used” languages of the world ar...
Low density languages are typically viewed as those for which few language resources are available. ...
LTs are a necessary instrument for all languages, especially for those aiming at conquering a space ...
This paper discusses the role of low-resource languages in NLP through the lens of different stakeho...
The paper presents a method for parsing low-resource languages with very small...
We provide a systematic review of past studies that use multilingual data for text-to-speech (TTS) o...
The coronavirus (COVID-19) pandemic has dramatically changed lifestyles in much of the world. It for...