Abstract. This paper presents a method for diacritics restoration based on learning mechanisms that act at the letter level. To our knowledge this technique is new, and we compare it with the well-known techniques for diacritics restoration that learn from words. Our method is particularly useful for languages that lack large electronic dictionaries and where means for generalization beyond words are required. Accuracies of over 99% at the letter level are reported.
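As a rough illustration of what learning at the letter level can look like, the sketch below treats each letter as a classification instance whose features are the surrounding undiacritized letters. It is not the method of the paper above, whose learner and features are not given in this abstract. The class name `LetterRestorer`, the window size, the padding symbol and the back-off scheme are all assumptions made for the example.

```python
import unicodedata
from collections import Counter, defaultdict

PAD = "_"  # hypothetical padding symbol for positions outside the text


def strip_char(ch):
    """Return the base letter of one character, e.g. 'ă' -> 'a'."""
    decomposed = unicodedata.normalize("NFD", ch)
    base = "".join(c for c in decomposed if not unicodedata.combining(c))
    # Keep the original character if stripping would change the length.
    return base if len(base) == 1 else ch


def strip_diacritics(text):
    """Length-preserving removal of diacritics from NFC-normalized text."""
    text = unicodedata.normalize("NFC", text)
    return "".join(strip_char(ch) for ch in text)


def window(plain, i, w):
    """Undiacritized letters from i-w to i+w, padded at the edges."""
    padded = PAD * w + plain + PAD * w
    return padded[i:i + 2 * w + 1]


class LetterRestorer:
    """Counts which diacritized letter follows each letter context (assumed design)."""

    def __init__(self, max_window=3):
        self.max_window = max_window
        # One table per window size: context string -> Counter of target letters.
        self.tables = [defaultdict(Counter) for _ in range(max_window + 1)]

    def fit(self, text):
        text = unicodedata.normalize("NFC", text)
        plain = strip_diacritics(text)
        for i, target in enumerate(text):
            for w in range(self.max_window + 1):
                self.tables[w][window(plain, i, w)][target] += 1
        return self

    def restore(self, plain):
        out = []
        for i, ch in enumerate(plain):
            guess = ch  # fall back to the input letter if nothing matches
            # Back off from the widest context to the letter alone.
            for w in range(self.max_window, -1, -1):
                seen = self.tables[w].get(window(plain, i, w))
                if seen:
                    guess = seen.most_common(1)[0][0]
                    break
            out.append(guess)
        return "".join(out)
```

Because the model only ever sees letters and their neighbours, it needs no dictionary and can make a guess for words that never occurred in training, which is the kind of generalization beyond words that the abstract refers to. A usage sketch, with a made-up Romanian training sentence: `LetterRestorer().fit("și el a fost în țară").restore("si el a fost in tara")`.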
The purpose of this paper is (1) to make an extensive overview of the field of diacritics restoratio...
Diacritics and punctuation, as well as text structure, may sound like a problem of little interest in l...
Igbo is a low-resource African language with orthographic and tonal diacritics, which capture distin...
This paper discusses letter-level learning for language-independent diacritics restoration.
The orthography of many resource-scarce languages includes diacritically marked characters. Falling ...
Corpus of texts in 12 languages. For each language, we provide one training, one development and one...
Statistical language models are utilized in many speech processing algorithms, e.g., automatic speec...
Online ISSN: 2335-884X. http://itc.ktu.lt/index.php/ITC/article/view/18066 In this research we compar...
Despite the modern boom in technology, we are still faced with the fact that people write texts with...
Diacritic Restoration is a necessity in the processing of languages with Latin-based scripts that uti...
Abstract. The orthography of many resource-scarce languages includes diacritically marked characters...
Approaches to restoring diacritics have for the most part relied either on the letter (grapheme) or the space-deli...
In this paper, we focus on two important problems of social media text normalization, namely: vowel...
Diacritics restoration became a ubiquitous task in the Latin-alphabet-based, English-dominated Interne...
When two orthographically similar words are displayed using rapid serial visual presentation (RSVP),...