Despite the modern boom in technology, we are still faced with the fact that people write texts without diacritics. There are two main reasons for this. The first, historical reason stems from the past when the use of diacritics was troublesome and people would write text without them. The second one is the speed - typing without diacritics is usually faster. Text without diacritics is easy to understand for people, but for some types of documents, missing diacritics can cause a problem. This is also an issue when computers process such text. In this paper, we propose an algorithm based on word n-grams (a contiguous sequence of n words) that can restore diacritics of text written in the Slovak language. We also compare and evaluate our resu...
Diacritics and punctuation, as well as text structure, may sound a problem with little interest in l...
Abstract. The orthography of many resource-scarce languages includes diacritically marked characters...
The aim of this thesis is to explore the possibilities of using n-gram language models for spellchec...
The purpose of this paper is (1) to make an extensive overview of the field of diacritics restoratio...
Abstract. This paper presents a method for diacritics restoration based on learning mechanisms that ...
In this paper, we describe a method based on statistical machine translation (SMT) that is able to r...
Statistical language models are utilized in many speech processing algorithms, e.g., automatic speec...
Online ISSN: 2335-884X. http://itc.ktu.lt/index.php/ITC/article/view/18066In this research we compar...
The orthography of many resource-scarce languages includes diacritically marked characters. Falling ...
The goal of this thesis is to develop a Java MIDP application for automatic reconstruction of the di...
Diacritics restoration became a ubiquitous task in the Latinalphabet-based English-dominated Interne...
Corpus of texts in 12 languages. For each language, we provide one training, one development and one...
The subject of this thesis is the implementation of an application that lls in accents to a Czech te...
In this paper, the problem of missing diacritic marks in most of Arabic written resources is investi...
Processing simple or complex texts (MIME type - application) often requires automatic recognition of...
Diacritics and punctuation, as well as text structure, may sound a problem with little interest in l...
Abstract. The orthography of many resource-scarce languages includes diacritically marked characters...
The aim of this thesis is to explore the possibilities of using n-gram language models for spellchec...
The purpose of this paper is (1) to make an extensive overview of the field of diacritics restoratio...
Abstract. This paper presents a method for diacritics restoration based on learning mechanisms that ...
In this paper, we describe a method based on statistical machine translation (SMT) that is able to r...
Statistical language models are utilized in many speech processing algorithms, e.g., automatic speec...
Online ISSN: 2335-884X. http://itc.ktu.lt/index.php/ITC/article/view/18066In this research we compar...
The orthography of many resource-scarce languages includes diacritically marked characters. Falling ...
The goal of this thesis is to develop a Java MIDP application for automatic reconstruction of the di...
Diacritics restoration became a ubiquitous task in the Latinalphabet-based English-dominated Interne...
Corpus of texts in 12 languages. For each language, we provide one training, one development and one...
The subject of this thesis is the implementation of an application that lls in accents to a Czech te...
In this paper, the problem of missing diacritic marks in most of Arabic written resources is investi...
Processing simple or complex texts (MIME type - application) often requires automatic recognition of...
Diacritics and punctuation, as well as text structure, may sound a problem with little interest in l...
Abstract. The orthography of many resource-scarce languages includes diacritically marked characters...
The aim of this thesis is to explore the possibilities of using n-gram language models for spellchec...