The purpose of this paper is (1) to give an extensive overview of the field of diacritics restoration in Romanian texts; (2) to present our own experiments and results and to promote the word-based Viterbi algorithm as a more accurate solution, one already used in a free web-based TTS implementation; and (3) to announce the production of a new, high-quality, high-volume corpus of Romanian texts, twice the size of the Romanian-language subset of the JRC-Acquis.
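To make the idea of word-based Viterbi restoration concrete, the following is a minimal sketch, not the system described in the paper. It assumes a bigram word model with add-alpha smoothing trained on a three-sentence toy corpus. The class name `WordViterbiRestorer`, the corpus `TOY_CORPUS`, and the smoothing choice are all hypothetical and chosen only for illustration. Each diacritic-free input word is mapped to every known word with the same stripped form, and Viterbi decoding picks the candidate sequence with the highest bigram score.

```python
import math
import unicodedata
from collections import defaultdict


def strip_diacritics(word):
    """Remove combining marks, e.g. 'fată' -> 'fata'."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Hypothetical toy training data; a real system would use a large corpus.
TOY_CORPUS = [
    "o fată merge la școală",
    "fata merge în față",
    "o fată stă în față",
]


class WordViterbiRestorer:
    """Illustrative bigram word model decoded with the Viterbi algorithm."""

    def __init__(self, sentences, alpha=1.0):
        self.alpha = alpha                   # add-alpha smoothing (assumption)
        self.bigram = defaultdict(int)       # counts of (previous word, word)
        self.context = defaultdict(int)      # counts of previous word
        self.candidates = defaultdict(set)   # stripped form -> known spellings
        vocab = set()
        for sentence in sentences:
            prev = "<s>"
            for word in sentence.split():
                self.bigram[(prev, word)] += 1
                self.context[prev] += 1
                self.candidates[strip_diacritics(word)].add(word)
                vocab.add(word)
                prev = word
        self.vocab_size = max(len(vocab), 1)

    def log_prob(self, prev, word):
        num = self.bigram.get((prev, word), 0) + self.alpha
        den = self.context.get(prev, 0) + self.alpha * self.vocab_size
        return math.log(num / den)

    def restore(self, text):
        tokens = text.split()
        if not tokens:
            return ""
        # Each column maps a candidate word to (best path score, backpointer).
        columns = [{"<s>": (0.0, None)}]
        for token in tokens:
            key = strip_diacritics(token)
            # Unknown words are passed through unchanged.
            options = sorted(self.candidates.get(key, {token}))
            column = {}
            for word in options:
                column[word] = max(
                    (score + self.log_prob(prev, word), prev)
                    for prev, (score, _) in columns[-1].items()
                )
            columns.append(column)
        # Backtrack from the best-scoring word in the last column.
        word = max(columns[-1], key=lambda w: columns[-1][w][0])
        path = []
        for column in reversed(columns[1:]):
            path.append(word)
            word = column[word][1]
        return " ".join(reversed(path))


model = WordViterbiRestorer(TOY_CORPUS)
print(model.restore("o fata sta in fata"))  # prints "o fată stă în față"
```

In this toy setting the same stripped form `fata` is restored as `fată` after `o` and as `față` after `în`, which is the kind of context-dependent choice that a word-level sequence model can make and a per-word lookup cannot.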
Online ISSN: 2335-884X. http://itc.ktu.lt/index.php/ITC/article/view/18066 In this research we compar...
This paper presents the project “The first Romanian bilingual dictionaries (17th century). Digitally...
The paper argues in favour of an electronic form of the thesaurus dictionary of the Romanian languag...
Statistical language models are utilized in many speech processing algorithms, e.g., automatic speec...
Despite the modern boom in technology, we are still faced with the fact that people write texts with...
This study addresses the lack of general and domain-specific text resources for Romanian Automatic S...
Proceedings of the First PhD Symposium on Sustainable Ultrascale Computing Systems (NESUS PhD 2016)...
This paper presents a first step towards constructing the diachronic Romanian morphology. First, the...
The orthography of many resource-scarce languages includes diacritically marked characters. Falling ...
This work represents a first step in the direction of reconstructing a diachronic morphology for Rom...
Abstract. This paper presents a method for diacritics restoration based on learning mechanisms that ...
In this paper, we describe a method based on statistical machine translation (SMT) that is able to r...
Abstract. On the principle of adapting the communication to the general evolution of the society, the ...
In the process of constructing an academic edition for old and pre-modern texts, although they thoro...
This study investigates the possibility of using statistical machine translation to create domain-sp...