Across the world's languages and cultures, most writing systems predate the use of computers. In the early years of ICT, standards and protocols for encoding and rendering the majority of the world's writing systems were not in place. The opportunity to deploy less commonly used orthographies in cross-platform digital contexts has steadily increased since Unicode became the most widely used encoding on the web in late 2007 (Davis, 2008). But what happens to resources that were developed before Unicode standards became widespread? While many tools have been created to address this problem and other issues related to transliteration and character-level substitutions, this paper describes the process undertaken for the Indigenous and endange...
Writing technology is a central issue for Human Language Technology (HLT) both in terms of theory an...
Much of the text data that exists in many languages is locked away in nondigitized books and documen...
A core concern for E-MELD is the need for a common standard for the digitalization of linguistic dat...
Much electronic text in the languages of South Asia has been published on the Internet. However, whi...
This text is a practical guide for linguists and programmers who work with data in multilingual co...
The world of character encoding in 2010 has changed significantly since TEI began in 1987, thanks to...
A universal character encoding is required to produce software that can be localized for any languag...
The Unicode Standard is the de facto “universal” standard for character-encoding in nearly all moder...
This paper describes the rule-based approach towards the development of an Oriya Font Converter that...
This chapter first briefly reviews the history of character encoding. Following from this is a discu...
For 70+ years SIL International has been working to study, develop and document the world’s lesser-k...
The World Wide Web and your computer can already display text in the scripts of most South Asian lan...