This master’s thesis describes the work in creating a customised optical character recognition (OCR) application; intended for use in digitisation of theses submitted to the Uppsala University in the 18th and 19th centuries. For this purpose, an open source software called Gamera has been used for recognition and classification of the characters in the documents. The software provides specific algorithms for analysis of heritage documents and is designed to be used as a tool for creating domain-specific (i.e. customised) recognition applications. By using the Gamera classifier training interface, classifier data was created which reflects the characters in the particular theses. The data can then be used in automatic recognition of ‘new’ ch...
Although OCR (Optical Character Recognition) is a topic which has been a subject of research since t...
Abstract — Optical Character Recognition (OCR) is the mechanical or electronic translation of image...
Attēli mūsdienās bieži tiek izmantoti, lai attēlotu vai pārsūtītu tekstuālu informāciju, bet šo info...
This master’s thesis describes the work in creating a customised optical character recognition (OCR)...
This master’s thesis describes the work in creating a customised optical character recognition (OCR)...
As our world enters an electronic era, it has become important to be able to quickly and easily pres...
This paper presents a new toolkit for the creation of customized structured document recognition app...
This paper describes the Gamera framework for building custom document recognition systems. This ope...
This paper presents a new toolkit for the creation of customized structured document recognition app...
In this thesis we work on recognizing the text in the book ``Rerum Frisicarum Historia'' by Ubbo Emm...
In this paper a complete OCR methodology for recognizing historical documents, either printed or han...
This report documents the algorithm of Optical Character Recognition, to the purpose of detecting an...
In this paper a complete OCR methodology for recognizing historical documents, either printed or han...
Purpose – The authors investigate optical character recognition (OCR) technology and discuss its imp...
As an effort to improve accessibility to historical documents, digitization of historical archives h...
Although OCR (Optical Character Recognition) is a topic which has been a subject of research since t...
Abstract — Optical Character Recognition (OCR) is the mechanical or electronic translation of image...
Attēli mūsdienās bieži tiek izmantoti, lai attēlotu vai pārsūtītu tekstuālu informāciju, bet šo info...
This master’s thesis describes the work in creating a customised optical character recognition (OCR)...
This master’s thesis describes the work in creating a customised optical character recognition (OCR)...
As our world enters an electronic era, it has become important to be able to quickly and easily pres...
This paper presents a new toolkit for the creation of customized structured document recognition app...
This paper describes the Gamera framework for building custom document recognition systems. This ope...
This paper presents a new toolkit for the creation of customized structured document recognition app...
In this thesis we work on recognizing the text in the book ``Rerum Frisicarum Historia'' by Ubbo Emm...
In this paper a complete OCR methodology for recognizing historical documents, either printed or han...
This report documents the algorithm of Optical Character Recognition, to the purpose of detecting an...
In this paper a complete OCR methodology for recognizing historical documents, either printed or han...
Purpose – The authors investigate optical character recognition (OCR) technology and discuss its imp...
As an effort to improve accessibility to historical documents, digitization of historical archives h...
Although OCR (Optical Character Recognition) is a topic which has been a subject of research since t...
Abstract — Optical Character Recognition (OCR) is the mechanical or electronic translation of image...
Attēli mūsdienās bieži tiek izmantoti, lai attēlotu vai pārsūtītu tekstuālu informāciju, bet šo info...