Part 7: Deep Learning - Convolutional ANN. International audience. This work aims at data preparation for OCR systems based on recurrent neural networks. Precisely annotated data are necessary both for training a network and for evaluating OCR methods. Such data can be synthesized, but synthetic data are less realistic than real data. Manual annotation is therefore still needed in many cases, especially for the historical documents we focus on. Although several complex systems for historical document processing exist, to the best of our knowledge a simple annotation tool for OCR data is missing. Therefore, we propose and implement a set of tools utilizing artificial intelligence that simplify the anno...
Historical documents are a valuable source of cultural knowledge and can provide information about p...
This poster was presented at the National eScience Symposium and describes the research project t...
The creation of a high-quality optical character recognition system (OCR) requires a large amount of...
This paper presents an overview of tr...
This work deals with the creation of a system that allows uploading and annotating scans of historic...
The aim of this Master's thesis is to design methods of active learning and to experiment with datas...
The aim of this Master's thesis is to design and implement an OCR system for archival historical doc...
The goal of the thesis project is the development of a solution allowing the extraction of informati...
Preserving historical archival heritage involves not only physical measures to safeguard these valua...
We present an efficient and effective approach to train OCR engines using the Aletheia document anal...
This paper presents an overview of training strategies for optical character recognition of historic...
Automatic analysis of scanned historical documents comprises a wide range of image analysis tasks, w...
The need for accessing information through the web and other kinds of distributed media mak...
In spite of the improvement of commercial Optical Character Recognition (OCR) in recent years,...
The aim of this work is to create a system for historical document classification. The task is spe...