Ground truth manually produced within the scope of the CGPG project (Calfa GREgORI Patrologia Graeca), led by Jean-Marie Auwers (UCLouvain), that aim to OCRize the remaining non-digital versions of the Patrologia Graeca volumes. This dataset compiles annotations from 2021 to 2022, and has been used for the Programming Historian online lesson "Transcription automatisée de graphies non latines" (link to come). The dataset contains a set of 100 images from the Patrologia Graeca, with their corresponding pageXML files. Annotation has been performed with the Calfa Vision platform. Different level of annotations are proposed to overcome two tasks: (task1) detection of text regions in Greek (annotations at the region level only) and (task 2) rec...
The first post-WWII years in Greece were devastating. After a brutal Nazi occupation, the Greek Civi...
In this thesis we work on recognizing the text in the book ``Rerum Frisicarum Historia'' by Ubbo Emm...
Περιέχει τη περίληψηIn this paper we propose a novel OCR system for Greek printed early books, combi...
GT4HistOCR contains ground truth for research in Optical Character Recognition (OCR) technology appl...
While digital libraries based on page images and automat-ically generated text have made possible ma...
Dataset for the paper: "A System for Processing and Recognition of Greek Byzantine and Post-Byzantin...
Dataset for Paper: Text Line Detection and Recognition of Greek Polytonic Documents, P. Kaddas, B. G...
These are supplementary materials for an open dataset of scanned images and OCR texts from 19th and ...
© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
The dataset originates from a Greek handwritten codex that dates from around 1500-1530. This is the ...
Together with critical editions and translations, commentaries are one of the main genres of publica...
Automatic transcription of historical handwritten documents is a challenging research problem, requi...
Gloria Mugelli, Giulia Re, Andrea Taddei & Federico Boschetti describe the 'EphoriaEDU' system, a re...
Ground truth data (png and xml files) for a an OCR model. Will be continually updated. Originally t...
Line and groundtrouth pairs generated from human-edited texts, as well as classifiers generated from...
The first post-WWII years in Greece were devastating. After a brutal Nazi occupation, the Greek Civi...
In this thesis we work on recognizing the text in the book ``Rerum Frisicarum Historia'' by Ubbo Emm...
Περιέχει τη περίληψηIn this paper we propose a novel OCR system for Greek printed early books, combi...
GT4HistOCR contains ground truth for research in Optical Character Recognition (OCR) technology appl...
While digital libraries based on page images and automat-ically generated text have made possible ma...
Dataset for the paper: "A System for Processing and Recognition of Greek Byzantine and Post-Byzantin...
Dataset for Paper: Text Line Detection and Recognition of Greek Polytonic Documents, P. Kaddas, B. G...
These are supplementary materials for an open dataset of scanned images and OCR texts from 19th and ...
© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
The dataset originates from a Greek handwritten codex that dates from around 1500-1530. This is the ...
Together with critical editions and translations, commentaries are one of the main genres of publica...
Automatic transcription of historical handwritten documents is a challenging research problem, requi...
Gloria Mugelli, Giulia Re, Andrea Taddei & Federico Boschetti describe the 'EphoriaEDU' system, a re...
Ground truth data (png and xml files) for a an OCR model. Will be continually updated. Originally t...
Line and groundtrouth pairs generated from human-edited texts, as well as classifiers generated from...
The first post-WWII years in Greece were devastating. After a brutal Nazi occupation, the Greek Civi...
In this thesis we work on recognizing the text in the book ``Rerum Frisicarum Historia'' by Ubbo Emm...
Περιέχει τη περίληψηIn this paper we propose a novel OCR system for Greek printed early books, combi...