Automatic transcription of historical handwritten documents is a challenging research problem, requiring in general expensive transcriptions from expert paleographers. In Codice Ratio is designed to be an end-to-end architecture requiring instead limited labeling effort, whose aim is the automatic transcription of a portion of the Vatican Secret Archives (one of the largest historical libraries in the world). In this paper, we describe in particular the design of our OCR component for Latin characters. To this end, we first annotated a large corpus of Latin characters with a custom crowdsourcing platform. Leveraging over recent progresses in deep learning, we designed and trained a deep convolutional network achieving an overall accuracy of...
Historians who aspire to explore texts with the help of the computer are more than often confronted ...
Historical manuscripts are the main source of information about past. In recent years, digitization ...
Preserving historical archival heritage involves not only physical measures to safeguard these valua...
Automatic transcription of historical handwritten documents is a challenging research problem, requi...
In Codice Ratio is a research project to study techniques for analyzing the contents of historical d...
In Codice Ratio is a research project to study tools and techniques for analyzing the contents of hi...
Huge amounts of handwritten historical documents are being published by digital libraries world wide...
Our project, In Codice Ratio, is an interdisciplinary research initiative for analyzing content of ...
While digital libraries based on page images and automat-ically generated text have made possible ma...
Handwritten materials are increasingly being digitized and made available for scholarly analysis and...
Crowdsourcing approaches for post-correction of OCR output (Optical Character Recognition) have been...
This article presents the results of a series of experiments with open-source neural network OCR sof...
Together with critical editions and translations, commentaries are one of the main genres of publica...
Automatic transcription of handwritten texts has made important progress in the recent years. This i...
GT4HistOCR contains ground truth for research in Optical Character Recognition (OCR) technology appl...
Historians who aspire to explore texts with the help of the computer are more than often confronted ...
Historical manuscripts are the main source of information about past. In recent years, digitization ...
Preserving historical archival heritage involves not only physical measures to safeguard these valua...
Automatic transcription of historical handwritten documents is a challenging research problem, requi...
In Codice Ratio is a research project to study techniques for analyzing the contents of historical d...
In Codice Ratio is a research project to study tools and techniques for analyzing the contents of hi...
Huge amounts of handwritten historical documents are being published by digital libraries world wide...
Our project, In Codice Ratio, is an interdisciplinary research initiative for analyzing content of ...
While digital libraries based on page images and automat-ically generated text have made possible ma...
Handwritten materials are increasingly being digitized and made available for scholarly analysis and...
Crowdsourcing approaches for post-correction of OCR output (Optical Character Recognition) have been...
This article presents the results of a series of experiments with open-source neural network OCR sof...
Together with critical editions and translations, commentaries are one of the main genres of publica...
Automatic transcription of handwritten texts has made important progress in the recent years. This i...
GT4HistOCR contains ground truth for research in Optical Character Recognition (OCR) technology appl...
Historians who aspire to explore texts with the help of the computer are more than often confronted ...
Historical manuscripts are the main source of information about past. In recent years, digitization ...
Preserving historical archival heritage involves not only physical measures to safeguard these valua...