This object contains the dataset and Python code used for the paper: S. Brenner and R. Sablatnig. On the Use of Artificially Degraded Manuscripts for Quality Assessment of Readability Enhancement Methods. Accepted for OAGM Workshop 2019, Steyr, Austria. The dataset is a modified subset of the UCL Multispectral Processed Images of Parchment Damage Dataset (DOI: 10.14324/000.ds.1469099). The accompanying code documents how the modified version was created and how the evaluations described in the paper were performed.
Iterating with new and improved OCR solutions enforces decision making when it comes to targeting th...
Is an average OCR quality of 70% enough for my study? What OCR quality should we ask from external s...
This package contains the dataset of the manuscript "An Empirical Study on the Fault-Inducing Effect...
The data presented here is a set of 2,800 multispectral images of an actual parchment, taken before ...
The huge amount of degraded documents stored in libraries and archives around the world needs automa...
This dataset is used in the experiments in the paper "A First Look at Fairness of Automatic Code Rev...
Dataset for the paper: "A System for Processing and Recognition of Greek Byzantine and Post-Byzantin...
This is just a preview containing data for 5 test images (out of 250) for review purposes; the full ...
This repository contains the dataset of the manuscript: "An Empirical Study on the Usage and Availa...
Over the past years, considerable effort has been put into digitising library collections. As part o...
In this paper, a new dataset, called Multi-distortion Historical Document Image Database (MHDID), to...
This dataset was used in the context of the article Ground-truth Free Evaluation of HTR on Old Frenc...
A reproduction package for the paper "Data Quality for Software Vulnerability Datasets"
The dataset contains data for testing the Scholexplorer API (https://scholexplorer.openaire.eu/) bas...
In scientific computing and data science, computer programs employing mathematical and statistical m...