The main idea of this dataset is to analyse the impact of training data: how much training data specific to the document being transcribed is necessary?
General data: a collection of heterogeneous documents for training an initial system. For each text line there is an image file of that line, a file with the ground-truth text, and an information file containing an automatically generated surrounding polygon.
Specific data: documents related to the test data. For the specific systems, only the images of the train list may be used. The files are of the same type as in the general data.
Test data: only the images and the information files.
More Information, some published results and...
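The per-line layout described above (one image, one ground-truth text file, one information file per text line) could be indexed roughly as follows. This is only a sketch: the file extensions `.jpg`, `.txt` and `.info`, the shared-stem naming, and the names `LineSample` and `collect_line_samples` are assumptions made for illustration, not details given in the dataset description.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import List, Optional

# Assumed extensions; the dataset description does not name them.
IMAGE_EXT = ".jpg"
TEXT_EXT = ".txt"
INFO_EXT = ".info"


@dataclass
class LineSample:
    stem: str
    image: Path
    text: Optional[Path]  # None when no ground truth is shipped (e.g. test data)
    info: Optional[Path]  # None when no polygon/information file is found


def collect_line_samples(root: Path) -> List[LineSample]:
    """Pair each line image under `root` with its sibling text and info files."""
    samples = []
    for image in sorted(root.rglob("*" + IMAGE_EXT)):
        # Hypothetical convention: sibling files share the image's stem.
        text = image.with_suffix(TEXT_EXT)
        info = image.with_suffix(INFO_EXT)
        samples.append(
            LineSample(
                stem=image.stem,
                image=image,
                text=text if text.exists() else None,
                info=info if info.exists() else None,
            )
        )
    return samples
```

Under these assumptions, a test-data directory would yield samples whose `text` field is `None`, which makes it easy to separate lines usable for training from lines that can only be transcribed.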
This dataset arises from the READ project (Horizon 2020). The dataset consists of a subset of docum...
This dataset contains the training, evaluation, and test set for the ICDAR 2019 Competition on Basel...
Web includes digital libraries and billions of text documents. A fast and simple search through this...
Train-A: Dataset of pages with manually revised baselines and the corresponding transcripts associat...
Train-A Dataset of pages with manually revised baselines and the corresponding transcripts associate...
Train-B Dataset. Dataset of pages without any layout or text line information. The corresponding t...
This dataset comprises the dataset used for the ICDAR 2015 Competition on Handwritten Text Recognit...
This system has been trained using the first 40 pages of Train-A (https://zenodo.org/record/43980...
© 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
This dataset contains the training and test set for the ICDAR 2017 Competition on Baseline Detection...
Test-B2: a batch of page images annotated with the geometry of regions where to...
Dataset for Paper: Text Line Detection and Recognition of Greek Polytonic Documents, P. Kaddas, B. G...
This data set contains sentences belonging to either of two classes: Transcripts of spoken (informal...