This paper presents a systematic literature review of image datasets for document image analysis, focusing on historical documents, such as handwritten manuscripts and early prints. Finding appropriate datasets for historical document analysis is a crucial prerequisite to facilitate research using different machine learning algorithms. However, because of the very large variety of the actual data (e.g., scripts, tasks, dates, support systems, and amount of deterioration), the different formats for data and label representation, and the different evaluation processes and benchmarks, finding appropriate datasets is a difficult task. This work fills this gap, presenting a meta-study on existing datasets. After a systematic selection process (a...
Digitization has changed history research. The materials are available, and online archives make it ...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
Recent advances in object detection facilitated by deep learning have led to numerous solutions in a...
This paper presents a systematic literature review of image datasets for document image analysis, fo...
Scanned documents are a rich source of various information that can be processed utilizing a docume...
In this paper, we present a pipeline for image extraction from historical documents using foundation...
Historical documents are a valuable source of cultural knowledge and can provide information about p...
This dataset contains the training and test set used in the ICDAR 2019 Competition on Image Retrieva...
Context. Image searching in historical handwritten documents is a challenging problem in computer vi...
Automatic analysis of scanned historical documents comprises a wide range of image analysis tasks, w...
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of his...
In 2022, it is a common place that digital historical newspapers (DHN) have become increasingly avai...
This paper presents a deep learning approach for image retrieval and pattern spotting in digital col...
In recent years, libraries and archives all around the world have increased their efforts to digitiz...
International audienceThe use of different texture-based methods is pervasive in different sub-field...
Digitization has changed history research. The materials are available, and online archives make it ...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
Recent advances in object detection facilitated by deep learning have led to numerous solutions in a...
This paper presents a systematic literature review of image datasets for document image analysis, fo...
Scanned documents are a rich source of various information that can be processed utilizing a docume...
In this paper, we present a pipeline for image extraction from historical documents using foundation...
Historical documents are a valuable source of cultural knowledge and can provide information about p...
This dataset contains the training and test set used in the ICDAR 2019 Competition on Image Retrieva...
Context. Image searching in historical handwritten documents is a challenging problem in computer vi...
Automatic analysis of scanned historical documents comprises a wide range of image analysis tasks, w...
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of his...
In 2022, it is a common place that digital historical newspapers (DHN) have become increasingly avai...
This paper presents a deep learning approach for image retrieval and pattern spotting in digital col...
In recent years, libraries and archives all around the world have increased their efforts to digitiz...
International audienceThe use of different texture-based methods is pervasive in different sub-field...
Digitization has changed history research. The materials are available, and online archives make it ...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
Recent advances in object detection facilitated by deep learning have led to numerous solutions in a...