Dataset for Logical-layout analysis on French Historical Newspapers This is a dataset for training and testing logical-layout analysis and recognition system on French historical documents published between 1900 and 1950. The original data is part of the "Fond régional: Franche Comté", which is curated by Gallica, the digital portal of the Bibliothèque nationale de France (BnF). This dataset is divided into a train and a test set. The train and test datasets have been designed to cover as much as possible the various possible layouts that exist in the "Fond régional: Franche Comté" dataset. To do so, we have divided them into three layout-types: * 1c: documents where the text is displayed in one column, as in books; * 2c: documents whe...
This dataset contains 50 pages of ground truth data for digitized historical newspapers from the Ber...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
This release contains the public domain portion of the dataset used in the paper Page Layout Analysi...
Dataset for Logical-layout analysis on French Historical Newspapers This is a dataset for training...
Dataset for Logical-layout analysis on French historical newspapers This dataset is intended for tr...
In recent years, libraries and archives led important digitisation campaigns that opened the access ...
International audienceBackground. In recent years, libraries and archives led important digitisation...
Background. In recent years, libraries and archives led importantdigitisation campaigns that opened ...
Background. In recent years, libraries and archives led important digitisation campaigns that opened...
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of his...
This paper presents a research dataset of historical newspapers comprising over 500 page images, uni...
This work introduces a practical method for performing logical layout analysis on heterogeneous peri...
International audienceNewspapers are documents made of news item and informative articles. They are ...
The dataset comprises French newspaper pages from 19th and early 20th century with annotated text. T...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
This dataset contains 50 pages of ground truth data for digitized historical newspapers from the Ber...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
This release contains the public domain portion of the dataset used in the paper Page Layout Analysi...
Dataset for Logical-layout analysis on French Historical Newspapers This is a dataset for training...
Dataset for Logical-layout analysis on French historical newspapers This dataset is intended for tr...
In recent years, libraries and archives led important digitisation campaigns that opened the access ...
International audienceBackground. In recent years, libraries and archives led important digitisation...
Background. In recent years, libraries and archives led importantdigitisation campaigns that opened ...
Background. In recent years, libraries and archives led important digitisation campaigns that opened...
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of his...
This paper presents a research dataset of historical newspapers comprising over 500 page images, uni...
This work introduces a practical method for performing logical layout analysis on heterogeneous peri...
International audienceNewspapers are documents made of news item and informative articles. They are ...
The dataset comprises French newspaper pages from 19th and early 20th century with annotated text. T...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
This dataset contains 50 pages of ground truth data for digitized historical newspapers from the Ber...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
This release contains the public domain portion of the dataset used in the paper Page Layout Analysi...