Background. In recent years, libraries and archives led importantdigitisation campaigns that opened the access to vast collections of historicaldocuments. While such documents are often available as XML ALTO documents, theylack information about their logical structure. In this paper, we address theproblem of Logical Layout Analysis applied to historical documents in French.We propose a rule-based method, that we evaluate and compare with twoMachine-Learning models, namely RIPPER and Gradient Boosting. Our data setcontains French newspapers, periodicals and magazines, published in the firsthalf of the twentieth century in the Franche-Comt\'e Region. Results. Ourrule-based system outperforms the two other models in nearly all evaluations.It ...
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of his...
In recent decades, major efforts to digitize historical documents led to the creation of large machi...
International audienceThis article describes the work performed in the Pattern Redundancy Analysis f...
Background. In recent years, libraries and archives led important digitisation campaigns that opened...
International audienceBackground. In recent years, libraries and archives led important digitisation...
In recent years, libraries and archives led important digitisation campaigns that opened the access ...
This dataset is intended for training and testing Logical Layout Analysis and recognition system on ...
Dataset for Logical-layout analysis on French Historical Newspapers This is a dataset for training...
International audienceNewspapers are documents made of news item and informative articles. They are ...
This work introduces a practical method for performing logical layout analysis on heterogeneous peri...
The current spread of digital documents raised the need of effective content-based retrieval techni...
This thesis focuses on automatic recognition of historical French registers. These documents contain...
International audienceThis work focuses on the layout analysis of historical handwritten registers, ...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of his...
In recent decades, major efforts to digitize historical documents led to the creation of large machi...
International audienceThis article describes the work performed in the Pattern Redundancy Analysis f...
Background. In recent years, libraries and archives led important digitisation campaigns that opened...
International audienceBackground. In recent years, libraries and archives led important digitisation...
In recent years, libraries and archives led important digitisation campaigns that opened the access ...
This dataset is intended for training and testing Logical Layout Analysis and recognition system on ...
Dataset for Logical-layout analysis on French Historical Newspapers This is a dataset for training...
International audienceNewspapers are documents made of news item and informative articles. They are ...
This work introduces a practical method for performing logical layout analysis on heterogeneous peri...
The current spread of digital documents raised the need of effective content-based retrieval techni...
This thesis focuses on automatic recognition of historical French registers. These documents contain...
International audienceThis work focuses on the layout analysis of historical handwritten registers, ...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of his...
In recent decades, major efforts to digitize historical documents led to the creation of large machi...
International audienceThis article describes the work performed in the Pattern Redundancy Analysis f...