International audienceBackground. In recent years, libraries and archives led important digitisation campaigns that opened the access to vast collections of historical documents. While such documents are often available as XML ALTO documents, they lack information about their logical structure. In this paper, we address the problem of Logical Layout Analysis applied to historical documents in French. We propose a rulebased method, that we evaluate and compare with two Machine-Learning models, namely RIPPER and Gradient Boosting. Our data set contains French newspapers, periodicals and magazines, published in the first half of the twentieth century in the Franche-Comté Region. Results. Our rule-based system outperforms the two other models i...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
In recent decades, major efforts to digitize historical documents led to the creation of large machi...
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of his...
International audienceBackground. In recent years, libraries and archives led important digitisation...
Background. In recent years, libraries and archives led importantdigitisation campaigns that opened ...
Background. In recent years, libraries and archives led important digitisation campaigns that opened...
In recent years, libraries and archives led important digitisation campaigns that opened the access ...
Dataset for Logical-layout analysis on French Historical Newspapers This is a dataset for training...
Dataset for Logical-layout analysis on French Historical Newspapers This is a dataset for training...
International audienceNewspapers are documents made of news item and informative articles. They are ...
This work introduces a practical method for performing logical layout analysis on heterogeneous peri...
The current spread of digital documents raised the need of effective content-based retrieval techni...
International audienceThis work focuses on the layout analysis of historical handwritten registers, ...
This thesis focuses on automatic recognition of historical French registers. These documents contain...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
In recent decades, major efforts to digitize historical documents led to the creation of large machi...
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of his...
International audienceBackground. In recent years, libraries and archives led important digitisation...
Background. In recent years, libraries and archives led importantdigitisation campaigns that opened ...
Background. In recent years, libraries and archives led important digitisation campaigns that opened...
In recent years, libraries and archives led important digitisation campaigns that opened the access ...
Dataset for Logical-layout analysis on French Historical Newspapers This is a dataset for training...
Dataset for Logical-layout analysis on French Historical Newspapers This is a dataset for training...
International audienceNewspapers are documents made of news item and informative articles. They are ...
This work introduces a practical method for performing logical layout analysis on heterogeneous peri...
The current spread of digital documents raised the need of effective content-based retrieval techni...
International audienceThis work focuses on the layout analysis of historical handwritten registers, ...
This thesis focuses on automatic recognition of historical French registers. These documents contain...
The massive amounts of digitized historical documents acquired over the last decades naturally lend ...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
In recent decades, major efforts to digitize historical documents led to the creation of large machi...
In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of his...