National audienceDocument Analysis and Recognition consist in translating their images into an electronic form that can be reusable. The analysis extracts the document layout structure from its image, and the recognition assigns to the layout structure components their logical functions in the document. In this article, we present our work on recognition of a category of documents in which the logical structure is based on typographical tagging such as table of contents. We propose a perceptual approach that extracts these typographical tagging directly from document images. However, the structures of such documents are complex and variable. Their complexity can cause errors in the analysis output, which influence directly the recognition t...
Determining the reading order for layout components extracted from a document image can be a crucial...
Background. In recent years, libraries and archives led importantdigitisation campaigns that opened ...
An innumerable number of documents is being printed, scanned, faxed, photographed every day. These d...
National audienceDocument Analysis and Recognition consist in translating their images into an elect...
The automatic processing of written documents is a very active field in the industry. Indeed, due to...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
Cette thèse s'attache à l'étude de la structuration des documents dits à "typographie riche et récur...
National audienceThe aim of this paper is to give a short view on automatic document analysis and re...
This work introduces a practical method for performing logical layout analysis on heterogeneous peri...
In this paper we present and discuss a novel approach to modeling logical structures of documents, b...
This thesis focuses on automatic recognition of historical French registers. These documents contain...
International audienceBackground. In recent years, libraries and archives led important digitisation...
An innumerable number of documents is being printed, scanned, faxed, photographed every day. These d...
International audienceNewspapers are documents made of news item and informative articles. They are ...
This paper addresses the problem of layout and logical structure extraction from image documents. Tw...
Determining the reading order for layout components extracted from a document image can be a crucial...
Background. In recent years, libraries and archives led importantdigitisation campaigns that opened ...
An innumerable number of documents is being printed, scanned, faxed, photographed every day. These d...
National audienceDocument Analysis and Recognition consist in translating their images into an elect...
The automatic processing of written documents is a very active field in the industry. Indeed, due to...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
Cette thèse s'attache à l'étude de la structuration des documents dits à "typographie riche et récur...
National audienceThe aim of this paper is to give a short view on automatic document analysis and re...
This work introduces a practical method for performing logical layout analysis on heterogeneous peri...
In this paper we present and discuss a novel approach to modeling logical structures of documents, b...
This thesis focuses on automatic recognition of historical French registers. These documents contain...
International audienceBackground. In recent years, libraries and archives led important digitisation...
An innumerable number of documents is being printed, scanned, faxed, photographed every day. These d...
International audienceNewspapers are documents made of news item and informative articles. They are ...
This paper addresses the problem of layout and logical structure extraction from image documents. Tw...
Determining the reading order for layout components extracted from a document image can be a crucial...
Background. In recent years, libraries and archives led importantdigitisation campaigns that opened ...
An innumerable number of documents is being printed, scanned, faxed, photographed every day. These d...