. This paper deals with the representation of document models used in the field of document recognition. A novel formalism called generalized n-gram is presented, which is shown to be accurate for the recognition task and well adapted to automatic learning by examples. The paper addresses also the thorny problem of integrating models for document analysis with existing standards used for document manipulation and production. 1 Introduction The benefits of using high level descriptions to handle structured documents has been widely recognized by the scientific community dealing with electronic document production. The SGML language providing the DTD mechanism [11] has become increasingly important during the last five years. The document an...
We present a general approach for the hierarchical segmentation and labeling of document layout stru...
The paper considers the issues of building an information retrieval system by using the algorithm of...
We study the problem of automatic generation of a document type definition (DTD) for a set of Standa...
Abstract. This paper deals with the representation of document models used in the field of document ...
In this paper we present and discuss a novel approach to modeling logical structures of documents, b...
Most of the electronic documents available from todays huge number of electronic information sources...
This paper presents a new research theme at our institute in the field of document engineering; it d...
The use of generic model for a document class as the knowledge base in a Document Analysis System fa...
National audienceThe aim of this paper is to give a short view on automatic document analysis and re...
. Successful applications of digital libraries require structured access to sources of information....
Document image analysis is the study of converting documents from paper form to an electronic form t...
International audienceWith the recent development of new information and communication technologies,...
Abstract--Surveys of the basic concepts and underlying techniques are presented in this paper. A bas...
In literature, many feature types and learning algorithms are proposed for document classification. ...
This paper presents U new model based document image segmentation scheme that uses XML-DTDs (extensi...
We present a general approach for the hierarchical segmentation and labeling of document layout stru...
The paper considers the issues of building an information retrieval system by using the algorithm of...
We study the problem of automatic generation of a document type definition (DTD) for a set of Standa...
Abstract. This paper deals with the representation of document models used in the field of document ...
In this paper we present and discuss a novel approach to modeling logical structures of documents, b...
Most of the electronic documents available from todays huge number of electronic information sources...
This paper presents a new research theme at our institute in the field of document engineering; it d...
The use of generic model for a document class as the knowledge base in a Document Analysis System fa...
National audienceThe aim of this paper is to give a short view on automatic document analysis and re...
. Successful applications of digital libraries require structured access to sources of information....
Document image analysis is the study of converting documents from paper form to an electronic form t...
International audienceWith the recent development of new information and communication technologies,...
Abstract--Surveys of the basic concepts and underlying techniques are presented in this paper. A bas...
In literature, many feature types and learning algorithms are proposed for document classification. ...
This paper presents U new model based document image segmentation scheme that uses XML-DTDs (extensi...
We present a general approach for the hierarchical segmentation and labeling of document layout stru...
The paper considers the issues of building an information retrieval system by using the algorithm of...
We study the problem of automatic generation of a document type definition (DTD) for a set of Standa...