The paper considers the issues of building an information retrieval system by using the algorithm of automated classification and recognition of the structure of fulltext documents. It describes the selected approaches, as well as the algorithm for identifying the document type and the algorithm for recognizing its logical structure, developed on the basis of these approaches, with the aim of further semantic processing. It introduces a multi stage method for automated recognition and formation of a model of the logical structure of a document. Experimental studies of this method have been conducted on the array of reporting documents “Rosfinmonitoring”
Abstract: Technical documentation such as user manual and manufacturing document is now an important...
National audienceThe aim of this paper is to give a short view on automatic document analysis and re...
The automated discovery of logical structure in text documents is an important problem that has rece...
A document preparation system is characterised not only by the features included in its implementati...
International audienceIn information retrieval systems, the indexation task is usually conducted irr...
Documents often display a structure, e.g., several sections, each with several subsections and so on...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
This paper presents a review of some of the more innovative and successful projects in the area of ...
International audienceWith the recent development of new information and communication technologies,...
. This paper deals with the representation of document models used in the field of document recognit...
We present a methodology for document processing that exploits logic-based machine learning techniqu...
In this paper we present and discuss a novel approach to modeling logical structures of documents, b...
International audienceTechnical documentation such as user manual and manufacturing document is now ...
In this paper, a logical-mathematical model of the task of analyzing the composition and structure o...
This paper deals with automatic classification of text documents, showing advantages of the classifi...
Abstract: Technical documentation such as user manual and manufacturing document is now an important...
National audienceThe aim of this paper is to give a short view on automatic document analysis and re...
The automated discovery of logical structure in text documents is an important problem that has rece...
A document preparation system is characterised not only by the features included in its implementati...
International audienceIn information retrieval systems, the indexation task is usually conducted irr...
Documents often display a structure, e.g., several sections, each with several subsections and so on...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
This paper presents a review of some of the more innovative and successful projects in the area of ...
International audienceWith the recent development of new information and communication technologies,...
. This paper deals with the representation of document models used in the field of document recognit...
We present a methodology for document processing that exploits logic-based machine learning techniqu...
In this paper we present and discuss a novel approach to modeling logical structures of documents, b...
International audienceTechnical documentation such as user manual and manufacturing document is now ...
In this paper, a logical-mathematical model of the task of analyzing the composition and structure o...
This paper deals with automatic classification of text documents, showing advantages of the classifi...
Abstract: Technical documentation such as user manual and manufacturing document is now an important...
National audienceThe aim of this paper is to give a short view on automatic document analysis and re...
The automated discovery of logical structure in text documents is an important problem that has rece...