This study examines whether a supervised inductive learning approach can be applied to the problem of document understanding. A representation language for describing page layout is introduced, and the possibility of extending it with intensionally defined predicates is discussed. Experimental results obtained with a well-known learning system, FOCL, are presented. They confirm the need to recast document understanding in terms of a new supervised inductive learning strategy, called contextual learning. Experiments in which a dependence hierarchy between concepts is defined show that contextual rules increase predictive accuracy and decrease learning time for labeling proble...
It can be very difficult to manually create software systems which capture the knowledge of an exper...
It can be very difficult to manually create software systems which capture the knowledge of an exper...
WISDOM++ is an intelligent document processing system that transforms a paper document into HTML/XML...
In this paper, we propose a supervised inductive learning approach for the problem of document under...
A hybrid method of using empirical and supervised learning to acquire knowledge expressed in the for...
A fundamental task of document image understanding is to recognize semantically relevant components ...
In this paper, a methodology for document classification and understanding is proposed. It is based ...
Document understanding requires discovery of meaningful patterns in text, which in turn involves ana...
Knowledge-based approaches to document categorization make use of well-elaborated and powerful pat...
The current spread of digital documents has raised the need for effective content-based retrieval techni...
We describe the results of extensive experiments using optimized rule-based induction methods on lar...
A paper document processing system is an information system component which transforms information o...
It can be very difficult to create software systems which capture the knowledge of an expert. It is...
In Document Image Understanding, one of the fundamental tasks is that of recognizing semantically re...
WISDOM is an intelligent document processing system that transforms printed information into a symbol...