This paper presents the application of first-order logic machine learning techniques to two document domains in order to learn rules for recognizing the semantic role of their logical components. Specifically, the multistrategy incremental learning system INTHELEX has been applied to multi-format scientific papers and to documents concerning European films from the 1920s and 1930s. The challenge comes from the different levels of formatting standards in these domains: from (more or less) standardized layouts in scientific papers to documents with almost no standard in historical cultural heritage material. Experimental results in both domains and a comparison with the Progol system assess the advantages that the exploitation of INTH...
The current spread of digital documents has raised the need for effective content-based retrieval techni...
WISDOM++ is an intelligent document processing system that transforms a paper document into HTML/XML...
The paper introduces a descriptive data mining method to discover knowledge for the task of automati...
This work presents the application of a first-order logic incremental learning system, INTHELEX, to ...
In real-world Digital Libraries, Artificial Intelligence techniques are essential for tackling the a...
This work presents the application of a new, enhanced version of the incremental learning system INT...
This work presents the application of incremental symbolic learning strategies for the automatic ind...
In recent years, the spread of computers and the Internet has caused a significant amount of documents...
This work presents the application of a multistrategy approach to some document processing tasks. Th...
We present a methodology for document processing that exploits logic-based machine learning techniqu...
A paper document processing system is an information system component which transforms information o...
Document image understanding refers to logical and semantic analysis of document images in order to ...
A fundamental task of document image understanding is to recognize semantically relevant components ...
In Document Image Understanding, one of the fundamental tasks is that of recognizing semantically re...
Document image understanding denotes the recognition of semantically relevant components in the layo...