This paper describes a tool for recombining the logical structure from an XML document with the typeset appearance of the corresponding PDF document. The tool uses the XML representation as a template for the insertion of the logical structure into the existing PDF document, thereby creating a Structured/Tagged PDF. The addition of logical structure adds value to the PDF in three ways: the accessibility is improved (PDF screen readers for visually impaired users perform better), media options are enhanced (the ability to reflow PDF documents, using structure as a guide, makes PDF viable for use on hand-held devices) and the re-usability of the PDF documents benefits greatly from the presence of an XML-like structure tree to guide the proces...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
XML as the most successful data representation format makes it easy to start working with structured...
In order to present most XML documents for human consumption, formatting information must be introdu...
This paper describes a tool for recombining the logical structure from an XML document with the type...
Documents are often marked up in XML-based tagsets to delineate major structural components such as ...
Document representations can rapidly become unwieldy if they try to encapsulate all possible documen...
Rapid improvement in the number of documents stored electronically presents a challenge for automati...
This article presents Xed, a reverse engineering tool for PDF documents, which extracts the original...
L’amélioration rapide du nombre de documents stockés électroniquement représente un défi pour la cla...
L amélioration rapide du nombre de documents stockés électroniquement représente un défi pour la cla...
The Portable Document Format (PDF), defined by Adobe Systems Inc. as the basis of its Acrobat produc...
XML is among the preferred formats for storing the structure of documents such as scientic articles,...
Document structures are a crucial mechanism for the creation and the usability of complex hypermedia...
Physical and logical structure recovering from electronic documents is still an open issue. In this ...
document image analysis system that can transform paper documents into XML format [1]. An effective ...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
XML as the most successful data representation format makes it easy to start working with structured...
In order to present most XML documents for human consumption, formatting information must be introdu...
This paper describes a tool for recombining the logical structure from an XML document with the type...
Documents are often marked up in XML-based tagsets to delineate major structural components such as ...
Document representations can rapidly become unwieldy if they try to encapsulate all possible documen...
Rapid improvement in the number of documents stored electronically presents a challenge for automati...
This article presents Xed, a reverse engineering tool for PDF documents, which extracts the original...
L’amélioration rapide du nombre de documents stockés électroniquement représente un défi pour la cla...
L amélioration rapide du nombre de documents stockés électroniquement représente un défi pour la cla...
The Portable Document Format (PDF), defined by Adobe Systems Inc. as the basis of its Acrobat produc...
XML is among the preferred formats for storing the structure of documents such as scientic articles,...
Document structures are a crucial mechanism for the creation and the usability of complex hypermedia...
Physical and logical structure recovering from electronic documents is still an open issue. In this ...
document image analysis system that can transform paper documents into XML format [1]. An effective ...
The availability of large, heterogeneous repositories of electronic documents is increasing rapidly,...
XML as the most successful data representation format makes it easy to start working with structured...
In order to present most XML documents for human consumption, formatting information must be introdu...