This paper describes a tool for recombining the logical structure from an XML document with the typeset appearance of the corresponding PDF document. The tool uses the XML representation as a template for the insertion of the logical structure into the existing PDF document, thereby creating a Structured/Tagged PDF. The addition of logical structure adds value to the PDF in three ways: the accessibility is improved (PDF screen readers for visually impaired users perform better), media options are enhanced (the ability to reflow PDF documents, using structure as a guide, makes PDF viable for use on hand-held devices) and the re-usability of the PDF documents benefits greatly from the presence of an XML-like structure tree to guide the proces...
In order to present most XML documents for human consumption, formatting information must be introdu...
Abstract. Accessing the structured content of PDF document is a difficult task, requiring pre-proces...
In der Darstellung, Weitergabe und Aufbewahrung elektronischer Publikationen steht das Format PDF un...
This paper describes a tool for recombining the logical structure from an XML document with the type...
Documents are often marked up in XML-based tagsets to delineate major structural components such as ...
Document representations can rapidly become unwieldy if they try to encapsulate all possible documen...
This article presents Xed, a reverse engineering tool for PDF documents, which extracts the original...
The Portable Document Format (PDF) is a page-oriented, graphically rich document format based on Pos...
L’amélioration rapide du nombre de documents stockés électroniquement représente un défi pour la cla...
Rapid improvement in the number of documents stored electronically presents a challenge for automati...
document image analysis system that can transform paper documents into XML format [1]. An effective ...
L amélioration rapide du nombre de documents stockés électroniquement représente un défi pour la cla...
Abstract. Tables are a common structuring element in many documents, such as PDF files. To reuse suc...
Physical and logical structure recovering from electronic documents is still an open issue. In this ...
The Portable Document Format (PDF), defined by Adobe Systems Inc. as the basis of its Acrobat produc...
In order to present most XML documents for human consumption, formatting information must be introdu...
Abstract. Accessing the structured content of PDF document is a difficult task, requiring pre-proces...
In der Darstellung, Weitergabe und Aufbewahrung elektronischer Publikationen steht das Format PDF un...
This paper describes a tool for recombining the logical structure from an XML document with the type...
Documents are often marked up in XML-based tagsets to delineate major structural components such as ...
Document representations can rapidly become unwieldy if they try to encapsulate all possible documen...
This article presents Xed, a reverse engineering tool for PDF documents, which extracts the original...
The Portable Document Format (PDF) is a page-oriented, graphically rich document format based on Pos...
L’amélioration rapide du nombre de documents stockés électroniquement représente un défi pour la cla...
Rapid improvement in the number of documents stored electronically presents a challenge for automati...
document image analysis system that can transform paper documents into XML format [1]. An effective ...
L amélioration rapide du nombre de documents stockés électroniquement représente un défi pour la cla...
Abstract. Tables are a common structuring element in many documents, such as PDF files. To reuse suc...
Physical and logical structure recovering from electronic documents is still an open issue. In this ...
The Portable Document Format (PDF), defined by Adobe Systems Inc. as the basis of its Acrobat produc...
In order to present most XML documents for human consumption, formatting information must be introdu...
Abstract. Accessing the structured content of PDF document is a difficult task, requiring pre-proces...
In der Darstellung, Weitergabe und Aufbewahrung elektronischer Publikationen steht das Format PDF un...