The problem of line breaking consists of finding the best way to split paragraphs into lines. It has been cleverly addressed by the total-fit algorithm exposed by Knuth and Plass in a well-known paper. Similarly, page-breaking algorithms break the content flow of a document into page units. Formatting languages—such as the World Wide Web Consortium standard Extensible Stylesheet Language Formatting Objects (XSL-FO)—allow users to set which content should be kept in the same page and how many isolated lines are acceptable at the beginning/end of each page. The strategies most formatters adopt to meet these requirements, however, are not satisfactory for many publishing contexts as they very often generate unpleasant empty areas. In that case...
We present the design of a markup language that is based on W3C standards and allows document author...
Digitization of newspapers is of interest for many reasons including preservation of history, access...
The XSL Formatting Objects specification has been a published recommendation for over a year. During...
The problem of line breaking consists of finding the best way to split paragraphs into lines. It has...
The line breaking problem is as follows: given some text and a page to print to, where are the best ...
© 1981 ACM. A basic problem in text formatting is that of determining the break points for separatin...
A single-parameter text-line extraction algorithm is described along with an efficient technique for...
The pagination problem of complex documents is in placing text and floating objects on pages in suc...
This paper proposes an extension of the XSL-FO standard which allows the specification of an unlimit...
Since the 1980s, two paradigms have dominated the representation of formatted electronic documents: ...
Adobe\u27s newest page layout program, InDesign, includes a multi-line composing engine. This feat...
In this document we analyzed the possibility to use CSS as a solid successor of XSL-FO for producing...
Abstract: The Constrained Run-Length Algorithm (CRLA) is a well-known technique for page segmentatio...
High volume print jobs are getting more common due to the growing demand for personalized documents....
Column segmentation logically precedes OCR in the document analysis process. The trainable algorithm...
We present the design of a markup language that is based on W3C standards and allows document author...
Digitization of newspapers is of interest for many reasons including preservation of history, access...
The XSL Formatting Objects specification has been a published recommendation for over a year. During...
The problem of line breaking consists of finding the best way to split paragraphs into lines. It has...
The line breaking problem is as follows: given some text and a page to print to, where are the best ...
© 1981 ACM. A basic problem in text formatting is that of determining the break points for separatin...
A single-parameter text-line extraction algorithm is described along with an efficient technique for...
The pagination problem of complex documents is in placing text and floating objects on pages in suc...
This paper proposes an extension of the XSL-FO standard which allows the specification of an unlimit...
Since the 1980s, two paradigms have dominated the representation of formatted electronic documents: ...
Adobe\u27s newest page layout program, InDesign, includes a multi-line composing engine. This feat...
In this document we analyzed the possibility to use CSS as a solid successor of XSL-FO for producing...
Abstract: The Constrained Run-Length Algorithm (CRLA) is a well-known technique for page segmentatio...
High volume print jobs are getting more common due to the growing demand for personalized documents....
Column segmentation logically precedes OCR in the document analysis process. The trainable algorithm...
We present the design of a markup language that is based on W3C standards and allows document author...
Digitization of newspapers is of interest for many reasons including preservation of history, access...
The XSL Formatting Objects specification has been a published recommendation for over a year. During...