Abstract—A persistent flaw in the evaluation of page segmentation algorithms is examined. Index Terms—X-Y tree, page segmentation, layout analysis. Ç A detailed comparative evaluation of six page segmentation methods, reported recently by Shafait et al. [1], follows the experimental protocol of an earlier study by Mao and Kanungo [2] that purported to show that recursive X-Y cut (RXYC) segmentation is much more error-prone than competitive methods, even on isothetic layouts. It does not, however, require much experimentation or reflection to discover that all pixel-projection methods (for either segmentation or skew determination) require removing any black background introduced by optical scanning. We reported good results for RXYC segment...
<p>Web pages consist of different segments, serving different purposes. Most common types of these s...
There is a significant need to objectively evaluate layout analysis (page segmentation and region cl...
There is an ever increasing number of publications which do not have the “traditional” layout where ...
Page segmentation is an important field to analyse patterns from the OCR Systems. In this paper we t...
Column segmentation logically precedes OCR in the document analysis process. The trainable algorithm...
We describe a new approach for the automatic and objective evaluation of page segmentation (zoning) ...
This paper describes fast and efficient method for page segmentation of document containing nonrecta...
We describe a new approach for evaluating page segmentation algorithms. Unlike techniques that rely ...
Many image segmentation algorithms are known, but often there is an inherent obstacle in the unbias...
There is a significant need to objectively evaluate layout analysis (page segmentation and region cl...
A method is presented for the efficient segmentation of text lines from scanned images of technical ...
There is an established need for objective evaluation of layout analysis methods, in realistic circu...
Page layout analysis has been extensively studied since the 1980`s, particularly after computers beg...
Image thresholding and page segmentation are necessary components of any image understanding and rec...
The goal of this work is to add the capability to segment documents containing text, graphics, and p...
<p>Web pages consist of different segments, serving different purposes. Most common types of these s...
There is a significant need to objectively evaluate layout analysis (page segmentation and region cl...
There is an ever increasing number of publications which do not have the “traditional” layout where ...
Page segmentation is an important field to analyse patterns from the OCR Systems. In this paper we t...
Column segmentation logically precedes OCR in the document analysis process. The trainable algorithm...
We describe a new approach for the automatic and objective evaluation of page segmentation (zoning) ...
This paper describes fast and efficient method for page segmentation of document containing nonrecta...
We describe a new approach for evaluating page segmentation algorithms. Unlike techniques that rely ...
Many image segmentation algorithms are known, but often there is an inherent obstacle in the unbias...
There is a significant need to objectively evaluate layout analysis (page segmentation and region cl...
A method is presented for the efficient segmentation of text lines from scanned images of technical ...
There is an established need for objective evaluation of layout analysis methods, in realistic circu...
Page layout analysis has been extensively studied since the 1980`s, particularly after computers beg...
Image thresholding and page segmentation are necessary components of any image understanding and rec...
The goal of this work is to add the capability to segment documents containing text, graphics, and p...
<p>Web pages consist of different segments, serving different purposes. Most common types of these s...
There is a significant need to objectively evaluate layout analysis (page segmentation and region cl...
There is an ever increasing number of publications which do not have the “traditional” layout where ...