This paper presents a feature-based system which utilizes domain knowledge to segment and classify scanned image documents. Documents usually consists of a mixture of text and image. Text block possesses an interesting property that the x-profile or y-profile of text block is a periodic pattern. Image block possesses generate the connectivity histogram by summing the number of dark pixels with the same connectivity value. Initially, one-scan run-length smearing algorithm (RLSA) with block merging is proposed to segment the document. After segmentation process, the next task is to classify the segmented block. The classification task is then performed based on the rules induced from the features or primitives associated with each document. I...
Page layout analysis is a fundamental step of any document image understanding system. We introduce ...
A document image, beside text, may contain pictures, graphs, signatures, logos, barcodes, hand-drawn...
A method is presented for the efficient segmentation of text lines from scanned images of technical ...
Page layout analysis has been extensively studied since the 1980`s, particularly after computers beg...
Document page segmentation andclassification are important parts of the documentanalysis process. Pa...
AbstractText/Image region separation is the process of identifying location of various text and imag...
The main objective of this thesis is to develop a system to automatically segment and label a variet...
Image thresholding and page segmentation are necessary components of any image understanding and rec...
The object of research is the process of recognizing the areas of scanned documents images. The pape...
The object of research is the process of recognizing the areas of scanned documents images. The pape...
International audienceThis paper presents a Document Image Analysis (DIA) system able to extract hom...
Currently text line segmentation is an important stage of research in historical document processing...
Page segmentation and classification are important parts of the document analysis process. The aim i...
This paper describes a method for extracting words, textlines and text blocks by analyzing the spati...
This paper outlines the requirements and components for a proposed Document Analysis System, which a...
Page layout analysis is a fundamental step of any document image understanding system. We introduce ...
A document image, beside text, may contain pictures, graphs, signatures, logos, barcodes, hand-drawn...
A method is presented for the efficient segmentation of text lines from scanned images of technical ...
Page layout analysis has been extensively studied since the 1980`s, particularly after computers beg...
Document page segmentation andclassification are important parts of the documentanalysis process. Pa...
AbstractText/Image region separation is the process of identifying location of various text and imag...
The main objective of this thesis is to develop a system to automatically segment and label a variet...
Image thresholding and page segmentation are necessary components of any image understanding and rec...
The object of research is the process of recognizing the areas of scanned documents images. The pape...
The object of research is the process of recognizing the areas of scanned documents images. The pape...
International audienceThis paper presents a Document Image Analysis (DIA) system able to extract hom...
Currently text line segmentation is an important stage of research in historical document processing...
Page segmentation and classification are important parts of the document analysis process. The aim i...
This paper describes a method for extracting words, textlines and text blocks by analyzing the spati...
This paper outlines the requirements and components for a proposed Document Analysis System, which a...
Page layout analysis is a fundamental step of any document image understanding system. We introduce ...
A document image, beside text, may contain pictures, graphs, signatures, logos, barcodes, hand-drawn...
A method is presented for the efficient segmentation of text lines from scanned images of technical ...