This report focuses on analysis steps necessary for a paper document processing. It is divided in three major parts: a document image preprocessing, a knowledge-based geometric classification of the image, and a expectation-driven text recognition. It first illustrates the several low level image processing procedures providing the physical document structure of a scanned document image. Furthermore, it describes a knowledge-based approach, developed for the identification of logical objects (e.g., sender or the footnote of a letter) in a document image. The logical identifiers provide a context-restricted consideration of the containing text. While using specific logical dictionaries, a expectation-driven text recognition is possible to id...
This paper presents a structured, multi-level architecture of a lexicon which is a central component...
Document analysis is responsible for an essential progress in office automation. This paper is part ...
This paper analyzes the problems of document image recognition and the existing solutions. Document ...
ABSTRACT Office Automation by electronic text processing has not reduced the amount of paper used fo...
In the past, many people have proclaimed the vision of the paperless office, but today offices consu...
Document analysis is responsible for an essential progress in office automation. This paper is part ...
Document analysis is responsible for an essential progress in office automation. This paper is part ...
Project ALV conducted research in Document Analysis. The main goal of ALV was developing a phototypi...
A paper document processing system is an information system component which transforms information o...
This thesis explores the domain of document analysis and document classification within the PDF docu...
The economic feasibility of creating a large database of documentimage has left a tremendous need fo...
The digital processing of electronic documents is widely exploited across many domains to improve th...
Document Image Processing allows systems like OCR, writer identification, writer recognition, check ...
Abstract--Surveys of the basic concepts and underlying techniques are presented in this paper. A bas...
Intelligent document segmentation can bring electronic browsing within the reach of most users. The ...
This paper presents a structured, multi-level architecture of a lexicon which is a central component...
Document analysis is responsible for an essential progress in office automation. This paper is part ...
This paper analyzes the problems of document image recognition and the existing solutions. Document ...
ABSTRACT Office Automation by electronic text processing has not reduced the amount of paper used fo...
In the past, many people have proclaimed the vision of the paperless office, but today offices consu...
Document analysis is responsible for an essential progress in office automation. This paper is part ...
Document analysis is responsible for an essential progress in office automation. This paper is part ...
Project ALV conducted research in Document Analysis. The main goal of ALV was developing a phototypi...
A paper document processing system is an information system component which transforms information o...
This thesis explores the domain of document analysis and document classification within the PDF docu...
The economic feasibility of creating a large database of documentimage has left a tremendous need fo...
The digital processing of electronic documents is widely exploited across many domains to improve th...
Document Image Processing allows systems like OCR, writer identification, writer recognition, check ...
Abstract--Surveys of the basic concepts and underlying techniques are presented in this paper. A bas...
Intelligent document segmentation can bring electronic browsing within the reach of most users. The ...
This paper presents a structured, multi-level architecture of a lexicon which is a central component...
Document analysis is responsible for an essential progress in office automation. This paper is part ...
This paper analyzes the problems of document image recognition and the existing solutions. Document ...