We present strategies and results for identifying the symbol type (lower-case, upper-case, digit, and punctu-ation or special symbols) of every character in a text document by using various kinds of information from neighboring characters. In the expectation of reason-able word and character segmentation for shape cluster-ing, we designed several type recognition methods that depend on cluster n-grams, shape codes, and within-word context. On an ASCII test corpus of 925 arti-cles that simulates perfect image-level processing, these methods achieve a substantial improvement over default assignment of all characters to lower case. 1
This work deals with the process of text location and recognition in an image document. It discusses...
Microsoft, Motorola, Siemens, Hitachi, NICI, IUF This paper describes treebased classification of c...
The paper deals with extraction of individual components from handwritten word image consisting of s...
Techniques are described to enhance the output of an hypothesized character-recognition machine by p...
Improvement of specific character recognition systems through the use of contextual constraints has ...
A contour-tracing technique originally divised by Clemens and Mason was modified and used with sever...
Abstract- The importance of contextual information, at various different levels, for the satisfactor...
Despite ubiquitous claims that optical character recog-nition (OCR) is a “solved problem, ” many cat...
Abstract—Document image decoding (DID) is a trial to understand the contents of a whole document wit...
Although OCR techniques work very reliably for high-resolution documents, the recognition of superim...
Despite ubiquitous claims that optical character recog- nition (OCR) is a “solved problem,” many cat...
This paper describes a two-stage classification method for (1) classification of isolated characters...
This paper describes a two-stage classification method for (1) classification of isolated characters...
Although OCR techniques work very reliably for high-resolution documents, the recognition of superim...
This work reports experiments with four hierarchical clustering algorithms and two clustering indice...
This work deals with the process of text location and recognition in an image document. It discusses...
Microsoft, Motorola, Siemens, Hitachi, NICI, IUF This paper describes treebased classification of c...
The paper deals with extraction of individual components from handwritten word image consisting of s...
Techniques are described to enhance the output of an hypothesized character-recognition machine by p...
Improvement of specific character recognition systems through the use of contextual constraints has ...
A contour-tracing technique originally divised by Clemens and Mason was modified and used with sever...
Abstract- The importance of contextual information, at various different levels, for the satisfactor...
Despite ubiquitous claims that optical character recog-nition (OCR) is a “solved problem, ” many cat...
Abstract—Document image decoding (DID) is a trial to understand the contents of a whole document wit...
Although OCR techniques work very reliably for high-resolution documents, the recognition of superim...
Despite ubiquitous claims that optical character recog- nition (OCR) is a “solved problem,” many cat...
This paper describes a two-stage classification method for (1) classification of isolated characters...
This paper describes a two-stage classification method for (1) classification of isolated characters...
Although OCR techniques work very reliably for high-resolution documents, the recognition of superim...
This work reports experiments with four hierarchical clustering algorithms and two clustering indice...
This work deals with the process of text location and recognition in an image document. It discusses...
Microsoft, Motorola, Siemens, Hitachi, NICI, IUF This paper describes treebased classification of c...
The paper deals with extraction of individual components from handwritten word image consisting of s...