This thesis discusses the design and implementation of an OCR post processing system. The system is used to perform automatic spelling detection and correction on noisy, OCR generated text. Unlike previous post processing systems, this system works in conjunction with an inverted file database system. The initial results obtained from post processing 10,000 pages of OCR\u27ed text are encouraging. These results indicate that the use of global and local document information extracted from the inverted file system can be effectively used to correct OCR generated spelling errors
Digital Humanities researchers often make use of software that helps them in the task of finding non...
In order to improve OCR quality in texts originally typeset in Gothic script, we have built an autom...
Presented in this thesis is a study of the effect of OCR errors on short documents. OCR recognizes a...
Optical Character Recognition (OCR) Post Processing involves data cleaning steps for documents that ...
In this thesis we describe a spelling correction system designed specifically for OCR (Optical Chara...
This paper describes a new automatic spelling correction program to deal with OCR generated errors. ...
In this paper, we describe a spelling correction system designed specifically for OCR-generated text...
Post-OCR is an important processing step that follows optical character recognition (OCR) and is mea...
In this paper we describe our efforts in reducing and correcting OCR errors in the context of buildi...
Abstract. In this paper, we describe a spelling correction system designed specifically for OCR-gene...
International audienceWe present an experiment conducted on the automatic spelling correction of tex...
International audienceThis paper describes the second round of the ICDAR 2019 competition on post-OC...
This paper describes a new expert system for automatically correcting errors made by optical charact...
Understanding handwritten and printed text is easier for humans but computers do not have the same l...
In this paper, we describe a new and original approach for post-processing step in an OCR system. Th...
Digital Humanities researchers often make use of software that helps them in the task of finding non...
In order to improve OCR quality in texts originally typeset in Gothic script, we have built an autom...
Presented in this thesis is a study of the effect of OCR errors on short documents. OCR recognizes a...
Optical Character Recognition (OCR) Post Processing involves data cleaning steps for documents that ...
In this thesis we describe a spelling correction system designed specifically for OCR (Optical Chara...
This paper describes a new automatic spelling correction program to deal with OCR generated errors. ...
In this paper, we describe a spelling correction system designed specifically for OCR-generated text...
Post-OCR is an important processing step that follows optical character recognition (OCR) and is mea...
In this paper we describe our efforts in reducing and correcting OCR errors in the context of buildi...
Abstract. In this paper, we describe a spelling correction system designed specifically for OCR-gene...
International audienceWe present an experiment conducted on the automatic spelling correction of tex...
International audienceThis paper describes the second round of the ICDAR 2019 competition on post-OC...
This paper describes a new expert system for automatically correcting errors made by optical charact...
Understanding handwritten and printed text is easier for humans but computers do not have the same l...
In this paper, we describe a new and original approach for post-processing step in an OCR system. Th...
Digital Humanities researchers often make use of software that helps them in the task of finding non...
In order to improve OCR quality in texts originally typeset in Gothic script, we have built an autom...
Presented in this thesis is a study of the effect of OCR errors on short documents. OCR recognizes a...