In this paper, we describe a new and original approach for post-processing step in an OCR system. This approach is based on new method of spelling correction to correct automatically misspelled words resulting from a character recognition step of scanned documents by combining both ontologies and bigram code in order to create a robust system able to solve automatically the anomalies of classical approaches. The proposed approach is based on a hybrid method which is spread over two stages, first one is character recognition by using the ontological model and the second one is word recognition based on spelling correction approach based on bigram codification for detection and correction of errors. The spelling error is broadly classified in...
Optical Character Recognition (OCR), is extraction of textual data from scanned text documents to fa...
Post-OCR is an important processing step that follows optical character recognition (OCR) and is mea...
Optical Character Recognition (OCR), is extraction of textual data from scanned text documents to fa...
Optical Character Recognition (OCR) Post Processing involves data cleaning steps for documents that ...
In Optical character recognition (OCR), the characteristics of Arabic text cause more errors than in...
Abstract. In this paper, we describe a spelling correction system designed specifically for OCR-gene...
International audienceWe present an experiment conducted on the automatic spelling correction of tex...
In this paper, we describe a spelling correction system designed specifically for OCR-generated text...
This paper describes a new expert system for automatically correcting errors made by optical charact...
In this thesis we describe a spelling correction system designed specifically for OCR (Optical Chara...
In the last decades, a huge number of documents has been digitised, before undergoing optical charac...
This paper describes a new automatic spelling correction program to deal with OCR generated errors. ...
This thesis discusses the design and implementation of an OCR post processing system. The system is ...
International audienceIn this paper we present a novel approach to the automatic correction of OCR-i...
In this paper we describe our efforts in reducing and correcting OCR errors in the context of buildi...
Optical Character Recognition (OCR), is extraction of textual data from scanned text documents to fa...
Post-OCR is an important processing step that follows optical character recognition (OCR) and is mea...
Optical Character Recognition (OCR), is extraction of textual data from scanned text documents to fa...
Optical Character Recognition (OCR) Post Processing involves data cleaning steps for documents that ...
In Optical character recognition (OCR), the characteristics of Arabic text cause more errors than in...
Abstract. In this paper, we describe a spelling correction system designed specifically for OCR-gene...
International audienceWe present an experiment conducted on the automatic spelling correction of tex...
In this paper, we describe a spelling correction system designed specifically for OCR-generated text...
This paper describes a new expert system for automatically correcting errors made by optical charact...
In this thesis we describe a spelling correction system designed specifically for OCR (Optical Chara...
In the last decades, a huge number of documents has been digitised, before undergoing optical charac...
This paper describes a new automatic spelling correction program to deal with OCR generated errors. ...
This thesis discusses the design and implementation of an OCR post processing system. The system is ...
International audienceIn this paper we present a novel approach to the automatic correction of OCR-i...
In this paper we describe our efforts in reducing and correcting OCR errors in the context of buildi...
Optical Character Recognition (OCR), is extraction of textual data from scanned text documents to fa...
Post-OCR is an important processing step that follows optical character recognition (OCR) and is mea...
Optical Character Recognition (OCR), is extraction of textual data from scanned text documents to fa...