The choice of a commercial Optical Character Recognition (OCR) engine is important for the process of automatically indexing technical drawings from their title blocks. We would like to benchmark commercial OCR engines with respect to their inclusion in the global digitalisation chain from scanning to understanding the text information contained in a technical drawing document. The crucial (costly) point is the manual correction of OCR recognition errors. By benchmarking, we intend to identify, for our application domain, the causes for OCR errors which are the most costly to correct
This report documents the algorithm of Optical Character Recognition, to the purpose of detecting an...
The millions of pages of historical documents that are digitized in libraries are increasingly used ...
Digitized collections of printed historical texts are important for research in Digital Humanities. ...
Introduction: Digitization is a crucial step towards achieving automation in production quality cont...
The user expectation from a digitized collection is that a full text search can be performed and tha...
We offer a perspective on the performance of current OCR systems by illustrating and explaining actu...
Optical Character Recognition (OCR) is a technique, used to convert scanned image into editable text...
Optical Character Recognition (OCR) is a technique, used to convert scanned image into editable text...
Optical Character Recognition (OCR) is a process of converting text from images to a machine-readabl...
This paper describes a new expert system for automatically correcting errors made by optical charact...
This paper presents a systematic review on optical character recognition techniques. In this review ...
Over the past years, considerable effort has been put into digitising library collections. As part o...
Post-OCR is an important processing step that follows optical character recognition (OCR) and is mea...
this paper were developed to establish a uniform method of evaluating the recognition of optical cha...
Owing to a boom of information technologies optical character recognition has recently become a popu...
This report documents the algorithm of Optical Character Recognition, to the purpose of detecting an...
The millions of pages of historical documents that are digitized in libraries are increasingly used ...
Digitized collections of printed historical texts are important for research in Digital Humanities. ...
Introduction: Digitization is a crucial step towards achieving automation in production quality cont...
The user expectation from a digitized collection is that a full text search can be performed and tha...
We offer a perspective on the performance of current OCR systems by illustrating and explaining actu...
Optical Character Recognition (OCR) is a technique, used to convert scanned image into editable text...
Optical Character Recognition (OCR) is a technique, used to convert scanned image into editable text...
Optical Character Recognition (OCR) is a process of converting text from images to a machine-readabl...
This paper describes a new expert system for automatically correcting errors made by optical charact...
This paper presents a systematic review on optical character recognition techniques. In this review ...
Over the past years, considerable effort has been put into digitising library collections. As part o...
Post-OCR is an important processing step that follows optical character recognition (OCR) and is mea...
this paper were developed to establish a uniform method of evaluating the recognition of optical cha...
Owing to a boom of information technologies optical character recognition has recently become a popu...
This report documents the algorithm of Optical Character Recognition, to the purpose of detecting an...
The millions of pages of historical documents that are digitized in libraries are increasingly used ...
Digitized collections of printed historical texts are important for research in Digital Humanities. ...