A number of projects are creating searchable digital libraries of printed books. These include the Million Book Project, the Google Book project and similar eorts from Yahoo and Microsoft. Content-based on line book retrieval usually requires rst converting printed text into machine readable (e.g. ASCII) text using an optical character recognition (OCR) engine and then doing full text search on the results. Many of these books are old and there are a variety of processing steps that are required to create an end to end system. Changing any step (including the scanning process) can aect OCR performance and hence a good automatic statistical evaluation of OCR performance on book length material is needed. Evaluating OCR performance on the ent...
In this paper, we present the implementation and evaluation of first order and second order Hidden M...
Todays digital libraries increasingly include not only printed text but also scanned handwritten pag...
International audienceSince 2006 the national library of France (BnF) has developed many mass digiti...
A number of projects are creating searchable digital libraries of printed books. These include the M...
A number of projects are creating searchable digital libraries of printed books. These include the M...
This paper aims to evaluate the accuracy of optical character recognition (OCR) systems on real scan...
Millions of books from public libraries and private collections have been scanned by various organiz...
Abstract—This paper evaluates an automated scheme for aligning and combining optical character recog...
This paper evaluates an automated scheme for aligning and combining optical character recognition (O...
Whole-book recognition is a document image analysis strategy that operates on the complete set of a ...
In this paper, we present a workflow for reworking digitized versions of early modern books, freely ...
In this paper, we present a workflow for reworking digitized versions of early modern books, freely ...
In this paper, we present a workflow for reworking digitized versions of early modern books, freely ...
In this paper, we present a workflow for reworking digitized versions of early modern books, freely ...
In this paper, we present the implementation and evaluation of first order and second order Hidden M...
In this paper, we present the implementation and evaluation of first order and second order Hidden M...
Todays digital libraries increasingly include not only printed text but also scanned handwritten pag...
International audienceSince 2006 the national library of France (BnF) has developed many mass digiti...
A number of projects are creating searchable digital libraries of printed books. These include the M...
A number of projects are creating searchable digital libraries of printed books. These include the M...
This paper aims to evaluate the accuracy of optical character recognition (OCR) systems on real scan...
Millions of books from public libraries and private collections have been scanned by various organiz...
Abstract—This paper evaluates an automated scheme for aligning and combining optical character recog...
This paper evaluates an automated scheme for aligning and combining optical character recognition (O...
Whole-book recognition is a document image analysis strategy that operates on the complete set of a ...
In this paper, we present a workflow for reworking digitized versions of early modern books, freely ...
In this paper, we present a workflow for reworking digitized versions of early modern books, freely ...
In this paper, we present a workflow for reworking digitized versions of early modern books, freely ...
In this paper, we present a workflow for reworking digitized versions of early modern books, freely ...
In this paper, we present the implementation and evaluation of first order and second order Hidden M...
In this paper, we present the implementation and evaluation of first order and second order Hidden M...
Todays digital libraries increasingly include not only printed text but also scanned handwritten pag...
International audienceSince 2006 the national library of France (BnF) has developed many mass digiti...