Há uma vasta quantidade de informação nos textos antigos manuscritos e tipografados, e grandes esforços para a digitalização e disponibilização desses documentos têm sido feitos nos últimos anos. No entanto, os sistemas de Reconhecimento Óptico de Caracteres (OCR) não têm grande sucesso nesses documentos por diversas razões, por exemplo, devido a defeitos por envelhecimento do papel, manchas, iluminação desigual, dobras, escrita do verso transparecendo na frente, pouco contraste entre texto e fundo, entre outros. Uma das etapas importantes para o sucesso de um OCR é a boa segmentação da parte escrita e do fundo da imagem (binarização) e essa etapa é particularmente sensível a esses efeitos que são próprios de documentos históricos. Tanto ...
The amount of information stored in the form of historical documents is enormous and their treatment...
Proces segmentace historických obrazových dokumentů je klíčový pro jejich následné převedení do text...
Abstract: Palm leaf manuscripts were one of the earliest forms of written media and were used in So...
Abstract — A number of binarization techniques have been proposed in the past for automatic document...
Computerized analysis of handwritten documents is an active research area in image analysis and comp...
Správná segmentace obrazových dokumentů je jednou z nejdůležitějších součástí OCR systémů. Během ní ...
Historical documents present many challenges for Optical Character Recognition Systems (OCR), ...
Text line segmentation is one of the pre-stages of modern optical characterrecognition systems. The ...
As our world enters an electronic era, it has become important to be able to quickly and easily pres...
In this paper a complete OCR methodology for recognizing historical documents, either printed or han...
To effortlessly digitise historical documents has risen to be of great interest for some time. Part ...
The aim of this thesis is to build and evaluate how a word segmentation algorithm performs when extr...
The emergence of large scale digitization projects transforming printed heritage into digitally ava...
Digital archiving has helped in the development of the exploitation of historical documents, but the...
In this thesis we work on recognizing the text in the book ``Rerum Frisicarum Historia'' by Ubbo Emm...
The amount of information stored in the form of historical documents is enormous and their treatment...
Proces segmentace historických obrazových dokumentů je klíčový pro jejich následné převedení do text...
Abstract: Palm leaf manuscripts were one of the earliest forms of written media and were used in So...
Abstract — A number of binarization techniques have been proposed in the past for automatic document...
Computerized analysis of handwritten documents is an active research area in image analysis and comp...
Správná segmentace obrazových dokumentů je jednou z nejdůležitějších součástí OCR systémů. Během ní ...
Historical documents present many challenges for Optical Character Recognition Systems (OCR), ...
Text line segmentation is one of the pre-stages of modern optical characterrecognition systems. The ...
As our world enters an electronic era, it has become important to be able to quickly and easily pres...
In this paper a complete OCR methodology for recognizing historical documents, either printed or han...
To effortlessly digitise historical documents has risen to be of great interest for some time. Part ...
The aim of this thesis is to build and evaluate how a word segmentation algorithm performs when extr...
The emergence of large scale digitization projects transforming printed heritage into digitally ava...
Digital archiving has helped in the development of the exploitation of historical documents, but the...
In this thesis we work on recognizing the text in the book ``Rerum Frisicarum Historia'' by Ubbo Emm...
The amount of information stored in the form of historical documents is enormous and their treatment...
Proces segmentace historických obrazových dokumentů je klíčový pro jejich následné převedení do text...
Abstract: Palm leaf manuscripts were one of the earliest forms of written media and were used in So...