The task of printed Optical Character Recognition (OCR), though considered ``solved'' by many, still poses several challenges. The complex grapheme structure of many scripts, such as Devanagari and Urdu Nastaleeq, greatly lowers the performance of state-of-the-art OCR systems. Moreover, the digitization of historical and multilingual documents still require much probing. Lack of benchmark datasets further complicates the development of reliable OCR systems. This thesis aims to find the answers to some of these challenges using contemporary machine learning technologies. Specifically, the Long Short-Term Memory (LSTM) networks, have been employed to OCR modern as well historical monolingual documents. The excellent OCR results obtained on...
Abstract—Recurrent neural networks (RNN) have been suc-cessfully applied for recognition of cursive ...
The digitization of historical handwritten document images is important for the preservation of cult...
Building a robust Optical Character Recognition (OCR) system for languages, such as Arabic with curs...
This paper reports on high-performance Optical Character Recognition (OCR) experiments using Long Sh...
This paper reports on high-performance Optical Character Recognition (OCR) experiments using Long Sh...
This paper reports on high-performance Optical Character Recognition (OCR) experiments using Long Sh...
Applications based on Long-Short-Term Memory (LSTM) require large amounts of data for their training...
Optical Character Recognition (OCR) is a system of converting images, including text,into editable t...
The word error rate of any optical character recognition system (OCR) is usually substantially below...
The word error rate of any optical character recognition system (OCR) is usually substantially below...
Most of the low resource languages do not have the necessary resources to create even a substantial ...
The goal of this work is to develop statistical natural language models and processing techniques b...
The word error rate of any optical character recognition system (OCR) is usually substantially below...
Recognition of text on word or line images, without the need for sub-word segmentation has become th...
We tested the capabilities of Transformer-based text recognition technology when dealing with (multi...
Abstract—Recurrent neural networks (RNN) have been suc-cessfully applied for recognition of cursive ...
The digitization of historical handwritten document images is important for the preservation of cult...
Building a robust Optical Character Recognition (OCR) system for languages, such as Arabic with curs...
This paper reports on high-performance Optical Character Recognition (OCR) experiments using Long Sh...
This paper reports on high-performance Optical Character Recognition (OCR) experiments using Long Sh...
This paper reports on high-performance Optical Character Recognition (OCR) experiments using Long Sh...
Applications based on Long-Short-Term Memory (LSTM) require large amounts of data for their training...
Optical Character Recognition (OCR) is a system of converting images, including text,into editable t...
The word error rate of any optical character recognition system (OCR) is usually substantially below...
The word error rate of any optical character recognition system (OCR) is usually substantially below...
Most of the low resource languages do not have the necessary resources to create even a substantial ...
The goal of this work is to develop statistical natural language models and processing techniques b...
The word error rate of any optical character recognition system (OCR) is usually substantially below...
Recognition of text on word or line images, without the need for sub-word segmentation has become th...
We tested the capabilities of Transformer-based text recognition technology when dealing with (multi...
Abstract—Recurrent neural networks (RNN) have been suc-cessfully applied for recognition of cursive ...
The digitization of historical handwritten document images is important for the preservation of cult...
Building a robust Optical Character Recognition (OCR) system for languages, such as Arabic with curs...