We present an adaptive Hindi OCR implemented as part of a rapidly retargetable language tool ef-fort. The system includes: script identification, character segmentation, training sample creation, and character recognition. In script identification, Hindi words are identified from bilingual or multilingual documents based on features of the Devanagari script or using Support Vector Ma-chines. Identified words are then segmented into individual characters in the next step, where the composite characters are identified and further segmented based on the structural properties of the script and statistical information. Segmented characters are recognized using generalized Hausdorff image comparison (GHIC) and postprocessing is applied to improve...
In a country like India a number of scripts (a total of 13) are used to write official languages (a ...
In a multi-lingual country like India, in most of the official papers, school text books, magazines,...
This paper gives a summary of the current research in optical character recognition (OCR) systems. M...
In this paper, we present an adaptive Hindi OCR using generalized Hausdor image comparison implemen...
Digital document processing is becoming popular for applications to office and library automation, b...
systems for Indic scripts are not robust enough for recognizing arbitrary collection of printed docu...
India is a multilingual multi-script country. In every state of India there are two languages one is...
Devanagari, the most accepted script in India and Hindi is the only dialect which is widely spoken a...
This paper presents the recognition of handwritten Hindi Characters based on the modified exponentia...
Abstract-- Optical character recognition, usually abbreviated to OCR, is the mechanical or electroni...
India is a multi-lingual country. A significantly large number of scripts are used to represent thes...
Development of OCRs for Indian script is an active area of activity today. Indian scripts present gr...
Hindi is a national language of India spoken in many states in our countries, like Bihar, Uttar Prad...
The Handwritten Character Recognition has been a challenging task for the past many decades. This is...
In this paper, we propose a recognition scheme for the Indian script of Hindi. Recognition accuracy ...
In a country like India a number of scripts (a total of 13) are used to write official languages (a ...
In a multi-lingual country like India, in most of the official papers, school text books, magazines,...
This paper gives a summary of the current research in optical character recognition (OCR) systems. M...
In this paper, we present an adaptive Hindi OCR using generalized Hausdor image comparison implemen...
Digital document processing is becoming popular for applications to office and library automation, b...
systems for Indic scripts are not robust enough for recognizing arbitrary collection of printed docu...
India is a multilingual multi-script country. In every state of India there are two languages one is...
Devanagari, the most accepted script in India and Hindi is the only dialect which is widely spoken a...
This paper presents the recognition of handwritten Hindi Characters based on the modified exponentia...
Abstract-- Optical character recognition, usually abbreviated to OCR, is the mechanical or electroni...
India is a multi-lingual country. A significantly large number of scripts are used to represent thes...
Development of OCRs for Indian script is an active area of activity today. Indian scripts present gr...
Hindi is a national language of India spoken in many states in our countries, like Bihar, Uttar Prad...
The Handwritten Character Recognition has been a challenging task for the past many decades. This is...
In this paper, we propose a recognition scheme for the Indian script of Hindi. Recognition accuracy ...
In a country like India a number of scripts (a total of 13) are used to write official languages (a ...
In a multi-lingual country like India, in most of the official papers, school text books, magazines,...
This paper gives a summary of the current research in optical character recognition (OCR) systems. M...