This paper discusses the characteristics of the Urdu script and the Nastaleeq style, and presents a simple yet novel and robust technique for recognizing printed Urdu script without a lexicon. Urdu, a member of the Arabic script family, is cursive and inherently complex. The main complexity of compound (connected) Urdu text lies not in its connections but in the shapes a character takes when it is placed at the beginning, in the middle, or at the end of a word. The character recognition technique presented here uses this inherent complexity of the Urdu script to solve the problem. A word is scanned and analyzed for its level of complexity; each point where the level of complexity changes is marked as a character boundary, and the segmented character is fed to neural networks. A prototype of...
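The segmentation idea in the abstract above (scan a word, track its level of complexity, cut where the level changes) can be illustrated with a minimal sketch. The abstract does not define how complexity is measured, so this example assumes a stand-in measure: the number of separate ink runs in each pixel column of a binary word image. The function names `column_complexity`, `change_points`, and `segment` are hypothetical and not taken from the paper, and the neural network stage is omitted.

```python
from typing import List

Image = List[List[int]]  # binary image as rows of pixels: 1 = ink, 0 = background


def column_complexity(image: Image) -> List[int]:
    """Count the ink runs (0 -> 1 transitions, top to bottom) in each column.

    Assumption: this count stands in for the "level of complexity";
    the abstract does not specify the actual measure.
    """
    if not image:
        return []
    levels = []
    for x in range(len(image[0])):
        runs = 0
        previous = 0
        for row in image:
            if row[x] == 1 and previous == 0:
                runs += 1
            previous = row[x]
        levels.append(runs)
    return levels


def change_points(levels: List[int]) -> List[int]:
    """Return the column indices where the level differs from the previous column."""
    return [x for x in range(1, len(levels)) if levels[x] != levels[x - 1]]


def segment(image: Image, cuts: List[int]) -> List[Image]:
    """Split the image into vertical slices at the given cut columns."""
    if not image:
        return []
    bounds = [0] + cuts + [len(image[0])]
    return [[row[a:b] for row in image] for a, b in zip(bounds, bounds[1:])]


# Toy 4x6 "word" image, used only for illustration.
word = [
    [0, 1, 1, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 1, 1, 0, 1, 1],
    [1, 1, 1, 1, 1, 1],
]

levels = column_complexity(word)
cuts = change_points(levels)
pieces = segment(word, cuts)
print(levels, cuts, [len(piece[0]) for piece in pieces])
```

In a full system each slice in `pieces` would be normalized and passed to a classifier. A practical version would probably also need to smooth the level sequence so that single-pixel noise does not produce spurious cuts.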
Finite state technology has long been used to model NLP (Natural Language Processing) applicat...
Urdu is a widely spoken and narrated language in several South-Asian countries and communities world...
Arabic script character recognition is a challenging task due to the complexity of the script and the huge num...
This paper deals with an Optical Character Recognition system for printed Urdu, a popular Pakistani/...
Character recognition in cursive scripts or handwritten Latin script has attracted researchers’ atte...
Optical Character Recognition (OCR) is a technique that generates text from an image. Recognizing th...
Extracting handwritten text is one of the most important components of digitizing information and ma...
In this paper we have presented a novel segmentation technique for the implementation of an OCR (Opt...
Optical character recognition has been a popular field for researchers during the last decade, which...
This paper presents a segmentation-free optical character recognition system for printed Ur...
Work on the problem of handwritten text recognition in Urdu script has been an active research area....
The electronically available Urdu data is in image form, which is very difficult to process. Printed ...
Optical character recognition is a technique that is used to recognize printed and handwritten text...
OCR (Optical Character Recognition) is a technology in which a text image is used to understand and wri...