13 May 2019 Recognition of printed Urdu ligatures using convolutional neural networks
Israr Uddin, Nizwa Javed, Imran A. Siddiqi, Shehzad Khalid, Khurram Khurshid
Author Affiliations +
Abstract
We present a holistic technique for recognition of text in cursive scripts using printed Urdu ligatures as a case study. Convolutional neural networks (CNNs) are trained on high-frequency ligature clusters for feature extraction and classification. A query ligature presented to the system is first divided into primary and secondary ligatures that are separately recognized and later associated in a postprocessing step to recognize the complete ligature. Experiments are carried out using transfer learning on pretrained networks as well as by training a network from scratch. The technique is evaluated on ligatures extracted from two standard databases of printed Urdu text, Urdu printed text image (UPTI) and Center of Language Engineering (CLE), as well as by combining the ligatures of the two datasets. The system realizes high recognition rates of 97.81% and 89.20% on the UPTI and CLE databases, respectively.
© 2019 SPIE and IS&T 1017-9909/2019/$25.00 © 2019 SPIE and IS&T
Israr Uddin, Nizwa Javed, Imran A. Siddiqi, Shehzad Khalid, and Khurram Khurshid "Recognition of printed Urdu ligatures using convolutional neural networks," Journal of Electronic Imaging 28(3), 033004 (13 May 2019). https://doi.org/10.1117/1.JEI.28.3.033004
Received: 29 January 2019; Accepted: 23 April 2019; Published: 13 May 2019
Lens.org Logo
CITATIONS
Cited by 8 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Databases

Convolutional neural networks

Optical character recognition

Feature extraction

Network architectures

Image segmentation

Error analysis

Back to Top