A novel holistic unconstrained handwritten urdu recognition system using convolutional neural networks

被引:2
|
作者
Ganai, Aejaz Farooq [1 ]
Khursheed, Farida [1 ]
机构
[1] NIT Srinagar India, Dept E&C Engn, Srinagar, India
关键词
Handwritten urdu; Optical character recognition(OCR); Urdu nastaliq handwritten dataset (UNHD); Convolutional neural network(CNN);
D O I
10.1007/s10032-022-00414-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten Urdu recognition has been the least explored to date due to unavailability of a standard hand-written Urdu dataset, huge variation among writing styles of different Urdu writers, irregular positioning of diacritics associated with ligatures, similarity in shape of some Urdu characters in writing, and unavailability of an efficient learning and training technique. Few researchers have proposed the handwritten Urdu datasets among which only Urdu Nastaliq handwritten dataset (UNHD) is publicly available. The UNHD contains ligatures of only up to five characters and does not cover the entire Urdu ligature corpus. Hence, we present a novel comprehensive handwritten Urdu dataset named UHLD for the 'Urdu Handwritten Ligature Dataset':-which consists of ligatures of up to seven-character length and covers most of the ligature corpus of the Urdu language. The UHLD is written by both genders independent of age of person, paper color, paper type (blank or ruled), ink color, pen type. We propose an unconstrained handwritten Urdu recognition system that can recognize handwritten Urdu ligatures with up to six characters. A new robust algorithm has also been proposed here that is able to divide a complete ligature into primary and secondary components with 98% accuracy on a large Urdu dataset. Our proposed holistic handwritten Urdu recognition system ensures independent recognition of both primary and secondary components of a word/ligature. The proposed recognition technique is transformation invariant and computationally efficient and achieves a better recognition rate of 97% for UHLD and 93% for UNHD.
引用
收藏
页码:351 / 371
页数:21
相关论文
共 50 条
  • [1] A novel holistic unconstrained handwritten urdu recognition system using convolutional neural networks
    Aejaz Farooq Ganai
    Farida Khursheed
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 351 - 371
  • [2] Holistic Handwritten Uyghur Word Recognition Using Convolutional Neural Networks
    Simayi, Wujiahemaiti
    Hamdulla, Askar
    Liu, Cheng-Lin
    [J]. PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 846 - 851
  • [3] Recognition of Urdu Handwritten Characters Using Convolutional Neural Network
    Husnain, Mujtaba
    Missen, Malik Muhammad Saad
    Mumtaz, Shahzad
    Jhanidr, Muhammad Zeeshan
    Coustaty, Mickael
    Luqman, Muhammad Muzzamil
    Ogier, Jean-Marc
    Choi, Gyu Sang
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (13):
  • [4] Recognition of Urdu Handwritten Alphabet Using Convolutional Neural Network (CNN)
    Ahmed, Gulzar
    Alyas, Tahir
    Iqbal, Muhammad Waseem
    Ashraf, Muhammad Usman
    Alghamdi, Ahmed Mohammed
    Bahaddad, Adel A.
    Almarhabi, Khalid Ali
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 2967 - 2984
  • [5] Unconstrained Handwritten Word Recognition Using a Combination of Neural Networks
    Luna-Perez, Rodolfo
    Gomez-Gil, Pilar
    [J]. WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, VOLS 1 AND 2, 2010, : 525 - 528
  • [6] Recognition of printed Urdu ligatures using convolutional neural networks
    Uddin, Israr
    Javed, Nizwa
    Siddiqi, Imran
    Khalid, Shehzad
    Khurshid, Khurram
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (03)
  • [7] Handwritten Hangul recognition using deep convolutional neural networks
    Kim, In-Jung
    Xie, Xiaohui
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (01) : 1 - 13
  • [8] Handwritten Bangla Numeral Recognition using Convolutional Neural Networks
    Paul, Jaya
    Sarkar, Anasua
    [J]. 2018 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS, MATERIALS ENGINEERING & NANO-TECHNOLOGY (IEMENTECH), 2018, : 64 - 67
  • [9] Handwritten Hangul recognition using deep convolutional neural networks
    In-Jung Kim
    Xiaohui Xie
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2015, 18 : 1 - 13
  • [10] Urdu Natural Scene Character Recognition using Convolutional Neural Networks
    Ali, Asghar
    Pickering, Mark
    Shafi, Kamran
    [J]. 2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2018, : 29 - 34