Computationally efficient recognition of unconstrained handwritten Urdu script using BERT with vision transformers

被引:0
|
作者
Ganai, Aejaz Farooq [1 ]
Khursheed, Farida [1 ]
机构
[1] Natl Inst Technol, Dept Elect & Commun Engn, Srinagar 190006, India
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 34期
关键词
Handwritten Urdu; Vision transformer; BERT model; OCR; Multi-layer perceptron; Multi-head attention; Ligature error rate (LER);
D O I
10.1007/s00521-023-08976-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The handwritten Urdu text recognition is a challenging area in pattern recognition and has gained much importance after the rapid emergence of several camera-based applications on portable devices, which facilitate the daily processing of plenty of images. The various challenges encountered in handwritten Urdu recognition are writer-dependent variations amongst different Urdu writers, irregular positioning of diacritics associated with a character, context sensitivity of characters, and cursive nature of Urdu script. These challenges also make it difficult to formulate a large generalized handwritten Urdu dataset. The state-of-the-art approaches proposed for the recognition of handwritten Urdu text mostly focus on implicit approaches. These approaches are error prone and do not yield significant recognition rates. The holistic approach of handwritten Urdu recognition has been least explored to date and the existing holistic approaches are complex and time consuming as they mostly rely on convolutional/recurrent neural networks or statistical methods. Hence, in this research, a novel and efficient vision transformer-based methodology using BERT architecture has been proposed to the recognition of handwritten Urdu text. The proposed approach uses convolution feature maps as word embedding in the transformer that makes full use of the powerful attention mechanism of the vision transformer to focus on a particular connected component (ligature) in handwritten Urdu text. To cover the entire Urdu corpus, we have pre-trained several benchmark handwritten Urdu datasets such as UNHD and NUST-UHWR and tested unconstrained handwritten Urdu text. In comparison with the state-of-the-art techniques, the experimental evaluation of the proposed approach reports the better results of the various performance parameters such as Ligature Error Rate (LER), precision, sensitivity, specificity, f1-score, and accuracy. The great success of the proposed approach lies in (i) the significant reduction of training time needed to train a large handwritten Urdu dataset, (ii) minimum computational complexity as there is no overhead of diacritic separation and re-association as used in most of the state-of-the-art techniques, and (iii) the proposed approach registers a new state-of-the-art LER of up to 3% only on unconstrained handwritten Urdu text.
引用
收藏
页码:24161 / 24177
页数:17
相关论文
共 50 条
  • [1] Computationally efficient recognition of unconstrained handwritten Urdu script using BERT with vision transformers
    Aejaz Farooq Ganai
    Farida Khursheed
    Neural Computing and Applications, 2023, 35 : 24161 - 24177
  • [2] Numeral Recognition for Urdu Script in Unconstrained Environment
    Razzak, Muhammad Imran
    Hussain, S. A.
    Sher, Muhammad
    ICET: 2009 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2009, : 44 - +
  • [3] A novel holistic unconstrained handwritten urdu recognition system using convolutional neural networks
    Aejaz Farooq Ganai
    Farida Khursheed
    International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 351 - 371
  • [4] A novel holistic unconstrained handwritten urdu recognition system using convolutional neural networks
    Ganai, Aejaz Farooq
    Khursheed, Farida
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (04) : 351 - 371
  • [5] UNCONSTRAINED EAR RECOGNITION USING TRANSFORMERS
    Alejo, Marwin B.
    JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2021, 7 (04): : 326 - 336
  • [6] Recognition of handwritten Urdu digits using Shape Context
    Yusuf, M
    Haider, T
    INMIC 2004: 8th International Multitopic Conference, Proceedings, 2004, : 569 - 572
  • [7] Printed Urdu Nastalique Script Recognition Using Analytical Approach
    Mir, Sabahat
    Zaman, Safdar
    Anwar, Muhammad Waqas
    2015 13TH INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY (FIT), 2015, : 334 - 340
  • [8] Recognition of Urdu Handwritten Characters Using Convolutional Neural Network
    Husnain, Mujtaba
    Missen, Malik Muhammad Saad
    Mumtaz, Shahzad
    Jhanidr, Muhammad Zeeshan
    Coustaty, Mickael
    Luqman, Muhammad Muzzamil
    Ogier, Jean-Marc
    Choi, Gyu Sang
    APPLIED SCIENCES-BASEL, 2019, 9 (13):
  • [9] Unconstrained handwritten character recognition using metaclasses of characters
    Koerich, AL
    Kalva, PR
    2005 International Conference on Image Processing (ICIP), Vols 1-5, 2005, : 2045 - 2048
  • [10] An Efficient Local Word Augment Approach for Mongolian Handwritten Script Recognition
    Zhang, Haoran
    Chen, Wei
    Su, Xiangdong
    Guo, Hui
    Xu, Huali
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 429 - 443