A Multi-Layer Holistic Approach for Cursive Text Recognition

被引:0
|
作者
Umair, Muhammad [1 ]
Zubair, Muhammad [1 ]
Dawood, Farhan [1 ]
Ashfaq, Sarim [1 ]
Bhatti, Muhammad Shahid [1 ]
Hijji, Mohammad [2 ]
Sohail, Abid [3 ]
机构
[1] Univ Cent Punjab, Fac Informat Technol & Comp Sci, Lahore 54000, Pakistan
[2] Univ Tabuk, Fac Comp & Informat Technol, Tabuk 47921, Saudi Arabia
[3] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus, Lahore 54000, Pakistan
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期
关键词
text detection; text recognition; natural language processing; natural language understanding; machine learning; deep learning applications; URDU-TEXT; FEATURES; VIDEO;
D O I
10.3390/app122412652
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Urdu is a widely spoken and narrated language in several South-Asian countries and communities worldwide. It is relatively hard to recognize Urdu text compared to other languages due to its cursive writing style. The Urdu text script belongs to a non-Latin cursive family script like Arabic, Hindi and Chinese. Urdu is written in several writing styles, among which 'Nastaleeq' is the most popular and widely used font style. A gap still poses a challenge for localization/detection and recognition of Urdu Nastaleeq text as it follows modified version of Arabic script. This research study presents a methodology to recognize and classify Urdu text in Nastaleeq font, regardless of the text position in the image. The proposed solution is comprised of a two-step methodology. In the first step, text detection is performed using the Connected Component Analysis (CCA) and Long Short-Term Memory Neural Network (LSTM). In the second step, a hybrid Convolution Neural Network and Recurrent Neural Network (CNN-RNN) architecture is deployed to recognize the detected text. The image containing Urdu text is binarized and segmented to produce a single-line text image fed to the hybrid CNN-RNN model, which recognizes the text and saves it in a text file. The proposed technique outperforms the existing ones by achieving an overall accuracy of 97.47%.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] MLRMV: Multi-layer representation for multi-view action recognition
    Liu, Zhigang
    Yin, Ziyang
    Wu, Yin
    IMAGE AND VISION COMPUTING, 2021, 116 (116)
  • [32] Approach to human activity multi-scale analysis and recognition based on multi-layer dynamic Bayesian network
    Du, You-Tian
    Chen, Feng
    Xu, Wen-Li
    Zidonghua Xuebao/ Acta Automatica Sinica, 2009, 35 (03): : 225 - 232
  • [33] A Multi-Layer Fusion-Based Facial Expression Recognition Approach with Optimal Weighted AUs
    Jia, Xibin
    Liu, Shuangqiao
    Powers, David
    Cardiff, Barry
    APPLIED SCIENCES-BASEL, 2017, 7 (02):
  • [34] An approach to empirical Optical Character recognition paradigm using Multi-Layer Perceptorn Neural Network
    Abdullah-al-Mamun, Md.
    Alam, Tanjina
    2015 18th International Conference on Computer and Information Technology (ICCIT), 2015, : 132 - 137
  • [35] Recognition of cursive video text using a deep learning framework
    Mirza, Ali
    Siddiqi, Imran
    IET IMAGE PROCESSING, 2020, 14 (14) : 3444 - 3455
  • [36] Analysis of Cursive Text Recognition Systems: A Systematic Literature Review
    Khan, Sulaiman
    Nazir, Shah
    Khan, Habib Ullah
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (07)
  • [37] Impact of Pre-Processing on Recognition of Cursive Video Text
    Mirza, Ali
    Siddiqi, Imran
    Mustufa, Syed Ghulam
    Hussain, Mazahir
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT I, 2020, 11867 : 565 - 576
  • [38] A multi-layer quantum neural networks recognition system for handwritten digital recognition
    Zhu, Daqi
    Wu, Rushi
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 1, PROCEEDINGS, 2007, : 718 - +
  • [39] Offline recognition of syntax-constrained cursive handwritten text
    González, J
    Salvador, I
    Toselli, AH
    Juan, A
    Vidal, E
    Casacuberta, F
    ADVANCES IN PATTERN RECOGNITION, 2000, 1876 : 143 - 153
  • [40] Off-line recognition of cursive handwritten Czech text
    Smrz, P
    Hrbácek, S
    Martinásek, M
    SOFSEM'98: THEORY AND PRACTICE OF INFORMATICS, 1998, 1521 : 437 - 442