A Multi-Layer Holistic Approach for Cursive Text Recognition

被引:0
|
作者
Umair, Muhammad [1 ]
Zubair, Muhammad [1 ]
Dawood, Farhan [1 ]
Ashfaq, Sarim [1 ]
Bhatti, Muhammad Shahid [1 ]
Hijji, Mohammad [2 ]
Sohail, Abid [3 ]
机构
[1] Univ Cent Punjab, Fac Informat Technol & Comp Sci, Lahore 54000, Pakistan
[2] Univ Tabuk, Fac Comp & Informat Technol, Tabuk 47921, Saudi Arabia
[3] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus, Lahore 54000, Pakistan
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期
关键词
text detection; text recognition; natural language processing; natural language understanding; machine learning; deep learning applications; URDU-TEXT; FEATURES; VIDEO;
D O I
10.3390/app122412652
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Urdu is a widely spoken and narrated language in several South-Asian countries and communities worldwide. It is relatively hard to recognize Urdu text compared to other languages due to its cursive writing style. The Urdu text script belongs to a non-Latin cursive family script like Arabic, Hindi and Chinese. Urdu is written in several writing styles, among which 'Nastaleeq' is the most popular and widely used font style. A gap still poses a challenge for localization/detection and recognition of Urdu Nastaleeq text as it follows modified version of Arabic script. This research study presents a methodology to recognize and classify Urdu text in Nastaleeq font, regardless of the text position in the image. The proposed solution is comprised of a two-step methodology. In the first step, text detection is performed using the Connected Component Analysis (CCA) and Long Short-Term Memory Neural Network (LSTM). In the second step, a hybrid Convolution Neural Network and Recurrent Neural Network (CNN-RNN) architecture is deployed to recognize the detected text. The image containing Urdu text is binarized and segmented to produce a single-line text image fed to the hybrid CNN-RNN model, which recognizes the text and saves it in a text file. The proposed technique outperforms the existing ones by achieving an overall accuracy of 97.47%.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] FPGA implementation of multi-layer perceptrons for speech recognition
    Ortigosa, EM
    Ortigosa, PM
    Cañas, A
    Ros, E
    Agís, R
    Ortega, J
    FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, 2003, 2778 : 1048 - 1052
  • [22] Pollen Recognition Using a Multi-Layer Hierarchical Classifier
    Daood, Amar
    Ribeiro, Eraldo
    Bush, Mark
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3091 - 3096
  • [23] Fuzzy multi-layer perceptron for binary pattern recognition
    Canuto, AMP
    Howells, WGJ
    Fairhurst, MC
    SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND ITS APPLICATIONS, 1999, (465): : 260 - 264
  • [24] A multi-layer grid approach for fluid animation
    Tan Jie
    Yang XuBo
    Zhao Xin
    Yang ZhanXin
    SCIENCE CHINA-INFORMATION SCIENCES, 2011, 54 (11) : 2269 - 2278
  • [25] A multi-layer grid approach for fluid animation
    TAN Jie 1
    2 Digital Art Lab
    Science China(Information Sciences), 2011, 54 (11) : 2269 - 2278
  • [26] Multi-layer Filtering Approach for Map Images
    Chen, Minjie
    Xu, Mantao
    Fraenti, Pasi
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3953 - 3956
  • [27] A Multi-layer Approach for Customizing Business Services
    Taher, Yehia
    Haque, Rafiqul
    Parkin, Michael
    van den Heuvell, Willem-Jan
    Richardson, Ita
    Whelan, Eoin
    E-COMMERCE AND WEB TECHNOLOGIES, 2011, 85 : 64 - +
  • [28] A multi-layer grid approach for fluid animation
    Jie Tan
    XuBo Yang
    Xin Zhao
    ZhanXin Yang
    Science China Information Sciences, 2011, 54 : 2269 - 2278
  • [29] Application of a multi-layer approach for morphological modelling
    Steetzel, HJ
    de Vroeg, H
    COASTAL SEDIMENTS '99, VOLS 1-3, 1999, : 2206 - 2218
  • [30] A multi-layer planning approach for WDM Networks
    VanParys, W
    Wauters, N
    Demeester, P
    PHOTONIC NETWORKS, OPTICAL TECHNOLOGY AND INFRASTRUCTURE - NOC '97, 1997, : 87 - 94