A Multi-Layer Holistic Approach for Cursive Text Recognition

被引:0
|
作者
Umair, Muhammad [1 ]
Zubair, Muhammad [1 ]
Dawood, Farhan [1 ]
Ashfaq, Sarim [1 ]
Bhatti, Muhammad Shahid [1 ]
Hijji, Mohammad [2 ]
Sohail, Abid [3 ]
机构
[1] Univ Cent Punjab, Fac Informat Technol & Comp Sci, Lahore 54000, Pakistan
[2] Univ Tabuk, Fac Comp & Informat Technol, Tabuk 47921, Saudi Arabia
[3] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus, Lahore 54000, Pakistan
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期
关键词
text detection; text recognition; natural language processing; natural language understanding; machine learning; deep learning applications; URDU-TEXT; FEATURES; VIDEO;
D O I
10.3390/app122412652
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Urdu is a widely spoken and narrated language in several South-Asian countries and communities worldwide. It is relatively hard to recognize Urdu text compared to other languages due to its cursive writing style. The Urdu text script belongs to a non-Latin cursive family script like Arabic, Hindi and Chinese. Urdu is written in several writing styles, among which 'Nastaleeq' is the most popular and widely used font style. A gap still poses a challenge for localization/detection and recognition of Urdu Nastaleeq text as it follows modified version of Arabic script. This research study presents a methodology to recognize and classify Urdu text in Nastaleeq font, regardless of the text position in the image. The proposed solution is comprised of a two-step methodology. In the first step, text detection is performed using the Connected Component Analysis (CCA) and Long Short-Term Memory Neural Network (LSTM). In the second step, a hybrid Convolution Neural Network and Recurrent Neural Network (CNN-RNN) architecture is deployed to recognize the detected text. The image containing Urdu text is binarized and segmented to produce a single-line text image fed to the hybrid CNN-RNN model, which recognizes the text and saves it in a text file. The proposed technique outperforms the existing ones by achieving an overall accuracy of 97.47%.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Multi-Layer Sparse Coding: The Holistic Way
    Aberdam, Aviad
    Sulam, Jeremias
    Elad, Michael
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2019, 1 (01): : 46 - 77
  • [2] A segmentation based adaptive approach for cursive handwritten text recognition
    Verma, Brijesh
    Lee, Hong
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2212 - 2216
  • [3] Toward Multi-Layer Holistic Evaluation of System Designs
    Kleanthous, Marios
    Sazeides, Yiannakis
    Ozer, Emre
    Nicopoulos, Chrysostomos
    Nikolaou, Panagiota
    Hadjilambrou, Zacharias
    IEEE COMPUTER ARCHITECTURE LETTERS, 2016, 15 (01) : 58 - 61
  • [4] Multi-layer Lacunarity for Texture Recognition
    Mazurek, Przemyslaw
    Oszutowska-Mazurek, Dorota
    COMPUTER VISION AND GRAPHICS, ICCVG 2016, 2016, 9972 : 174 - 183
  • [5] Multi-Layer Perceptrons for Subvocal Recognition
    Coe, Brian
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 293 - 300
  • [6] Multi-layer boosting for pattern recognition
    Fleuret, Francois
    PATTERN RECOGNITION LETTERS, 2009, 30 (03) : 237 - 241
  • [7] Holistic cursive word recognition based on perceptual features
    Ruiz-Pinales, Jose
    Jaime-Rivas, Rene
    Castro-Bleda, Maria Jose
    PATTERN RECOGNITION LETTERS, 2007, 28 (13) : 1600 - 1609
  • [8] Multi-Layer Text Classification with Voting for Consumer Reviews
    Zhu, Yan
    Moh, Melody
    Moh, Teng-Sheng
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1991 - 1999
  • [9] Attention model with multi-layer supervision for text Classification
    Yue, Chunyi
    Cao, Hanqiang
    Xu, Guoping
    Dong, Youli
    2020 5TH INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2020), 2020, : 103 - 109
  • [10] Multi-Layer Discourse Annotation of a Dutch Text Corpus
    Redeker, Gisela
    Berzlanovich, Ildiko
    van der Vliet, Nynke
    Bouma, Gosse
    Egg, Markus
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2820 - 2825