A Multi-Layer Holistic Approach for Cursive Text Recognition

被引:0
|
作者
Umair, Muhammad [1 ]
Zubair, Muhammad [1 ]
Dawood, Farhan [1 ]
Ashfaq, Sarim [1 ]
Bhatti, Muhammad Shahid [1 ]
Hijji, Mohammad [2 ]
Sohail, Abid [3 ]
机构
[1] Univ Cent Punjab, Fac Informat Technol & Comp Sci, Lahore 54000, Pakistan
[2] Univ Tabuk, Fac Comp & Informat Technol, Tabuk 47921, Saudi Arabia
[3] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus, Lahore 54000, Pakistan
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期
关键词
text detection; text recognition; natural language processing; natural language understanding; machine learning; deep learning applications; URDU-TEXT; FEATURES; VIDEO;
D O I
10.3390/app122412652
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Urdu is a widely spoken and narrated language in several South-Asian countries and communities worldwide. It is relatively hard to recognize Urdu text compared to other languages due to its cursive writing style. The Urdu text script belongs to a non-Latin cursive family script like Arabic, Hindi and Chinese. Urdu is written in several writing styles, among which 'Nastaleeq' is the most popular and widely used font style. A gap still poses a challenge for localization/detection and recognition of Urdu Nastaleeq text as it follows modified version of Arabic script. This research study presents a methodology to recognize and classify Urdu text in Nastaleeq font, regardless of the text position in the image. The proposed solution is comprised of a two-step methodology. In the first step, text detection is performed using the Connected Component Analysis (CCA) and Long Short-Term Memory Neural Network (LSTM). In the second step, a hybrid Convolution Neural Network and Recurrent Neural Network (CNN-RNN) architecture is deployed to recognize the detected text. The image containing Urdu text is binarized and segmented to produce a single-line text image fed to the hybrid CNN-RNN model, which recognizes the text and saves it in a text file. The proposed technique outperforms the existing ones by achieving an overall accuracy of 97.47%.
引用
收藏
页数:16
相关论文
共 50 条
  • [11] An algorithmic approach to multi-layer wrinkling
    Lejeune, Emma
    Javili, Ali
    Linder, Christian
    EXTREME MECHANICS LETTERS, 2016, 7 : 10 - 17
  • [12] A multi-layer omics approach to cancer
    Denise Waldron
    Nature Reviews Genetics, 2016, 17 : 437 - 437
  • [13] A multi-layer omics approach to cancer
    Waldron, Denise
    NATURE REVIEWS GENETICS, 2016, 17 (08) : 437 - 437
  • [14] Cursive Stroke Sequencing for Handwritten Text Documents Recognition
    Panwar, Subhash
    Nain, Neeta
    2013 FOURTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2013,
  • [15] Offline recognition of large vocabulary cursive handwritten text
    Vinciarelli, A
    Bengio, S
    Bunke, H
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 1101 - 1105
  • [16] ARABIC CHARACTER-RECOGNITION SYSTEM - A STATISTICAL APPROACH FOR RECOGNIZING CURSIVE TYPEWRITTEN TEXT
    ELDABI, SS
    RAMSIS, R
    KAMEL, A
    PATTERN RECOGNITION, 1990, 23 (05) : 485 - 495
  • [17] Handwritten Text Recognition (HTR) for TibetanManuscripts in Cursive Script
    Griffiths, Rachael M.
    REVUE D ETUDES TIBETAINES, 2024, (72): : 43 - 51
  • [18] Detection and recognition of cursive text from video frames
    Ali Mirza
    Ossama Zeshan
    Muhammad Atif
    Imran Siddiqi
    EURASIP Journal on Image and Video Processing, 2020
  • [19] Detection and recognition of cursive text from video frames
    Mirza, Ali
    Zeshan, Ossama
    Atif, Muhammad
    Siddiqi, Imran
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)
  • [20] Multi-layer Local Graph Words for Object Recognition
    Karaman, Svebor
    Benois-Pineau, Jenny
    Megret, Remi
    Bugeau, Aurelie
    ADVANCES IN MULTIMEDIA MODELING, 2012, 7131 : 29 - +