A Multi-Layer Holistic Approach for Cursive Text Recognition

被引：0

作者：

Umair, Muhammad ^{[1
]}

Zubair, Muhammad ^{[1
]}

Dawood, Farhan ^{[1
]}

Ashfaq, Sarim ^{[1
]}

Bhatti, Muhammad Shahid ^{[1
]}

Hijji, Mohammad ^{[2
]}

Sohail, Abid ^{[3
]}

机构：

[1] Univ Cent Punjab, Fac Informat Technol & Comp Sci, Lahore 54000, Pakistan

[2] Univ Tabuk, Fac Comp & Informat Technol, Tabuk 47921, Saudi Arabia

[3] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus, Lahore 54000, Pakistan

来源：

APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期

关键词：

text detection; text recognition; natural language processing; natural language understanding; machine learning; deep learning applications; URDU-TEXT; FEATURES; VIDEO;

D O I：

10.3390/app122412652

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Urdu is a widely spoken and narrated language in several South-Asian countries and communities worldwide. It is relatively hard to recognize Urdu text compared to other languages due to its cursive writing style. The Urdu text script belongs to a non-Latin cursive family script like Arabic, Hindi and Chinese. Urdu is written in several writing styles, among which 'Nastaleeq' is the most popular and widely used font style. A gap still poses a challenge for localization/detection and recognition of Urdu Nastaleeq text as it follows modified version of Arabic script. This research study presents a methodology to recognize and classify Urdu text in Nastaleeq font, regardless of the text position in the image. The proposed solution is comprised of a two-step methodology. In the first step, text detection is performed using the Connected Component Analysis (CCA) and Long Short-Term Memory Neural Network (LSTM). In the second step, a hybrid Convolution Neural Network and Recurrent Neural Network (CNN-RNN) architecture is deployed to recognize the detected text. The image containing Urdu text is binarized and segmented to produce a single-line text image fed to the hybrid CNN-RNN model, which recognizes the text and saves it in a text file. The proposed technique outperforms the existing ones by achieving an overall accuracy of 97.47%.

引用

页数：16

共 50 条

[1] Multi-Layer Sparse Coding: The Holistic Way
Aberdam, Aviad
Sulam, Jeremias
Elad, Michael
SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2019, 1 (01): : 46 - 77
[2] A segmentation based adaptive approach for cursive handwritten text recognition
Verma, Brijesh
Lee, Hong
2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2212 - 2216
[3] Toward Multi-Layer Holistic Evaluation of System Designs
Kleanthous, Marios
Sazeides, Yiannakis
Ozer, Emre
Nicopoulos, Chrysostomos
Nikolaou, Panagiota
Hadjilambrou, Zacharias
IEEE COMPUTER ARCHITECTURE LETTERS, 2016, 15 (01) : 58 - 61
[4] Multi-layer Lacunarity for Texture Recognition
Mazurek, Przemyslaw
Oszutowska-Mazurek, Dorota
COMPUTER VISION AND GRAPHICS, ICCVG 2016, 2016, 9972 : 174 - 183
[5] Multi-Layer Perceptrons for Subvocal Recognition
Coe, Brian
2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 293 - 300
[6] Multi-layer boosting for pattern recognition
Fleuret, Francois
PATTERN RECOGNITION LETTERS, 2009, 30 (03) : 237 - 241
[7] Holistic cursive word recognition based on perceptual features
Ruiz-Pinales, Jose
Jaime-Rivas, Rene
Castro-Bleda, Maria Jose
PATTERN RECOGNITION LETTERS, 2007, 28 (13) : 1600 - 1609
[8] Multi-Layer Text Classification with Voting for Consumer Reviews
Zhu, Yan
Moh, Melody
Moh, Teng-Sheng
2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1991 - 1999
[9] Attention model with multi-layer supervision for text Classification
Yue, Chunyi
Cao, Hanqiang
Xu, Guoping
Dong, Youli
2020 5TH INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2020), 2020, : 103 - 109
[10] Multi-Layer Discourse Annotation of a Dutch Text Corpus
Redeker, Gisela
Berzlanovich, Ildiko
van der Vliet, Nynke
Bouma, Gosse
Egg, Markus
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2820 - 2825

← 1 2 3 4 5 →