The optical character recognition of Urdu-like cursive scripts

被引:92
|
作者
Naz, Saeeda [1 ]
Hayat, Khizar [1 ,4 ]
Razzak, Muhammad Imran [2 ]
Anwar, Muhammad Waqas [1 ]
Madani, Sajjad A. [1 ]
Khan, Samee U. [3 ]
机构
[1] COMSATS Inst Informat Technol, Abbottabad, Pakistan
[2] King Saud Abdulaziz Univ Hlth Sci, Riyadh, Saudi Arabia
[3] N Dakota State Univ, Fargo, ND 58108 USA
[4] Univ Nizwa, Birkat Al Mawz, Oman
关键词
Optical character recognition; Ligature; Character; SPOTTING BASED RETRIEVAL; FUZZY-LOGIC; ONLINE; DATABASE; OFFLINE; FEATURES;
D O I
10.1016/j.patcog.2013.09.037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We survey the optical character recognition (OCR) literature with reference to the Urdu-like cursive scripts. In particular, the Urdu, Pushto, and Sindhi languages are discussed, with the emphasis being on the Nasta'liq and Naskh scripts. Before detaining the OCR works, the peculiarities of the Urdu-like scripts are outlined, which are followed by the presentation of the available text image databases. For the sake of clarity, the various attempts are grouped into three parts, namely: (a) printed, (b) handwritten, and (c) online character recognition. Within each part, the works are analyzed par rapport a typical OCR pipeline with an emphasis on the preprocessing, segmentation, feature extraction, classification, and recognition. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1229 / 1248
页数:20
相关论文
共 50 条
  • [1] Optical Character Recognition System for Nastalique Urdu-Like Script Languages Using Supervised Learning
    Rizvi, S. S. R.
    Sagheer, A.
    Adnan, K.
    Muhammad, A.
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (10)
  • [2] Deep Extreme Learning Machine-Based Optical Character Recognition System for Nastalique Urdu-Like Script Languages
    Rizvi, Syed Saqib Raza
    Khan, Muhammad Adnan
    Abbas, Sagheer
    Asadullah, Muhammad
    Anwer, Nida
    Fatima, Areej
    [J]. COMPUTER JOURNAL, 2022, 65 (02): : 331 - 344
  • [3] Optical character recognition for cursive handwriting
    Arica, N
    Yarman-Vural, FT
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (06) : 801 - 813
  • [4] Urdu Nastaleeq Optical Character Recognition
    Ahmad, Zaheer
    Orakzai, Jehanzeb Khan
    Shamsher, Inam
    Adnan, Awais
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 26, PARTS 1 AND 2, DECEMBER 2007, 2007, 26 : 249 - 252
  • [5] COMPUTER RECOGNITION OF ARABIC CURSIVE SCRIPTS
    ELSHEIKH, TS
    GUINDI, RM
    [J]. PATTERN RECOGNITION, 1988, 21 (04) : 293 - 302
  • [6] Cursive character recognition system
    Toscano, Karina
    Sanchez, Gabriel
    Nakano, Mariko
    Perez, Hector
    Yasuhara, Makoto
    [J]. CERMA2006: ELECTRONICS, ROBOTICS AND AUTOMOTIVE MECHANICS CONFERENCE VOL 2, PROCEEDINGS, 2006, : 62 - +
  • [7] A survey on optical character recognition for Bangla and Devanagari scripts
    SOUMEN BAG
    GAURAV HARIT
    [J]. Sadhana, 2013, 38 : 133 - 168
  • [8] A survey on optical character recognition for Bangla and Devanagari scripts
    Bag, Soumen
    Harit, Gaurav
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2013, 38 (01): : 133 - 168
  • [9] A Finite State Model for Urdu Nastalique Optical Character Recognition
    Sattar, Sohail Abdul
    Shams-ul Haque
    Pathan, Mahmood Khan
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (09): : 116 - 122
  • [10] Optical Character Recognition System for Urdu Words in Nastaliq Font
    Shabbir, Safia
    Siddiqi, Imran
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (05) : 567 - 576