New architectural optical character recognition approach for cursive fonts: the historical Maghrebian font as an example

Authors
Omar I.O. [1]
Haboubi S. [1]
Benzarti F. [1]
Affiliation
[1] LR11ES17 – Signals, Images and Information Technologies Laboratory, National Engineering School of Tunis, University of Tunis El Manar, BP 37, Tunis
Keywords
cursive historical documents; deep learning; Maghrebian font database; OCR; optical character recognition
DOI
10.1504/IJICA.2023.129361
Abstract
The historical Maghrebian font is an Arabic font that was dominant in several North African regions. Many cultural and scientific works of major importance were written in this font. This paper presents a complete OCR architecture designed to handle the specific characteristics of the historical Maghrebian font, together with a full design and the accuracy of each module. The novel OCR architecture includes a binarisation module based on deep neural networks with an accuracy of 98.1%. It also involves three segmentation tasks based on deep learning approaches: text/non-text separation, column division and connected-component segmentation. The classification task is based on the DenseNet model and reaches an accuracy of 98.95%. The post-processing module likewise relies on deep learning, using sequential modelling, with an accuracy of 81.3%. The architecture also includes a user-feedback stage with an accuracy of 94.7%. The total system accuracy is 89.06%. Copyright © 2023 Inderscience Enterprises Ltd.
Pages: 91-103
Number of pages: 12