Arabic Optical Character Recognition: A Review

Cited by: 6
Author
Alghyaline, Salah [1]
Affiliation
[1] World Islamic Sci & Educ Univ, Dept Comp Sci, Amman 110111947, Jordan
Keywords
Arabic Optical Character Recognition (OCR); Arabic OCR software; Arabic OCR datasets; Arabic OCR evaluation; SEGMENTATION-FREE; TEXT RECOGNITION; TRANSFORM; SCRIPTS; SYSTEM; ROBUST; MODEL
DOI
10.32604/cmes.2022.024555
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
This study reviews contributions to Arabic Optical Character Recognition (OCR) from the last decade, so that interested researchers can learn the existing techniques and extend or adapt them. It describes the characteristics of the Arabic language, the different types of OCR systems, the stages of an Arabic OCR system, researchers' contributions at each stage, and the evaluation metrics for OCR. It also reviews the existing Arabic OCR datasets and their characteristics, and implements some of the preprocessing and segmentation stages of Arabic OCR. The study compares the existing methods in terms of recognition accuracy; the comparison covers commercial and open-source systems in addition to researchers' methods. Arabic is morphologically rich and written in cursive, with dots and diacritics above and below the characters. Most existing approaches in the literature were evaluated on isolated characters or isolated words in a controlled environment, and few were tested on page-level scripts. Some comparative studies show that the accuracy of existing commercial Arabic OCR systems is low, under 75% for printed text, and that further improvement is needed. Moreover, most current approaches are offline OCR systems, and there has been no notable contribution to online OCR systems.
Pages: 1825-1861
Number of pages: 37