ADOCRNet: A Deep Learning OCR for Arabic Documents Recognition

被引:3
|
作者
Mosbah, Lamia [1 ]
Moalla, Ikram [1 ,2 ]
Hamdani, Tarek M. [1 ,3 ]
Neji, Bilel [4 ]
Beyrouthy, Taha [4 ]
Alimi, Adel M. [1 ,5 ]
机构
[1] Univ Sfax, Natl Engn Sch Sfax ENIS, ReGIM Lab, REs Grp Intelligent Machines, Sfax 3038, Tunisia
[2] Al Baha Univ, Coll Comp Sci & Informat Technol, Al Bahah 65511, Saudi Arabia
[3] Univ Monastir, Higher Inst Comp Sci Mahdia ISIMa, Monastir 5000, Tunisia
[4] Amer Univ Middle East, Coll Engn & Technol, Egaila 54200, Kuwait
[5] Univ Johannesburg, Fac Engn & Built Environm, Dept Elect & Elect Engn Sci, Johannesburg 3038, South Africa
关键词
Arabic; document recognition; CNNs; CTC; deep learning; BLSTM; OCR; NEURAL-NETWORKS; CHARACTER-RECOGNITION;
D O I
10.1109/ACCESS.2024.3379530
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, Optical character recognition (OCR) has experienced a resurgence of interest especially for contemporary Arabic data. In fact, OCR development for printed and handwritten Arabic script is still a challenging task. These challenges are due to the specific characteristics of the Arabic script. In this work, we attempt to address these challenges by creating a deep learning OCR for Arabic document recognition called ADOCRNet. It is a novel deep learning framework whose architecture is built of layers of Convolutional Neural Networks (CNNs) and Bidirectional Long Short-Term Memory (BLSTM) trained using Connectionist Temporal Classification (CTC) algorithm. In order to assess the performance of our OCR, the proposed system is performed on two printed text datasets which are P-KHATT (text line images) and APTI (word images). It's also evaluated on a handwritten Arabic text dataset IFN/ENIT (word images). According to the practical tests, the conceived model achieves strength recognition rates on the three datasets. ADOCRNet reaches a Character Error Rate (CER) of 0.01% on the P-KHATT dataset, 0.03% on the APTI dataset and a Word Error Rate (WER) of 1.09% on the IFN/ENIT dataset, which significantly outperforms the outcomes of the current systems.
引用
收藏
页码:55620 / 55631
页数:12
相关论文
共 50 条
  • [11] Arabic Handwritten Recognition Using Deep Learning: A Survey
    Alrobah, Naseem
    Albahli, Saleh
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 9943 - 9963
  • [12] A deep learning approach for handwritten Arabic names recognition
    Mustafa M.E.
    Elbashir M.K.
    International Journal of Advanced Computer Science and Applications, 2020, 11 (01): : 678 - 682
  • [13] Deep Learning Approach for Arabic Named Entity Recognition
    Gridach, Mourad
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 439 - 451
  • [14] A Hybrid Deep Learning Model for Arabic Text Recognition
    Fasha, Mohammad
    Hammo, Bassam
    Obeid, Nadim
    AlWidian, Jabir
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (08) : 122 - 130
  • [15] Spoken Arabic Digits Recognition Using Deep Learning
    Wazir, Abdulaziz Saleh Mahfoudh B. A.
    Chuah, Joon Huang
    2019 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND INTELLIGENT SYSTEMS (I2CACIS), 2019, : 339 - 344
  • [16] Deep Learning Application for Handwritten Arabic Word Recognition
    Alzrrog, Nori
    Bousquet, Jean-Francois
    El-Feghi, Idris
    2022 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2022, : 95 - 100
  • [17] Arabic Handwritten Recognition Using Deep Learning: A Survey
    Naseem Alrobah
    Saleh Albahli
    Arabian Journal for Science and Engineering, 2022, 47 : 9943 - 9963
  • [18] Multimodal Arabic emotion recognition using deep learning
    Al Roken, Noora
    Barlas, Gerassimos
    SPEECH COMMUNICATION, 2023, 155
  • [19] Arabic Name Entity Recognition Using Deep Learning
    Awad, David
    Sabty, Caroline
    Elmahdy, Mohamed
    Abdennadher, Slim
    STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018, 2018, 11171 : 105 - 116