ADOCRNet: A Deep Learning OCR for Arabic Documents Recognition

被引:3
|
作者
Mosbah, Lamia [1 ]
Moalla, Ikram [1 ,2 ]
Hamdani, Tarek M. [1 ,3 ]
Neji, Bilel [4 ]
Beyrouthy, Taha [4 ]
Alimi, Adel M. [1 ,5 ]
机构
[1] Univ Sfax, Natl Engn Sch Sfax ENIS, ReGIM Lab, REs Grp Intelligent Machines, Sfax 3038, Tunisia
[2] Al Baha Univ, Coll Comp Sci & Informat Technol, Al Bahah 65511, Saudi Arabia
[3] Univ Monastir, Higher Inst Comp Sci Mahdia ISIMa, Monastir 5000, Tunisia
[4] Amer Univ Middle East, Coll Engn & Technol, Egaila 54200, Kuwait
[5] Univ Johannesburg, Fac Engn & Built Environm, Dept Elect & Elect Engn Sci, Johannesburg 3038, South Africa
关键词
Arabic; document recognition; CNNs; CTC; deep learning; BLSTM; OCR; NEURAL-NETWORKS; CHARACTER-RECOGNITION;
D O I
10.1109/ACCESS.2024.3379530
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, Optical character recognition (OCR) has experienced a resurgence of interest especially for contemporary Arabic data. In fact, OCR development for printed and handwritten Arabic script is still a challenging task. These challenges are due to the specific characteristics of the Arabic script. In this work, we attempt to address these challenges by creating a deep learning OCR for Arabic document recognition called ADOCRNet. It is a novel deep learning framework whose architecture is built of layers of Convolutional Neural Networks (CNNs) and Bidirectional Long Short-Term Memory (BLSTM) trained using Connectionist Temporal Classification (CTC) algorithm. In order to assess the performance of our OCR, the proposed system is performed on two printed text datasets which are P-KHATT (text line images) and APTI (word images). It's also evaluated on a handwritten Arabic text dataset IFN/ENIT (word images). According to the practical tests, the conceived model achieves strength recognition rates on the three datasets. ADOCRNet reaches a Character Error Rate (CER) of 0.01% on the P-KHATT dataset, 0.03% on the APTI dataset and a Word Error Rate (WER) of 1.09% on the IFN/ENIT dataset, which significantly outperforms the outcomes of the current systems.
引用
收藏
页码:55620 / 55631
页数:12
相关论文
共 50 条
  • [21] Arabic Text Documents Recommendation Using Joint Deep Representations Learning
    Meddeb, Ons
    Maraoui, Mohsen
    Zrigui, Mounir
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 812 - 821
  • [22] Arabic Sign Language Recognition Using Deep Learning Models
    Al-Barham, Muhammad
    Abu Sa'aleek, Ahmad
    Al-Odat, Mohammad
    Hamad, Ghada
    Al-Yaman, Musa
    Elnagar, Ashraf
    2022 13TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2022, : 226 - 231
  • [23] Deep Learning based Isolated Arabic Scene Character Recognition
    Bin Ahmed, Saad
    Naz, Saeeda
    Razzak, Muhammad Imran
    Yousaf, Rubiyah
    2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 46 - 51
  • [24] Arabic Sign Language Recognition Using Deep Machine Learning
    Suliman, Wael
    Deriche, Mohamed
    Luqman, Hamzah
    Mohandes, Mohamed
    2021 4TH INTERNATIONAL SYMPOSIUM ON ADVANCED ELECTRICAL AND COMMUNICATION TECHNOLOGIES (ISAECT), 2021,
  • [25] Learning Deep Wavelet Networks for Recognition System of Arabic Words
    Bouallegue, Amira
    Hassairi, Salima
    Ejbali, Ridha
    Zaied, Mourad
    INTERNATIONAL JOINT CONFERENCE SOCO'16- CISIS'16-ICEUTE'16, 2017, 527 : 498 - 507
  • [26] Deep Learning-Based Segmentation of Connected Components in Arabic Handwritten Documents
    Gader, Takwa Ben Aïcha
    Echi, Afef Kacem
    Communications in Computer and Information Science, 2022, 1589 CCIS : 93 - 106
  • [27] Word-based correction tor retrieval of arabic OCR degraded documents
    Magdy, Walid
    Darwish, Kareem
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2006, 4209 : 205 - 216
  • [28] Deep Learning, Ensemble and Supervised Machine Learning for Arabic Speech Emotion Recognition
    Ismaiel, Wahiba
    Alhalangy, Abdalilah
    Mohamed, Adil. O. Y.
    Musa, Abdalla Ibrahim Abdalla
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2024, 14 (02) : 13757 - 13764
  • [29] Arabic named entity recognition in crime documents
    Asharef, M.
    Omar, N.
    Albared, M.
    Journal of Theoretical and Applied Information Technology, 2012, 44 (01) : 1 - 6
  • [30] Digital Learning for Summarizing Arabic Documents
    Boudabous, Mohamed Mahdi
    Maaloul, Mohamed Hedi
    Belguith, Lamia Hadrich
    ADVANCES IN NATURAL LANGUAGE PROCESSING, 2010, 6233 : 79 - +