Deep Learning Application for Handwritten Arabic Word Recognition

被引:2
|
作者
Alzrrog, Nori [1 ]
Bousquet, Jean-Francois [1 ]
El-Feghi, Idris [2 ]
机构
[1] Dalhousie Univ, Elect & Comp Engn Dept, Halifax, NS B3H 4R2, Canada
[2] Univ Misurata, Fac Informat Technol, Misurata, Libya
关键词
component; formatting; style; styling; insert;
D O I
10.1109/CCECE49351.2022.9918375
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Automatic handwriting recognition is the process of converting online and offline letters or words as a graphical form into its text format. Automatic Arabic handwriting words recognition using deep learning neural networks is still in the early stages in terms of research. There are no general, complete, and reliable Arabic Handwritten words database (lexicon) that can be used as a reference or a benchmark for all researchers who want to extend the work on automatic Arabic handwriting word recognition. Also, many historic Arabic manuscripts have deteriorated because of inappropriate storage and most of them have not been digitized due to the lack of reliable database that can be used to recognize the words of Arabic manuscripts. Deep Convolutional Neural Networks (DCNNs) can be used to solve the problems of automatic Arabic handwriting words recognition. In this work, a new DCNN algorithm applied to a new dataset of Handwritten Arabic words representing the seven days of the week named Arabic Handwritten Weekdays Dataset (AHWD) has been programmed, tested, and analyzed. Our dataset contains 21357 words equally distributed between the seven classes and prepared by 1000. So, it can be used for training and testing on a reliable DCNN model that will be able, after training to generalize to new datasets. The model works by training a (DCNN) model on a balanced-randomly-selected dataset using different structures. The results are improved by adding drop-out, image regularization, proper learning rate to avoid overfitting of the data. Finally, a blind test has been performed on the hidden test set and the performance was reported using a confusion matrix and learning curves as a validation tool for the model. Results show that our model's performance is promising, achieving accuracy rate of 0.9939 with error rate of 0.0461 using AHWD dataset, and accuracy rate of 0.9971 with error rate of 0.0171 using IFN/ENIT dataset.
引用
收藏
页码:95 / 100
页数:6
相关论文
共 50 条
  • [21] Handwritten Arabic and Roman word recognition using holistic approach
    Malakar, Samir
    Sahoo, Samanway
    Chakraborty, Anuran
    Sarkar, Ram
    Nasipuri, Mita
    VISUAL COMPUTER, 2023, 39 (07): : 2909 - 2932
  • [22] Dynamic Hierarchical Bayesian Network for Arabic Handwritten Word Recognition
    Jayech, Khaoula
    Trimech, Nesrine
    Mahjoub, Mohamed Ali
    Ben Amara, Najoua Essoukri
    2013 FOURTH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY AND ACCESSIBILITY (ICTA), 2013,
  • [23] A Novel Approach for the Recognition of a Wide Arabic Handwritten Word Lexicon
    Ben Cheikh, I.
    Belaid, A.
    Kacem, A.
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3055 - 3058
  • [24] Handwritten Arabic and Roman word recognition using holistic approach
    Samir Malakar
    Samanway Sahoo
    Anuran Chakraborty
    Ram Sarkar
    Mita Nasipuri
    The Visual Computer, 2023, 39 : 2909 - 2932
  • [25] Arabic Handwritten Word Recognition based on Dynamic Bayesian Network
    Jayech, Khaoula
    Mahjoub, Mohamed Ali
    Ben Amara, Najoua
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (6B) : 1024 - 1031
  • [26] Arabic Handwritten Word Recognition Based on Stationary Wavelet Transform Technique using Machine Learning
    Al-Shatnawi, Atallah Mahmoud
    Al-Saqqar, Faisal
    Souri, Alireza
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (03)
  • [27] A new Arabic handwritten character recognition deep learning system (AHCR-DLS)
    Balaha, Hossam Magdy
    Ali, Hesham Arafat
    Saraya, Mohamed
    Badawy, Mahmoud
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (11): : 6325 - 6367
  • [28] Deep Learning-Based Child Handwritten Arabic Character Recognition and Handwriting Discrimination
    Alwagdani, Maram Saleh
    Jaha, Emad Sami
    SENSORS, 2023, 23 (15)
  • [29] HACR-MDL: HANDWRITTEN ARABIC CHARACTER RECOGNITION MODEL USING DEEP LEARNING
    Elagamy, Mazen Nabil
    Khalil, Miar Mamdouh
    Ismail, Esraa
    GEOSPATIAL WEEK 2023, VOL. 10-1, 2023, : 123 - 128
  • [30] A new Arabic handwritten character recognition deep learning system (AHCR-DLS)
    Hossam Magdy Balaha
    Hesham Arafat Ali
    Mohamed Saraya
    Mahmoud Badawy
    Neural Computing and Applications, 2021, 33 : 6325 - 6367