Improving Handwritten Arabic Text Recognition Using an Adaptive Data-Augmentation Algorithm

被引:2
|
作者
Eltay, Mohamed [1 ]
Zidouri, Abdelmalek [1 ]
Ahmad, Irfan [2 ]
Elarian, Yousef [3 ]
机构
[1] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Elect Engn Dept, Dhahran, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
[3] Cambrian Coll, Sudbury, ON, Canada
关键词
Handwriting recognition; Deep Learning Neural Network; Data augmentation; Recurrent Neural Network; Connectionist temporal classification;
D O I
10.1007/978-3-030-86198-8_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has increased the performance of classification and object detection, but it generally requires large amounts of labeled data for training. In this paper, we introduce a new data augmentation algorithm that promotes diversity between classes, representing the characters of the Arabic script, and can balance samples between different classes. This algorithm gives each word in the lexicon a weight. The weight of a word is based on the occurrence probabilities of the characters constituting the word. Minority classes are given higher weight as compared to the classes frequently occurring in the text. The data augmentation technique was evaluated on a handwritten word recognition task using the publicly available IFN/ENIT and AHDB datasets. We see significant improvement in results by employing our data augmentation technique, and we achieve state-of-the-art results on both datasets.
引用
收藏
页码:322 / 335
页数:14
相关论文
共 50 条
  • [21] A Hybrid Approach for Deep Generative Handwritten Arabic Text Recognition
    Lamtougui, Hicham
    El Moubtahij, Hicham
    Fouadi, Hassan
    Satori, Khalid
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (10) : 1138 - 1147
  • [22] A Review of Feature Extraction Techniques for Handwritten Arabic Text Recognition
    El qacimy, Bouchra
    Hammouch, Ahmed
    Ait Kerroum, Mounir
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ELECTRICAL AND INFORMATION TECHNOLOGIES (ICEIT 2015), 2015, : 241 - 245
  • [23] A Database for Arabic Handwritten Text Image Recognition and Writer Identification
    Mezghani, Anis
    Kanoun, Slim
    Khemakhem, Maher
    El Abed, Haikal
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 399 - 402
  • [24] Handwritten arabic text recognition using principal component analysis and support vector machines
    Al-Saqqar F.
    Al-Diabat M.
    Aloun M.
    Al-Shatnawi A.M.
    Intl. J. Adv. Comput. Sci. Appl., 2019, 12 (195-200): : 195 - 200
  • [25] Handwritten Arabic Text Recognition using Principal Component Analysis and Support Vector Machines
    Al-Saqqar, Faisal
    Al-Diabat, Mofleh
    Aloun, Mesbah
    AL-Shatnawi, Atallah M.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (12) : 195 - 200
  • [26] Artificial Immune Algorithm for Handwritten Arabic Word Recognition
    Nemmour, Hassiba
    Chibani, Youcef
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (02) : 186 - 194
  • [27] Data augmentation for handwritten digit recognition using generative adversarial networks
    Ganesh Jha
    Hubert Cecotti
    Multimedia Tools and Applications, 2020, 79 : 35055 - 35068
  • [28] Segmentation Algorithm for Arabic Handwritten Text based on Contour Analysis
    Osman, Yusra
    2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONICS ENGINEERING (ICCEEE), 2013, : 447 - 452
  • [29] Data augmentation for handwritten digit recognition using generative adversarial networks
    Jha, Ganesh
    Cecotti, Hubert
    Multimedia Tools and Applications, 2020, 79 (47-48): : 35055 - 35068
  • [30] A Study of Data Augmentation for Handwritten Character Recognition Using Deep Learning
    Hayashi, Taihei
    Gyohten, Keiji
    Ohki, Hidehiro
    Takami, Toshiya
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 552 - 557