Improving Handwritten Arabic Text Recognition Using an Adaptive Data-Augmentation Algorithm

Cited by: 2
Authors
Eltay, Mohamed [1]
Zidouri, Abdelmalek [1]
Ahmad, Irfan [2]
Elarian, Yousef [3]
Affiliations
[1] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Elect Engn Dept, Dhahran, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
[3] Cambrian Coll, Sudbury, ON, Canada
Keywords
Handwriting recognition; Deep learning neural network; Data augmentation; Recurrent neural network; Connectionist temporal classification
DOI
10.1007/978-3-030-86198-8_23
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep learning has increased the performance of classification and object detection, but it generally requires large amounts of labeled data for training. In this paper, we introduce a new data augmentation algorithm that promotes diversity between classes, representing the characters of the Arabic script, and can balance samples between different classes. This algorithm gives each word in the lexicon a weight. The weight of a word is based on the occurrence probabilities of the characters constituting the word. Minority classes are given higher weight as compared to the classes frequently occurring in the text. The data augmentation technique was evaluated on a handwritten word recognition task using the publicly available IFN/ENIT and AHDB datasets. We see significant improvement in results by employing our data augmentation technique, and we achieve state-of-the-art results on both datasets.
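The weighting idea described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so everything below is an assumption made for illustration: the function names `char_probabilities` and `word_weights` are hypothetical, the weight of a word is taken to be the mean inverse occurrence probability of its characters, and the toy lexicon uses Latin placeholders rather than Arabic characters. This is not the authors' algorithm.

```python
from collections import Counter


def char_probabilities(lexicon_counts):
    """Estimate character occurrence probabilities from word frequencies.

    lexicon_counts maps each (non-empty) word to how often it occurs.
    """
    counts = Counter()
    for word, freq in lexicon_counts.items():
        for ch in word:
            counts[ch] += freq
    total = sum(counts.values())
    return {ch: n / total for ch, n in counts.items()}


def word_weights(lexicon_counts):
    """Assign each word a weight that grows as its characters get rarer.

    Assumed formula (not from the paper): mean of 1 / p(ch) over the
    characters of the word, normalised so that all weights sum to 1.
    """
    probs = char_probabilities(lexicon_counts)
    raw = {
        word: sum(1.0 / probs[ch] for ch in word) / len(word)
        for word in lexicon_counts
    }
    norm = sum(raw.values())
    return {word: value / norm for word, value in raw.items()}


# Toy lexicon: "a" is a frequent character, "b" is a rare one.
toy_lexicon = {"aa": 3, "ab": 1}
weights = word_weights(toy_lexicon)
for word, weight in sorted(weights.items(), key=lambda kv: -kv[1]):
    print(word, round(weight, 3))
```

Under these assumptions, words containing rare characters receive larger weights and could be sampled more often when generating augmented training images, which is one way to move the character-class distribution toward balance.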
Pages: 322-335
Number of pages: 14