Improving Handwritten Arabic Text Recognition Using an Adaptive Data-Augmentation Algorithm

被引:2
|
作者
Eltay, Mohamed [1 ]
Zidouri, Abdelmalek [1 ]
Ahmad, Irfan [2 ]
Elarian, Yousef [3 ]
机构
[1] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Elect Engn Dept, Dhahran, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
[3] Cambrian Coll, Sudbury, ON, Canada
关键词
Handwriting recognition; Deep Learning Neural Network; Data augmentation; Recurrent Neural Network; Connectionist temporal classification;
D O I
10.1007/978-3-030-86198-8_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has increased the performance of classification and object detection, but it generally requires large amounts of labeled data for training. In this paper, we introduce a new data augmentation algorithm that promotes diversity between classes, representing the characters of the Arabic script, and can balance samples between different classes. This algorithm gives each word in the lexicon a weight. The weight of a word is based on the occurrence probabilities of the characters constituting the word. Minority classes are given higher weight as compared to the classes frequently occurring in the text. The data augmentation technique was evaluated on a handwritten word recognition task using the publicly available IFN/ENIT and AHDB datasets. We see significant improvement in results by employing our data augmentation technique, and we achieve state-of-the-art results on both datasets.
引用
收藏
页码:322 / 335
页数:14
相关论文
共 50 条
  • [31] Data augmentation for handwritten digit recognition using generative adversarial networks
    Jha, Ganesh
    Cecotti, Hubert
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (47-48) : 35055 - 35068
  • [32] High Performance Offline Handwritten Chinese Text Recognition with a New Data Preprocessing and Augmentation Pipeline
    Xie, Canyu
    Lai, Songxuan
    Liao, Qianying
    Jin, Lianwen
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 45 - 59
  • [33] Improving Handwritten Chinese Text Recognition by Confidence Transformation
    Wang, Qiu-Feng
    Yin, Fei
    Liu, Cheng-Lin
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 518 - 522
  • [34] Improved linear density technique for segmentation in Arabic handwritten text recognition
    Husam Ahmed Al Hamad
    Laith Abualigah
    Mohammad Shehab
    Khalil H. A. Al-Shqeerat
    Mohammad Otair
    Multimedia Tools and Applications, 2022, 81 : 28531 - 28558
  • [35] Compositional Generalization for Multi-Label Text Classification: A Data-Augmentation Approach
    Chai, Yuyang
    Li, Zhuang
    Liu, Jiahui
    Chen, Lei
    Li, Fei
    Ji, Donghong
    Teng, Chong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17727 - 17735
  • [36] A transformer-based approach for Arabic offline handwritten text recognition
    Saleh Momeni
    Bagher BabaAli
    Signal, Image and Video Processing, 2024, 18 : 3053 - 3062
  • [37] Out of vocabulary word detection and recovery in Arabic handwritten text recognition
    Jemni, Sana Khamekhem
    Kessentini, Yousri
    Kanoun, Slim
    PATTERN RECOGNITION, 2019, 93 : 507 - 520
  • [38] Improved linear density technique for segmentation in Arabic handwritten text recognition
    Al Hamad, Husam Ahmed
    Abualigah, Laith
    Shehab, Mohammad
    Al-Shqeerat, Khalil H. A.
    Otair, Mohammad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (20) : 28531 - 28558
  • [39] Class-Based Contextual Modeling for Handwritten Arabic Text Recognition
    Ahmad, Irfan
    Fink, Gernot A.
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 554 - 559
  • [40] A transformer-based approach for Arabic offline handwritten text recognition
    Momeni, Saleh
    Babaali, Bagher
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3053 - 3062