Towards better long-tailed oracle character recognition with adversarial data augmentation

Cited by: 16
Authors
Li, Jing [1, 4]
Wang, Qiu-Feng [1]
Huang, Kaizhu [2]
Yang, Xi [1]
Zhang, Rui [3]
Goulermas, John Y. [4]
Affiliations
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou, Peoples R China
[2] Duke Kunshan Univ, Data Sci Res Ctr, Suzhou, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Sci, Suzhou, Peoples R China
[4] Univ Liverpool, Dept Comp Sci, Liverpool, England
Funding
National Natural Science Foundation of China;
Keywords
Oracle character recognition; Long tail; Data imbalance; Data augmentation; Mixup strategy; Generative adversarial networks;
DOI
10.1016/j.patcog.2023.109534
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deciphering oracle bone script is of great significance to the study of ancient Chinese culture as well as archaeology. Although recent studies on oracle character recognition have made substantial progress, they still suffer from long-tailed data, which causes a noticeable performance drop on the tail classes. To mitigate this issue, we propose a generative adversarial framework to augment oracle characters in the problematic classes. In this framework, the generator produces synthetic data through convex combinations of all the available samples in the corresponding classes, and is further optimized through adversarial learning with both the classifier and the discriminator. Meanwhile, we introduce Repatch to generalize samples in the generator. Since tail classes do not have sufficient data for convex combinations, we propose the TailMix mechanism to generate suitable tail class samples from other classes. Experimental results show that our proposed algorithm obtains remarkable performance in oracle character recognition and achieves new state-of-the-art average (total) accuracy of 86.03% (89.46%), 86.54% (93.86%), and 95.22% (96.17%) on the three datasets Oracle-AYNU, OBC306 and Oracle-20K, respectively. © 2023 Elsevier Ltd. All rights reserved.
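The convex-combination idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' generator: in the paper the combination is optimized adversarially, whereas here the weights are simply drawn at random from a Dirichlet distribution. The function name `convex_combination`, the image size, and the sample count are hypothetical choices made for illustration only.

```python
import numpy as np

def convex_combination(samples, rng):
    """Mix all samples of one class using random convex weights.

    Assumption: weights are random (Dirichlet), not learned as in the paper.
    """
    samples = np.asarray(samples, dtype=np.float64)
    # A Dirichlet draw is non-negative and sums to 1, so it is a valid
    # set of convex-combination coefficients.
    weights = rng.dirichlet(np.ones(samples.shape[0]))
    # Weighted sum over the sample axis yields one synthetic image.
    synthetic = np.tensordot(weights, samples, axes=1)
    return synthetic, weights

rng = np.random.default_rng(0)
# Five hypothetical 8x8 grayscale images standing in for one character class.
class_images = rng.random((5, 8, 8))
synthetic, weights = convex_combination(class_images, rng)
```

Because the weights are convex, every pixel of the synthetic image lies between the per-pixel minimum and maximum of the source images, which is why such mixing needs enough samples per class to be useful and why tail classes call for a separate mechanism.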
Pages: 13