Towards better long-tailed oracle character recognition with adversarial data augmentation

Cited by: 16
Authors
Li, Jing [1, 4]
Wang, Qiu-Feng [1]
Huang, Kaizhu [2]
Yang, Xi [1]
Zhang, Rui [3]
Goulermas, John Y. [4]
Affiliations
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou, Peoples R China
[2] Duke Kunshan Univ, Data Sci Res Ctr, Suzhou, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Sci, Suzhou, Peoples R China
[4] Univ Liverpool, Dept Comp Sci, Liverpool, England
Funding
National Natural Science Foundation of China;
Keywords
Oracle character recognition; Long tail; Data imbalance; Data augmentation; Mixup strategy; Generative adversarial networks;
DOI
10.1016/j.patcog.2023.109534
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deciphering oracle bone script is of great significance to the study of ancient Chinese culture and archaeology. Although recent studies on oracle character recognition have made substantial progress, they still suffer from long-tailed data distributions, which cause a noticeable performance drop on the tail classes. To mitigate this issue, we propose a generative adversarial framework to augment oracle characters in the problematic classes. In this framework, the generator produces synthetic data through convex combinations of all the available samples in the corresponding classes, and is further optimized through adversarial learning with both the classifier and the discriminator. Meanwhile, we introduce Repatch to generalize samples in the generator. Since tail classes do not have sufficient data for convex combinations, we propose the TailMix mechanism to generate suitable tail-class samples from other classes. Experimental results show that our proposed algorithm obtains remarkable performance in oracle character recognition and achieves new state-of-the-art average (total) accuracy of 86.03% (89.46%), 86.54% (93.86%), and 95.22% (96.17%) on the three datasets Oracle-AYNU, OBC306, and Oracle-20K, respectively. (c) 2023 Elsevier Ltd. All rights reserved.
Pages: 13
Related papers
50 in total
  • [31] Distilling Virtual Examples for Long-tailed Recognition
    He, Yin-Yin
    Wu, Jianxin
    Wei, Xiu-Shen
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 235 - 244
  • [32] Open Long-Tailed Recognition in a Dynamic World
    Liu, Ziwei
    Miao, Zhongqi
    Zhan, Xiaohang
    Wang, Jiayun
    Gong, Boqing
    Yu, Stella X.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1836 - 1851
  • [33] Visualizing bivariate long-tailed data
    Dyer, Justin S.
    Owen, Art B.
    ELECTRONIC JOURNAL OF STATISTICS, 2011, 5 : 642 - 668
  • [34] Normalizing Batch Normalization for Long-Tailed Recognition
    Bao, Yuxiang
    Kang, Guoliang
    Yang, Linlin
    Duan, Xiaoyue
    Zhao, Bo
    Zhang, Baochang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 209 - 220
  • [35] Unequal-training for Deep Face Recognition with Long-tailed Noisy Data
    Zhong, Yaoyao
    Deng, Weihong
    Wang, Mei
    Hu, Jiani
    Peng, Jianteng
    Tao, Xunqiang
    Huang, Yaohai
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7804 - 7813
  • [36] Class-Balanced Regularization for Long-Tailed Recognition
    Xu, Yuge
    Lyu, Chuanlong
    NEURAL PROCESSING LETTERS, 2024, 56 (03)
  • [37] Feature fusion network for long-tailed visual recognition
    Zhou, Xuesong
    Zhai, Junhai
    Cao, Yang
    PATTERN RECOGNITION, 2023, 144
  • [38] Balanced Product of Calibrated Experts for Long-Tailed Recognition
    Aimar, Emanuel Sanchez
    Jonnarth, Arvi
    Felsberg, Michael
    Kuhlmann, Marco
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19967 - 19977
  • [39] Disentangling Label Distribution for Long-tailed Visual Recognition
    Hong, Youngkyu
    Han, Seungju
    Choi, Kwanghee
    Seo, Seokjun
    Kim, Beomsu
    Chang, Buru
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6622 - 6632
  • [40] A dual progressive strategy for long-tailed visual recognition
    Liang, Hong
    Cao, Guoqing
    Shao, Mingwen
    Zhang, Qian
    MACHINE VISION AND APPLICATIONS, 2024, 35