Towards better long-tailed oracle character recognition with adversarial data augmentation

被引:16
|
作者
Li, Jing [1 ,4 ]
Wang, Qiu-Feng [1 ]
Huang, Kaizhu [2 ]
Yang, Xi [1 ]
Zhang, Rui [3 ]
Goulermas, John Y. [4 ]
机构
[1] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou, Peoples R China
[2] Duke Kunshan Univ, Data Sci Res Ctr, Suzhou, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Sci, Suzhou, Peoples R China
[4] Univ Liverpool, Dept Comp Sci, Liverpool, England
基金
中国国家自然科学基金;
关键词
Oracle character recognition; Long tail; Data imbalance; Data augmentation; Mixup strategy; Generative adversarial networks;
D O I
10.1016/j.patcog.2023.109534
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deciphering oracle bone script is of great significance to the study of ancient Chinese culture as well as archaeology. Although recent studies on oracle character recognition have made substantial progress, they still suffer from the long-tailed data situation that results in a noticeable performance drop on the tail classes. To mitigate this issue, we propose a generative adversarial framework to augment oracle characters in the problematic classes. In this framework, the generator produces synthetic data through convex combinations of all the available samples in the corresponding classes, and is further optimized through adversarial learning with the classifier and simultaneously the discriminator. Meanwhile, we in-troduce Repatch to generalize samples in the generator. Since tail classes do not have sufficient data for convex combinations, we propose the TailMix mechanism to generate suitable tail class samples from other classes. Experimental results show that our proposed algorithm obtains remarkable performance in oracle character recognition and achieves new state-of-the-art average (total) accuracy with 86.03% (89.46%), 86.54% (93.86%), 95.22% (96.17%) on the three datasets Oracle-AYNU, OBC306 and Oracle-20K, respectively.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Domain Balancing: Face Recognition on Long-Tailed Domains
    Cao, Dong
    Zhu, Xiangyu
    Huang, Xingyu
    Guo, Jianzhu
    Lei, Zhen
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5670 - 5678
  • [42] Long-tailed recognition via key attribute learning
    Fu, Yu
    Han, Jungong
    Chang, Xiang
    Chen, Changrui
    Shang, Changjing
    Shen, Qiang
    NEUROCOMPUTING, 2025, 627
  • [43] Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification
    Chen, Xiaohua
    Zhou, Yucan
    Wu, Dayan
    Zhang, Wanqian
    Zhou, Yu
    Li, Bo
    Wang, Weiping
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 356 - 364
  • [44] Targeted Supervised Contrastive Learning for Long-Tailed Recognition
    Li, Tianhong
    Cao, Peng
    Yuan, Yuan
    Fan, Lijie
    Yang, Yuzhe
    Feris, Rogerio
    Indyk, Piotr
    Katabi, Dina
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6908 - 6918
  • [45] A dual progressive strategy for long-tailed visual recognition
    Liang, Hong
    Cao, Guoqing
    Shao, Mingwen
    Zhang, Qian
    MACHINE VISION AND APPLICATIONS, 2024, 35 (01)
  • [46] Local pseudo-attributes for long-tailed recognition
    Kim, Dong-Jin
    Ke, Tsung-Wei
    Yu, Stella X.
    PATTERN RECOGNITION LETTERS, 2023, 172 : 51 - 57
  • [47] Nested Collaborative Learning for Long-Tailed Visual Recognition
    Li, Jun
    Tan, Zichang
    Wan, Jun
    Lei, Zhen
    Guo, Guodong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6939 - 6948
  • [48] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition
    Du, Chaoqun
    Wang, Yulin
    Song, Shiji
    Huang, Gao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 5890 - 5904
  • [49] Easy balanced mixing for long-tailed data
    Zhu, Zonghai
    Xing, Huanlai
    Xu, Yuge
    KNOWLEDGE-BASED SYSTEMS, 2022, 248
  • [50] Beyond the Label Distribution Prior for Long-Tailed Recognition
    Li, Ming
    Cao, Liujuan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 792 - 803