Semantic-Guided Class-Imbalance Learning Model for Zero-Shot Image Classification

被引:16
|
作者
Ji, Zhong [1 ]
Yu, Xuejie [1 ]
Yu, Yunlong [2 ]
Pang, Yanwei [1 ]
Zhang, Zhongfei [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[3] State Univ New York Binghamton Univ, Watson Sch, Comp Sci Dept, Binghamton, NY 13902 USA
基金
中国国家自然科学基金;
关键词
Visualization; Semantics; Training; Prototypes; Task analysis; Whales; Data models; Class imbalance; class visual prototype; feature generation; image classification; zero-shot learning;
D O I
10.1109/TCYB.2020.3004641
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we focus on the task of zero-shot image classification (ZSIC) that equips a learning system with the ability to recognize visual images from unseen classes. In contrast to the traditional image classification, ZSIC more easily suffers from the class-imbalance issue since it is more concerned with the class-level knowledge transferring capability. In the real world, the sample numbers of different categories generally follow a long-tailed distribution, and the discriminative information in the sample-scarce seen classes is hard to transfer to the related unseen classes in the traditional batch-based training manner, which degrades the overall generalization ability a lot. To alleviate the class-imbalance issue in ZSIC, we propose a sample-balanced training process to encourage all training classes to contribute equally to the learned model. Specifically, we randomly select the same number of images from each class across all training classes to form a training batch to ensure that the sample-scarce classes contribute equally as those classes with sufficient samples during each iteration. Considering that the instances from the same class differ in class representativeness, we further develop an efficient semantic-guided feature fusion model to obtain the discriminative class visual prototype for the following visual-semantic interaction process via distributing different weights to the selected samples based on their class representativeness. Extensive experiments on three imbalanced ZSIC benchmark datasets for both traditional ZSIC and generalized ZSIC tasks demonstrate that our approach achieves promising results, especially for the unseen categories that are closely related to the sample-scarce seen categories. Besides, the experimental results on two class-balanced datasets show that the proposed approach also improves the classification performance against the baseline model.
引用
收藏
页码:6543 / 6554
页数:12
相关论文
共 50 条
  • [41] Zero-Shot Image Classification via Coupled Discriminative Dictionary Learning
    Liu, Lehui
    Wu, Songsong
    Chen, Runqing
    Zhou, Mengquan
    INTELLIGENT COMPUTING, NETWORKED CONTROL, AND THEIR ENGINEERING APPLICATIONS, PT II, 2017, 762 : 363 - 372
  • [42] Method for improving zero-shot image classification
    Chen, Xiangfeng
    Chen, Wenbai
    Zhang, Chong
    Lv, Mengyao
    Han, Hu
    JOURNAL OF ENGINEERING-JOE, 2018, (16): : 1688 - 1691
  • [43] Semantic-aligned reinforced attention model for zero-shot learning
    Yang, Zaiquan
    Zhang, Yuqi
    Du, Yuxin
    Tong, Chao
    IMAGE AND VISION COMPUTING, 2022, 128
  • [44] SEMANTIC AUGMENTATION HASHING FOR ZERO-SHOT IMAGE RETRIEVAL
    Zhong, Fangming
    Chen, Zhikui
    Min, Geyong
    Xia, Feng
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1943 - 1947
  • [45] Zero-shot learning with visual-semantic mutual reinforcement for image recognition
    Zhang, Yuhong
    Chen, Taohong
    Yu, Kui
    Hu, Xuegang
    Journal of Electronic Imaging, 2024, 33 (05)
  • [46] Zero-shot Image Tagging by Hierarchical Semantic Embedding
    Li, Xirong
    Liao, Shuai
    Lan, Weiyu
    Du, Xiaoyong
    Yang, Gang
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 879 - 882
  • [47] Robust image features for classification and zero-shot tasks by merging visual and semantic attributes
    Damares Crystina Oliveira de Resende
    Moacir Antonelli Ponti
    Neural Computing and Applications, 2022, 34 : 4459 - 4471
  • [48] Zero-shot classification with unseen prototype learning
    Zhong Ji
    Biying Cui
    Yunlong Yu
    Yanwei Pang
    Zhongfei Zhang
    Neural Computing and Applications, 2023, 35 : 12307 - 12317
  • [49] Zero-shot classification with unseen prototype learning
    Ji, Zhong
    Cui, Biying
    Yu, Yunlong
    Pang, Yanwei
    Zhang, Zhongfei
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (17): : 12307 - 12317
  • [50] Attribute relation learning for zero-shot classification
    Liu, Mingxia
    Zhang, Daoqiang
    Chen, Songcan
    NEUROCOMPUTING, 2014, 139 : 34 - 46