Semantic-Guided Class-Imbalance Learning Model for Zero-Shot Image Classification

Cited by: 16
Authors
Ji, Zhong [1]
Yu, Xuejie [1]
Yu, Yunlong [2]
Pang, Yanwei [1]
Zhang, Zhongfei [3]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[3] State Univ New York Binghamton Univ, Watson Sch, Comp Sci Dept, Binghamton, NY 13902 USA
Funding
National Natural Science Foundation of China;
Keywords
Visualization; Semantics; Training; Prototypes; Task analysis; Whales; Data models; Class imbalance; class visual prototype; feature generation; image classification; zero-shot learning;
DOI
10.1109/TCYB.2020.3004641
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
In this article, we focus on the task of zero-shot image classification (ZSIC) that equips a learning system with the ability to recognize visual images from unseen classes. In contrast to the traditional image classification, ZSIC more easily suffers from the class-imbalance issue since it is more concerned with the class-level knowledge transferring capability. In the real world, the sample numbers of different categories generally follow a long-tailed distribution, and the discriminative information in the sample-scarce seen classes is hard to transfer to the related unseen classes in the traditional batch-based training manner, which degrades the overall generalization ability a lot. To alleviate the class-imbalance issue in ZSIC, we propose a sample-balanced training process to encourage all training classes to contribute equally to the learned model. Specifically, we randomly select the same number of images from each class across all training classes to form a training batch to ensure that the sample-scarce classes contribute equally as those classes with sufficient samples during each iteration. Considering that the instances from the same class differ in class representativeness, we further develop an efficient semantic-guided feature fusion model to obtain the discriminative class visual prototype for the following visual-semantic interaction process via distributing different weights to the selected samples based on their class representativeness. Extensive experiments on three imbalanced ZSIC benchmark datasets for both traditional ZSIC and generalized ZSIC tasks demonstrate that our approach achieves promising results, especially for the unseen categories that are closely related to the sample-scarce seen categories. Besides, the experimental results on two class-balanced datasets show that the proposed approach also improves the classification performance against the baseline model.
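The sample-balanced batch construction and the weighted class prototype described in the abstract might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function names `balanced_batch` and `weighted_prototype` are hypothetical, drawing with replacement when a class has too few samples is an assumption, and the representativeness weights are taken as given inputs because the abstract does not say how they are computed.

```python
import random
from collections import defaultdict


def balanced_batch(labels, per_class, rng):
    """Return sample indices with exactly `per_class` draws from every class."""
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    batch = []
    for label in sorted(by_class):
        pool = by_class[label]
        if len(pool) >= per_class:
            batch.extend(rng.sample(pool, per_class))
        else:
            # Assumption: sample-scarce classes are drawn with replacement.
            batch.extend(rng.choices(pool, k=per_class))
    return batch


def weighted_prototype(features, weights):
    """Fuse per-sample feature vectors into one prototype by a weighted mean.

    Assumption: `weights` are non-negative representativeness scores
    supplied by the caller; they are normalized to sum to 1 here.
    """
    total = sum(weights)
    dim = len(features[0])
    return [
        sum(w / total * f[d] for f, w in zip(features, weights))
        for d in range(dim)
    ]


# Toy long-tailed label list: class 0 has 100 samples, class 2 has only 2.
labels = [0] * 100 + [1] * 10 + [2] * 2
batch = balanced_batch(labels, per_class=4, rng=random.Random(0))
counts = {c: sum(labels[i] == c for i in batch) for c in set(labels)}

# Two toy 2-D features; the first is treated as three times as representative.
proto = weighted_prototype([[1.0, 0.0], [0.0, 1.0]], [3.0, 1.0])
```

In this toy run every class contributes four indices to the batch regardless of its size, which is the property the abstract attributes to the sample-balanced training process.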
Pages: 6543-6554
Number of pages: 12
Related papers
50 in total
  • [31] Semantic guided knowledge graph for large-scale zero-shot learning
    Wei, Jiwei
    Sun, Haotian
    Yang, Yang
    Xu, Xing
    Li, Jingjing
    Shen, Heng Tao
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 88
  • [32] Preserving Semantic Relations for Zero-Shot Learning
    Annadani, Yashas
    Biswas, Soma
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7603 - 7612
  • [33] Semantic softmax loss for zero-shot learning
    Ji, Zhong
    Sun, Yuxin
    Yu, Yunlong
    Guo, Jichang
    Pang, Yanwei
    NEUROCOMPUTING, 2018, 316 : 369 - 375
  • [34] Underwater Sonar Image Classification with Image Disentanglement Reconstruction and Zero-Shot Learning
    Peng, Ye
    Li, Houpu
    Zhang, Wenwen
    Zhu, Junhui
    Liu, Lei
    Zhai, Guojun
    Remote Sensing, 2025, 17 (01)
  • [35] Boosting Zero-Shot Image Classification via Pairwise Relationship Learning
    Li, Hanhui
    Wu, Hefeng
    Lin, Shujin
    Lin, Liang
    Luo, Xiaonan
    Izquierdo, Ebroul
    COMPUTER VISION - ACCV 2016, PT I, 2017, 10111 : 85 - 99
  • [36] Zero-Shot Audio Classification Via Semantic Embeddings
    Xie, Huang
    Virtanen, Tuomas
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1233 - 1242
  • [37] Generalised Zero-Shot Learning with Domain Classification in a Joint Semantic and Visual Space
    Felix, Rafael
    Harwood, Ben
    Sasdelli, Michele
    Carneiro, Gustavo
    2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 17 - 24
  • [38] Gaze Embeddings for Zero-Shot Image Classification
    Karessli, Nour
    Akata, Zeynep
    Schiele, Bernt
    Bulling, Andreas
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6412 - 6421
  • [39] Multimodal Ensembling for Zero-Shot Image Classification
    Hickmon, Javon
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23747 - 23749
  • [40] Zero-Shot Image Classification Based on Attribute
    Zhang, Wei
    Chen, Wenbai
    Chen, Xiangfeng
    Han, Hu
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 25 - 30