Improving generalized zero-shot learning via cluster-based semantic disentangling representation

被引:3
|
作者
Gao, Yi [1 ]
Feng, Wentao [1 ]
Xiao, Rong [1 ]
He, Lihuo [2 ]
He, Zhenan [1 ]
Lv, Jiancheng [1 ]
Tang, Chenwei [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
基金
美国国家科学基金会;
关键词
Generalized zero-shot learning; Domain shift; Semantic gap; Cluster; Semantic disentangling representation;
D O I
10.1016/j.patcog.2024.110320
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generalized Zero -Shot Learning (GZSL) aims to recognize both seen and unseen classes by training only the seen classes, in which the instances of unseen classes tend to be biased towards the seen class. In this paper, we propose a Cluster -based Semantic Disentangling Representation (CSDR) method to improve GZSL by alleviating the problems of domain shift and semantic gap. First, we cluster the seen data into multiple clusters, where the samples in each cluster belong to several original seen categories, so as to facilitate finegrained semantic disentangling of visual feature vectors. Then, we introduce representation random swapping and contrastive learning based on the clustering results to realize the disentangling semantic representations of semantic -unspecific, class -shared, and class -unique. The fine-grained semantic disentangling representations show high intra-class similarity and inter -class discriminability, which improve the performance of GZSL by alleviating the problem of domain shift. Finally, we construct the visual -semantic embedding space by the variational auto -encoder and alignment module, which can bridge the semantic gap by generating strongly discriminative unseen class samples. Extensive experimental results on four public data sets prove that our method significantly outperforms state-of-the-art methods in generalized and conventional settings.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Zero-Shot Program Representation Learning
    Cui, Nan
    Jiang, Yuze
    Gu, Xiaodong
    Shen, Beijun
    arXiv, 2022,
  • [32] Improving Semantic Embedding Consistency by Metric Learning for Zero-Shot Classiffication
    Bucher, Maxime
    Herbin, Stephane
    Jurie, Frederic
    COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 730 - 746
  • [33] Zero-Shot Learning for Intrusion Detection via Attribute Representation
    Li, Zhipeng
    Qin, Zheng
    Shen, Pengbo
    Jiang, Liu
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I, 2019, 11953 : 352 - 364
  • [34] Generalized Zero-Shot Learning Based on Manifold Alignment
    Xu, Rui
    Shao, Shuai
    Liu, Baodi
    Liu, Weifeng
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 202 - 207
  • [35] A Unified Approach for Conventional Zero-Shot, Generalized Zero-Shot, and Few-Shot Learning
    Rahman, Shafin
    Khan, Salman
    Porikli, Fatih
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5652 - 5667
  • [36] Zero-shot learning via visual-semantic aligned autoencoder
    Wei, Tianshu
    Huang, Jinjie
    Jin, Cong
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (08) : 14081 - 14095
  • [37] Adversarial Zero-Shot Learning with Semantic Augmentation
    Tong, Bin
    Klinkigt, Martin
    Chen, Junwen
    Cui, Xiankun
    Kong, Quan
    Murakami, Tomokazu
    Kobayashi, Yoshiyuki
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2476 - 2483
  • [38] Semantic-diversity transfer network for generalized zero-shot learning via inner disagreement based OOD detector
    Liu, Bo
    Dong, Qiulei
    Hu, Zhanyi
    KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [39] Preserving Semantic Relations for Zero-Shot Learning
    Annadani, Yashas
    Biswas, Soma
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7603 - 7612
  • [40] Semantic softmax loss for zero-shot learning
    Ji, Zhong
    Sun, Yuxin
    Yu, Yunlong
    Guo, Jichang
    Pang, Yanwei
    NEUROCOMPUTING, 2018, 316 : 369 - 375