Improving generalized zero-shot learning via cluster-based semantic disentangling representation

被引:3
|
作者
Gao, Yi [1 ]
Feng, Wentao [1 ]
Xiao, Rong [1 ]
He, Lihuo [2 ]
He, Zhenan [1 ]
Lv, Jiancheng [1 ]
Tang, Chenwei [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
基金
美国国家科学基金会;
关键词
Generalized zero-shot learning; Domain shift; Semantic gap; Cluster; Semantic disentangling representation;
D O I
10.1016/j.patcog.2024.110320
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generalized Zero -Shot Learning (GZSL) aims to recognize both seen and unseen classes by training only the seen classes, in which the instances of unseen classes tend to be biased towards the seen class. In this paper, we propose a Cluster -based Semantic Disentangling Representation (CSDR) method to improve GZSL by alleviating the problems of domain shift and semantic gap. First, we cluster the seen data into multiple clusters, where the samples in each cluster belong to several original seen categories, so as to facilitate finegrained semantic disentangling of visual feature vectors. Then, we introduce representation random swapping and contrastive learning based on the clustering results to realize the disentangling semantic representations of semantic -unspecific, class -shared, and class -unique. The fine-grained semantic disentangling representations show high intra-class similarity and inter -class discriminability, which improve the performance of GZSL by alleviating the problem of domain shift. Finally, we construct the visual -semantic embedding space by the variational auto -encoder and alignment module, which can bridge the semantic gap by generating strongly discriminative unseen class samples. Extensive experimental results on four public data sets prove that our method significantly outperforms state-of-the-art methods in generalized and conventional settings.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Generative Model with Semantic Embedding and Integrated Classifier for Generalized Zero-Shot Learning
    Pambala, Ayyappa Kumar
    Dutta, Titir
    Biswas, Soma
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1226 - 1235
  • [42] Visual-semantic consistency matching network for generalized zero-shot learning
    Zhang, Zhenqi
    Cao, Wenming
    NEUROCOMPUTING, 2023, 536 : 30 - 39
  • [43] A generalized zero-shot semantic learning model for batch process fault diagnosis
    Liu, Kai
    Zhao, Xiaoqiang
    Mou, Miao
    Hui, Yongyong
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (01)
  • [44] Semantic-guided Reinforced Region Embedding for Generalized Zero-Shot Learning
    Ge, Jiannan
    Xie, Hongtao
    Min, Shaobo
    Zhang, Yongdong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1406 - 1414
  • [45] A Semantic Encoding Out-of-Distribution Classifier for Generalized Zero-Shot Learning
    Ding, Jiayu
    Hu, Xiao
    Zhong, Xiaorong
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1395 - 1399
  • [46] Zero-Shot Learning via Low-Rank-Representation Based Manifold Regularization
    Meng, Min
    Zhan, Xiaoyu
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (09) : 1379 - 1383
  • [47] Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning
    Liu, Man
    Li, Feng
    Zhang, Chunjie
    Wei, Yunchao
    Bai, Huihui
    Zhao, Yao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15337 - 15346
  • [48] Zero-Shot Learning via Robust Latent Representation and Manifold Regularization
    Meng, Min
    Yu, Jun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1824 - 1836
  • [49] Alleviating Domain Shift via Discriminative Learning for Generalized Zero-Shot Learning
    Ye, Yalan
    He, Yukun
    Pan, Tongjie
    Li, Jingjing
    Shen, Heng Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1325 - 1337
  • [50] A zero-shot learning framework via cluster-prototype matching
    Zhang, Jing
    Li, Qingyong
    Geng, YangLi-ao
    Wang, Wen
    Sun, Wenju
    Shi, Chuan
    Ding, Zhengming
    PATTERN RECOGNITION, 2022, 124