Visual Structure Constraint for Transductive Zero-Shot Learning in the Wild

Cited by: 5
Authors
Wan, Ziyu [1]
Chen, Dongdong [2]
Liao, Jing [1]
Affiliations
[1] City Univ Hong Kong, Kowloon, Hong Kong, Peoples R China
[2] Microsoft Cloud AI, Lexington, KY USA
Keywords
Computer vision; Zero-shot learning; Visual structure constraint
DOI
10.1007/s11263-021-01451-1
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To recognize objects of unseen classes, most existing Zero-Shot Learning (ZSL) methods first learn a compatible projection function between the common semantic space and the visual space based on the data of source seen classes, then directly apply it to the target unseen classes. However, for data in the wild, the distributions of the source and target domains might not match well, thus causing the well-known domain shift problem. Based on the observation that visual features of test instances can be separated into different clusters, we propose a new visual structure constraint on class centers for transductive ZSL, to improve the generality of the projection function (i.e., alleviate the above domain shift problem). Specifically, three different strategies (symmetric Chamfer distance, bipartite matching distance, and Wasserstein distance) are adopted to align the projected unseen semantic centers and the visual cluster centers of test instances. We also propose two new training strategies to handle data in the wild, where the test dataset may contain many unrelated images. This realistic setting has never been considered in previous methods. Extensive experiments demonstrate that the proposed visual structure constraint consistently brings a substantial performance gain and that the new training strategies make it generalize well to data in the wild. The source code is available at https://github.com/raywzy/VSC.
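As an illustration of the alignment idea in the abstract, the following is a minimal sketch of two of the named distances between a set of projected semantic centers and a set of visual cluster centers. It is not taken from the authors' repository: the function names, the use of squared Euclidean distance, the toy 2-D centers, and the brute-force matching (practical only for a handful of classes) are all assumptions made for illustration.

```python
import itertools


def sq_dist(a, b):
    """Squared Euclidean distance between two points (assumed metric)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def symmetric_chamfer(projected, clusters):
    """Sum of nearest-neighbour distances in both directions."""
    forward = sum(min(sq_dist(p, c) for c in clusters) for p in projected)
    backward = sum(min(sq_dist(c, p) for p in projected) for c in clusters)
    return forward + backward


def bipartite_matching(projected, clusters):
    """Cheapest one-to-one assignment, found by brute force.

    Requires equally sized sets. Returns (cost, perm), where perm[i] is the
    index of the cluster center matched to projected center i.
    """
    if len(projected) != len(clusters):
        raise ValueError("bipartite matching needs equally sized sets")

    def cost(perm):
        return sum(sq_dist(p, clusters[j]) for p, j in zip(projected, perm))

    best = min(itertools.permutations(range(len(clusters))), key=cost)
    return cost(best), best


# Hypothetical toy data: three projected class centers and three cluster centers.
projected = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
clusters = [(0.0, 2.0), (0.0, 0.0), (1.0, 1.0)]

chamfer_value = symmetric_chamfer(projected, clusters)
matching_cost, matching = bipartite_matching(projected, clusters)
```

The Chamfer variant lets several centers share one nearest neighbour, while the bipartite variant forces a strict one-to-one correspondence; in a training setting either value would presumably be minimized with respect to the projection function.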
Pages: 1893-1909
Page count: 17