Visual Structure Constraint for Transductive Zero-Shot Learning in the Wild

被引:5
|
作者
Wan, Ziyu [1 ]
Chen, Dongdong [2 ]
Liao, Jing [1 ]
机构
[1] City Univ Hong Kong, Kowloon, Hong Kong, Peoples R China
[2] Microsoft Cloud AI, Lexington, KY USA
关键词
Computer vision; Zero-shot learning; Visual structure constraint;
D O I
10.1007/s11263-021-01451-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To recognize objects of the unseen classes, most existing Zero-Shot Learning(ZSL) methods first learn a compatible projection function between the common semantic space and the visual space based on the data of source seen classes, then directly apply it to the target unseen classes. However, for data in the wild, distributions between the source and target domain might not match well, thus causing the well-known domain shift problem. Based on the observation that visual features of test instances can be separated into different clusters, we propose a new visual structure constraint on class centers for transductive ZSL, to improve the generality of the projection function (i.e.alleviate the above domain shift problem). Specifically, three different strategies (symmetric Chamfer-distance, Bipartite matching distance, and Wasserstein distance) are adopted to align the projected unseen semantic centers and visual cluster centers of test instances. We also propose two new training strategies to handle the data in the wild, where many unrelated images in the test dataset may exist. This realistic setting has never been considered in previous methods. Extensive experiments demonstrate that the proposed visual structure constraint brings substantial performance gain consistently and the new training strategies make it generalize well for data in the wild. The source code is available at https:// github.com/ raywzy/VSC..
引用
收藏
页码:1893 / 1909
页数:17
相关论文
共 50 条
  • [1] Visual Structure Constraint for Transductive Zero-Shot Learning in the Wild
    Ziyu Wan
    Dongdong Chen
    Jing Liao
    [J]. International Journal of Computer Vision, 2021, 129 : 1893 - 1909
  • [2] Transductive Zero-Shot Learning with Visual Structure Constraint
    Wan, Ziyu
    Chen, Dongdong
    Li, Yan
    Yan, Xingguang
    Zhang, Junge
    Yu, Yizhou
    Liao, Jing
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [3] Transductive Zero-Shot Learning via Visual Center Adaptation
    Wan, Ziyu
    Li, Yan
    Yang, Min
    Zhang, Junge
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 10059 - 10060
  • [4] Transductive Visual-Semantic Embedding for Zero-shot Learning
    Xu, Xing
    Shen, Fumin
    Yang, Yang
    Shao, Jie
    Huang, Zi
    [J]. PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 41 - 49
  • [5] Adversarial strategy for transductive zero-shot learning
    Liu, Youfa
    Du, Bo
    Ni, Fuchuan
    [J]. INFORMATION SCIENCES, 2021, 578 : 750 - 761
  • [6] Bidirectional generative transductive zero-shot learning
    Xinpeng Li
    Dan Zhang
    Mao Ye
    Xue Li
    Qiang Dou
    Qiao Lv
    [J]. Neural Computing and Applications, 2021, 33 : 5313 - 5326
  • [7] Bidirectional generative transductive zero-shot learning
    Li, Xinpeng
    Zhang, Dan
    Ye, Mao
    Li, Xue
    Dou, Qiang
    Lv, Qiao
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (10): : 5313 - 5326
  • [8] Transductive Learning for Zero-Shot Object Detection
    Rahman, Shafin
    Khan, Salman
    Barnes, Nick
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6081 - 6090
  • [9] Holistically Associated Transductive Zero-Shot Learning
    Xu, Yangyang
    Xu, Xuemiao
    Han, Guoqiang
    He, Shengfeng
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 437 - 447
  • [10] Transductive Unbiased Embedding for Zero-Shot Learning
    Song, Jie
    Shen, Chengchao
    Yang, Yezhou
    Liu, Yang
    Song, Mingli
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1024 - 1033