Zero-Shot Recognition via Optimal Transport

被引:7
|
作者
Wang, Wenlin [1 ]
Xu, Hongteng [2 ]
Wang, Guoyin [3 ]
Wang, Wenqi [4 ]
Carin, Lawrence [1 ]
机构
[1] Duke Univ, Durham, NC 27706 USA
[2] Renmin Univ China, Beijing, Peoples R China
[3] Amazon Alexa AI, Seattle, WA USA
[4] Facebook, Menlo Pk, CA USA
关键词
NETWORK;
D O I
10.1109/WACV48630.2021.00351
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an optimal transport (OT) framework for generalized zero-shot learning (GZSL), seeking to distinguish samples for both seen and unseen classes, with the assist of auxiliary attributes. The discrepancy between features and attributes is minimized by solving an optimal transport problem. Specifically, we build a conditional generative model to generate features from seen-class attributes, and establish an optimal transport between the distribution of the generated features and that of the real features. The generative model and the optimal transport are optimized iteratively with an attribute-based regularizer, that further enhances the discriminative power of the generated features. A classifier is learned based on the features generated for both the seen and unseen classes. In addition to generalized zero-shot learning, our framework is also applicable to standard and transductive ZSL problems. Experiments show that our optimal transport-based method outperforms state-of-the-art methods on several benchmark datasets.
引用
收藏
页码:3470 / 3480
页数:11
相关论文
共 50 条
  • [1] Zero-Shot Recognition via Structured Prediction
    Zhang, Ziming
    Saligrama, Venkatesh
    COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 533 - 548
  • [3] Universal Prototype Transport for Zero-Shot Action Recognition and Localization
    Pascal Mettes
    International Journal of Computer Vision, 2023, 131 : 3060 - 3073
  • [4] Zero-shot Recognition via Semantic Embeddings and Knowledge Graphs
    Wang, Xiaolong
    Ye, Yufei
    Gupta, Abhinav
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6857 - 6866
  • [5] Zero-Shot Visual Recognition via Bidirectional Latent Embedding
    Qian Wang
    Ke Chen
    International Journal of Computer Vision, 2017, 124 : 356 - 383
  • [6] Zero-Shot Emotion Recognition via Affective Structural Embedding
    Zhan, Chi
    She, Dongyu
    Zhao, Sicheng
    Cheng, Ming-Ming
    Yang, Jufeng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1151 - 1160
  • [7] Zero-Shot Visual Recognition via Bidirectional Latent Embedding
    Wang, Qian
    Chen, Ke
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 124 (03) : 356 - 383
  • [8] On zero-shot recognition of generic objects
    Hascoet, Tristan
    Ariki, Yasuo
    Takiguchi, Tetsuya
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9545 - 9553
  • [9] Zero-Shot Recognition with Unreliable Attributes
    Jayaraman, Dinesh
    Grauman, Kristen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [10] Scalable Zero-Shot Logo Recognition
    Shulgin, Mikhail
    Makarov, Ilya
    IEEE ACCESS, 2023, 11 : 142702 - 142710