SVDML: Semantic and Visual Space Deep Mutual Learning for Zero-Shot Learning

被引:0
|
作者
Lu, Nannan [1 ]
Luo, Yi [1 ]
Qiu, Mingkai [1 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221100, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot Learning; Semantic Representation; Visual Representation; Mutual Learning;
D O I
10.1007/978-981-99-8546-3_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The key challenge of zero-shot learning (ZSL) is how to identify novel objects for which no samples are available during the training process. Current approaches either align the global features of images to the corresponding class semantic vectors or use unidirectional attentions to locate the local visual features of images via semantic attributes to avoid interference from other noise in the image. However, they still have not found a way to establish a robust correlation between the semantic and visual representation. To solve the issue, we propose a Semantic and Visual space Deep Mutual Learning (SVDML), which consists of three modules: class representation learning, attribute embedding, and mutual learning, to establish the intrinsic semantic relations between visual features and attribute features. SVDML uses two kinds of prototype generators to separately guide the learning of global and local features of images and achieves interaction between two learning pipelines by mutual learning, so that promotes the recognition of the fine-grained features and strengthens the knowledge generalization ability in zero-shot learning. The proposed SVDML yields significant improvements over the strong baselines, leading to the new state-of the-art performances on three popular challenging benchmarks.
引用
收藏
页码:383 / 395
页数:13
相关论文
共 50 条
  • [41] Zero-shot learning by mutual information estimation and maximization
    Tang, Chenwei
    Yang, Xue
    Lv, Jiancheng
    He, Zhenan
    KNOWLEDGE-BASED SYSTEMS, 2020, 194
  • [42] Zero-Shot Learning via Visual Abstraction
    Antol, Stanislaw
    Zitnick, C. Lawrence
    Parikh, Devi
    COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 401 - 416
  • [43] Visual-semantic consistency matching network for generalized zero-shot learning
    Zhang, Zhenqi
    Cao, Wenming
    NEUROCOMPUTING, 2023, 536 : 30 - 39
  • [44] Learning visual-and-semantic knowledge embedding for zero-shot image classification
    Dehui Kong
    Xiliang Li
    Shaofan Wang
    Jinghua Li
    Baocai Yin
    Applied Intelligence, 2023, 53 : 2250 - 2264
  • [45] Learning visual-and-semantic knowledge embedding for zero-shot image classification
    Kong, Dehui
    Li, Xiliang
    Wang, Shaofan
    Li, Jinghua
    Yin, Baocai
    APPLIED INTELLIGENCE, 2023, 53 (02) : 2250 - 2264
  • [46] Learning Invariant Visual Representations for Compositional Zero-Shot Learning
    Zhang, Tian
    Liang, Kongming
    Du, Ruoyi
    Sun, Xian
    Ma, Zhanyu
    Guo, Jun
    COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 339 - 355
  • [47] DEEP ZERO-SHOT LEARNING FOR SCENE SKETCH
    Xie, Yao
    Xu, Peng
    Ma, Zhanyu
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3661 - 3665
  • [48] A meaningful learning method for zero-shot semantic segmentation
    Liu, Xianglong
    Bai, Shihao
    An, Shan
    Wang, Shuo
    Liu, Wei
    Zhao, Xiaowei
    Ma, Yuqing
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (11)
  • [49] Attentive Semantic Preservation Network for Zero-Shot Learning
    Lu, Ziqian
    Yu, Yunlong
    Lu, Zhe-Ming
    Shen, Feng-Li
    Zhang, Zhongfei
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2919 - 2925
  • [50] Zero-Shot Learning on Semantic Class Prototype Graph
    Fu, Zhenyong
    Xiang, Tao
    Kodirov, Elyor
    Gong, Shaogang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (08) : 2009 - 2022