SVDML: Semantic and Visual Space Deep Mutual Learning for Zero-Shot Learning

Cited by: 0
Authors
Lu, Nannan [1]
Luo, Yi [1]
Qiu, Mingkai [1]
Affiliations
[1] China University of Mining and Technology, School of Information and Control Engineering, Xuzhou 221100, Jiangsu, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Zero-shot Learning; Semantic Representation; Visual Representation; Mutual Learning
DOI
10.1007/978-981-99-8546-3_31
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The key challenge of zero-shot learning (ZSL) is identifying novel objects for which no samples are available during training. Current approaches either align the global features of images with the corresponding class semantic vectors or use unidirectional attention to locate local visual features via semantic attributes, which reduces interference from noise in the image. However, they do not establish a robust correlation between the semantic and visual representations. To address this issue, we propose Semantic and Visual space Deep Mutual Learning (SVDML), which consists of three modules (class representation learning, attribute embedding, and mutual learning) and establishes the intrinsic semantic relations between visual features and attribute features. SVDML uses two kinds of prototype generators to separately guide the learning of global and local image features, and the two learning pipelines interact through mutual learning. This promotes the recognition of fine-grained features and strengthens knowledge generalization in zero-shot learning. SVDML yields significant improvements over strong baselines, achieving new state-of-the-art performance on three popular, challenging benchmarks.
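The abstract does not give the loss used by the mutual learning module. As a rough illustration only, the sketch below assumes the generic deep mutual learning formulation, in which each of two pipelines minimizes its own cross-entropy plus a KL term that pulls its prediction toward its peer's. The function names, the `weight` parameter, and the global/local labeling of the two pipelines are hypothetical and are not taken from the paper.

```python
import math

def softmax(logits):
    # Subtract the maximum for numerical stability before exponentiating.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(probs, label):
    # Negative log-likelihood of the true class for one sample.
    return -math.log(probs[label])

def kl_divergence(p, q):
    # KL(p || q) for two discrete distributions over the same classes.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def mutual_learning_losses(global_logits, local_logits, label, weight=1.0):
    # Assumed (generic) mutual learning objective, not the paper's exact loss:
    # each pipeline is supervised by the label and by its peer's prediction.
    p_global = softmax(global_logits)
    p_local = softmax(local_logits)
    loss_global = cross_entropy(p_global, label) + weight * kl_divergence(p_local, p_global)
    loss_local = cross_entropy(p_local, label) + weight * kl_divergence(p_global, p_local)
    return loss_global, loss_local

# Hypothetical class scores from the two pipelines for one image of class 0.
loss_g, loss_l = mutual_learning_losses([2.0, 0.5, -1.0], [1.0, 1.0, 0.0], label=0)
```

When the two pipelines agree exactly, both KL terms vanish and each loss reduces to plain cross-entropy; the more their predictions differ, the larger the mutual term becomes.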
Pages: 383-395
Page count: 13