SVDML: Semantic and Visual Space Deep Mutual Learning for Zero-Shot Learning

被引：0

作者：

Lu, Nannan ^{[1
]}

Luo, Yi ^{[1
]}

Qiu, Mingkai ^{[1
]}

机构：

[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221100, Jiangsu, Peoples R China

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX | 2024年 / 14433卷

基金：

中国国家自然科学基金;

关键词：

Zero-shot Learning; Semantic Representation; Visual Representation; Mutual Learning;

D O I：

10.1007/978-981-99-8546-3_31

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The key challenge of zero-shot learning (ZSL) is how to identify novel objects for which no samples are available during the training process. Current approaches either align the global features of images to the corresponding class semantic vectors or use unidirectional attentions to locate the local visual features of images via semantic attributes to avoid interference from other noise in the image. However, they still have not found a way to establish a robust correlation between the semantic and visual representation. To solve the issue, we propose a Semantic and Visual space Deep Mutual Learning (SVDML), which consists of three modules: class representation learning, attribute embedding, and mutual learning, to establish the intrinsic semantic relations between visual features and attribute features. SVDML uses two kinds of prototype generators to separately guide the learning of global and local features of images and achieves interaction between two learning pipelines by mutual learning, so that promotes the recognition of the fine-grained features and strengthens the knowledge generalization ability in zero-shot learning. The proposed SVDML yields significant improvements over the strong baselines, leading to the new state-of the-art performances on three popular challenging benchmarks.

引用

页码：383 / 395

页数：13

共 50 条

[41] Zero-shot learning by mutual information estimation and maximization
Tang, Chenwei
Yang, Xue
Lv, Jiancheng
He, Zhenan
KNOWLEDGE-BASED SYSTEMS, 2020, 194
[42] Zero-Shot Learning via Visual Abstraction
Antol, Stanislaw
Zitnick, C. Lawrence
Parikh, Devi
COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 401 - 416
[43] Visual-semantic consistency matching network for generalized zero-shot learning
Zhang, Zhenqi
Cao, Wenming
NEUROCOMPUTING, 2023, 536 : 30 - 39
[44] Learning visual-and-semantic knowledge embedding for zero-shot image classification
Dehui Kong
Xiliang Li
Shaofan Wang
Jinghua Li
Baocai Yin
Applied Intelligence, 2023, 53 : 2250 - 2264
[45] Learning visual-and-semantic knowledge embedding for zero-shot image classification
Kong, Dehui
Li, Xiliang
Wang, Shaofan
Li, Jinghua
Yin, Baocai
APPLIED INTELLIGENCE, 2023, 53 (02) : 2250 - 2264
[46] Learning Invariant Visual Representations for Compositional Zero-Shot Learning
Zhang, Tian
Liang, Kongming
Du, Ruoyi
Sun, Xian
Ma, Zhanyu
Guo, Jun
COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 339 - 355
[47] DEEP ZERO-SHOT LEARNING FOR SCENE SKETCH
Xie, Yao
Xu, Peng
Ma, Zhanyu
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3661 - 3665
[48] A meaningful learning method for zero-shot semantic segmentation
Liu, Xianglong
Bai, Shihao
An, Shan
Wang, Shuo
Liu, Wei
Zhao, Xiaowei
Ma, Yuqing
SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (11)
[49] Attentive Semantic Preservation Network for Zero-Shot Learning
Lu, Ziqian
Yu, Yunlong
Lu, Zhe-Ming
Shen, Feng-Li
Zhang, Zhongfei
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2919 - 2925
[50] Zero-Shot Learning on Semantic Class Prototype Graph
Fu, Zhenyong
Xiang, Tao
Kodirov, Elyor
Gong, Shaogang
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (08) : 2009 - 2022

← 1 2 3 4 5 →