Multi-scale visual attention for attribute disambiguation in zero-shot learning

被引:1
|
作者
Tian, Long [1 ]
Chen, Bo [1 ]
Ren, Jie [1 ]
Zhang, Hao [1 ]
Wu, Zhenhua [3 ]
Han, Ning [2 ]
Chen, Yuanwei [4 ]
Liu, Hongwei [1 ]
机构
[1] Xidian Univ, Natl Lab Radar Signal Proc, Xian 710071, Peoples R China
[2] Inst Mech Technol, 16 Jinhua North Rd, Xian 710032, Peoples R China
[3] Anhui Univ, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei 230601, Peoples R China
[4] China Acad Space Technol, Beijing, Peoples R China
关键词
Zero-shot image recognition; Visual attention; Attribute disambiguation;
D O I
10.1016/j.image.2021.116614
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Observing the phenomenon that the discriminative visual features and unambiguous attribute descriptions are important in zero-shot learning (ZSL), we propose a Multi-scale Visual Attention for Attribute Disambiguation (MVAAD). MVAAD contains a Multi-Scale Visual Attention Network (MSVAN) to realize attentions on image regions, which helps MVAAD to learn more discriminative visual features. Based on the multi-scale visual features in MSVAN, we also develop a Coarse-to-fine Visual-guided Attribute Selection Module (CVASM) to use the multi-scale visual attentive features for attribute disambiguation. Both of MSVAN and CVASM can be jointly trained in an end-to-end manner by minimizing the visual-semantic classification loss and the latent visual contrastive triplet loss. Experimental results on four popular ZSL benchmarks, AwA2, CUB, SUN and FLO, illustrate that MVAAD is able to not only achieve the state-of-the-art performance, but also give meaningful and explainable visualizations on the visual attention and the attribute selection.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Attribute Attention for Semantic Disambiguation in Zero-Shot Learning
    Liu, Yang
    Guo, Jishun
    Cai, Deng
    He, Xiaofei
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6697 - 6706
  • [2] ADAPTIVE MULTI-SCALE SEMANTIC FUSION NETWORK FOR ZERO-SHOT LEARNING
    Song, Jing
    Peng, Peixi
    Zhai, Yunpeng
    Zhang, Chong
    Tian, Yonghong
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [3] A Large-scale Attribute Dataset for Zero-shot Learning
    Zhao, Bo
    Fu, Yanwei
    Liang, Rui
    Wu, Jiahong
    Wang, Yonggang
    Wang, Yizhou
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 398 - 407
  • [4] Attribute subspaces for zero-shot learning
    Zhou, Lei
    Liu, Yang
    Bai, Xiao
    Li, Na
    Yu, Xiaohan
    Zhou, Jun
    Hancock, Edwin R.
    [J]. PATTERN RECOGNITION, 2023, 144
  • [5] Zero-Shot Learning with Attribute Selection
    Guo, Yuchen
    Ding, Guiguang
    Han, Jungong
    Tang, Sheng
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6870 - 6877
  • [6] Zero-shot Learning With Fuzzy Attribute
    Liu, Chongwen
    Shang, Zhaowei
    Tang, Yuan Yan
    [J]. 2017 3RD IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS (CYBCONF), 2017, : 277 - 282
  • [7] Exploiting multi-scale contextual prompt learning for zero-shot semantic segmentation☆
    Wang, Yiqi
    Tian, Yingjie
    [J]. DISPLAYS, 2024, 81
  • [8] Multi-Scale Speaker Vectors for Zero-Shot Speech Synthesis
    Cory, Tristin
    Iqbal, Razib
    [J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 496 - 501
  • [9] Rethinking attribute localization for zero-shot learning
    Shuhuang CHEN
    Shiming CHEN
    GuoSen XIE
    Xiangbo SHU
    Xinge YOU
    Xuelong LI
    [J]. Science China(Information Sciences), 2024, (07) - 196
  • [10] Rethinking attribute localization for zero-shot learning
    Chen, Shuhuang
    Chen, Shiming
    Xie, Guo-Sen
    Shu, Xiangbo
    You, Xinge
    Li, Xuelong
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (07)