Discriminative Region-based Multi-Label Zero-Shot Learning

被引:19
|
作者
Narayan, Sanath [1 ]
Gupta, Akshita [1 ]
Khan, Salman [2 ]
Khan, Fahad Shahbaz [2 ,3 ]
Shao, Ling [1 ]
Shah, Mubarak [4 ]
机构
[1] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
[2] Mohamed Bin Zayed Univ AI, Al Ain, U Arab Emirates
[3] Linkoping Univ, Linkoping, Sweden
[4] Univ Cent Florida, Orlando, FL 32816 USA
关键词
D O I
10.1109/ICCV48922.2021.00861
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label zero-shot learning (ZSL) is a more realistic counter-part of standard single-label ZSL since several objects can co-exist in a natural image. However, the occurrence of multiple objects complicates the reasoning and requires region-specific processing of visual features to preserve their contextual cues. We note that the best existing multi-label ZSL method takes a shared approach towards attending to region features with a common set of attention maps for all the classes. Such shared maps lead to diffused attention, which does not discriminatively focus on relevant locations when the number of classes are large. Moreover, mapping spatially-pooled visual features to the class semantics leads to inter-class feature entanglement, thus hampering the classification. Here, we propose an alternate approach towards region-based discriminability-preserving multi-label zero-shot classification. Our approach maintains the spatial resolution to preserve region-level characteristics and utilizes a bi-level attention module (BiAM) to enrich the features by incorporating both region and scene context information. The enriched region-level features are then mapped to the class semantics and only their class predictions are spatially pooled to obtain image-level predictions, thereby keeping the multi-class features disentangled. Our approach sets a new state of the art on two large-scale multi-label zero-shot benchmarks: NUS-WIDE and Open Images. On NUS-WIDE, our approach achieves an absolute gain of 6.9% mAP for ZSL, compared to the best published results.
引用
收藏
页码:8711 / 8720
页数:10
相关论文
共 50 条
  • [21] Pairnorm based Graphical Convolution Network for zero-shot multi-label classification
    Chauhan, Vikas
    Tiwari, Aruna
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [22] Joint Embedding with Multi-Task Learning for Multi-Label Zero-Shot Action Recognition
    An, Rongqiao
    Miao, Zhenjiang
    Li, Qingyu
    [J]. PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 613 - 618
  • [23] Efficient max-margin multi-label classification with applications to zero-shot learning
    Bharath Hariharan
    S. V. N. Vishwanathan
    Manik Varma
    [J]. Machine Learning, 2012, 88 : 127 - 155
  • [24] Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification
    Zhang, Yu
    Shen, Zhihong
    Wu, Chieh-Han
    Xie, Boya
    Hao, Junheng
    Wang, Ye-Yi
    Wang, Kuansan
    Han, Jiawei
    [J]. PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 3162 - 3173
  • [25] Uncertainty Flow Facilitates Zero-Shot Multi-Label Learning in Affective Facial Analysis
    Bai, Wenjun
    Quan, Changqin
    Luo, Zhiwei
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (02):
  • [26] Efficient max-margin multi-label classification with applications to zero-shot learning
    Hariharan, Bharath
    Vishwanathan, S. V. N.
    Varma, Manik
    [J]. MACHINE LEARNING, 2012, 88 (1-2) : 127 - 155
  • [27] Diverse and tailored image generation for zero-shot multi-label classification
    Zhang, Kaixin
    Yuan, Zhixiang
    Huang, Tao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [28] Neural Architecture Search with Heterogeneous Representation Learning for Zero-Shot Multi-Label Text Classification
    Chen, Liang
    Yan, Xueming
    Wang, Zilong
    Huang, Han
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [29] Learning exclusive discriminative semantic information for zero-shot learning
    Mi, Jian-Xun
    Zhang, Zhonghao
    Tai, Debao
    Zhou, Li-Fang
    Jia, Wei
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (03) : 761 - 772
  • [30] Discriminative deep attributes for generalized zero-shot learning
    Kim, Hoseong
    Lee, Jewook
    Byun, Hyeran
    [J]. PATTERN RECOGNITION, 2022, 124