Discriminative Region-based Multi-Label Zero-Shot Learning

被引:19
|
作者
Narayan, Sanath [1 ]
Gupta, Akshita [1 ]
Khan, Salman [2 ]
Khan, Fahad Shahbaz [2 ,3 ]
Shao, Ling [1 ]
Shah, Mubarak [4 ]
机构
[1] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
[2] Mohamed Bin Zayed Univ AI, Al Ain, U Arab Emirates
[3] Linkoping Univ, Linkoping, Sweden
[4] Univ Cent Florida, Orlando, FL 32816 USA
关键词
D O I
10.1109/ICCV48922.2021.00861
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label zero-shot learning (ZSL) is a more realistic counter-part of standard single-label ZSL since several objects can co-exist in a natural image. However, the occurrence of multiple objects complicates the reasoning and requires region-specific processing of visual features to preserve their contextual cues. We note that the best existing multi-label ZSL method takes a shared approach towards attending to region features with a common set of attention maps for all the classes. Such shared maps lead to diffused attention, which does not discriminatively focus on relevant locations when the number of classes are large. Moreover, mapping spatially-pooled visual features to the class semantics leads to inter-class feature entanglement, thus hampering the classification. Here, we propose an alternate approach towards region-based discriminability-preserving multi-label zero-shot classification. Our approach maintains the spatial resolution to preserve region-level characteristics and utilizes a bi-level attention module (BiAM) to enrich the features by incorporating both region and scene context information. The enriched region-level features are then mapped to the class semantics and only their class predictions are spatially pooled to obtain image-level predictions, thereby keeping the multi-class features disentangled. Our approach sets a new state of the art on two large-scale multi-label zero-shot benchmarks: NUS-WIDE and Open Images. On NUS-WIDE, our approach achieves an absolute gain of 6.9% mAP for ZSL, compared to the best published results.
引用
收藏
页码:8711 / 8720
页数:10
相关论文
共 50 条
  • [1] Generative Multi-Label Zero-Shot Learning
    Gupta, Akshita
    Narayan, Sanath
    Khan, Salman
    Khan, Fahad Shahbaz
    Shao, Ling
    van de Weijer, Joost
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14611 - 14624
  • [2] Graph embedding based multi-label Zero-shot Learning
    Zhang, Haigang
    Meng, Xianglong
    Cao, Weipeng
    Liu, Ye
    Ming, Zhong
    Yang, Jinfeng
    [J]. NEURAL NETWORKS, 2023, 167 : 129 - 140
  • [3] Zero-shot multi-label learning via label factorisation
    Shao, Hang
    Guo, Yuchen
    Ding, Guiguang
    Han, Jungong
    [J]. IET COMPUTER VISION, 2019, 13 (02) : 117 - 124
  • [4] Generalized Zero-Shot Extreme Multi-label Learning
    Gupta, Nilesh
    Bohra, Sakina
    Prabhu, Yashoteja
    Purohit, Saurabh
    Varma, Manik
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 527 - 535
  • [5] A Probabilistic Framework for Zero-Shot Multi-Label Learning
    Gaure, Abhilash
    Gupta, Aishwarya
    Verma, Vinay Kumar
    Rai, Piyush
    [J]. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
  • [6] Few-Shot and Zero-Shot Multi-Label Learning for Structured Label Spaces
    Rios, Anthony
    Kavuluru, Ramakanth
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3132 - 3142
  • [7] Multi-label zero-shot learning with graph convolutional networks
    Ou, Guangjin
    Yu, Guoxian
    Domeniconi, Carlotta
    Lu, Xuequan
    Zhang, Xiangliang
    [J]. NEURAL NETWORKS, 2020, 132 : 333 - 341
  • [8] Multi-Label Zero-Shot Learning With Adversarial and Variational Techniques
    Gull, Muqaddas
    Arif, Omar
    [J]. IEEE ACCESS, 2024, 12 : 94990 - 95006
  • [9] A Transferable Generative Framework for Multi-Label Zero-Shot Learning
    Ma, Peirong
    He, Zhiquan
    Ran, Wu
    Lu, Hong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3409 - 3423
  • [10] Semantic Diversity Learning for Zero-Shot Multi-label Classification
    Ben-Cohen, Avi
    Zamir, Nadav
    Ben Baruch, Emanuel
    Friedman, Itamar
    Zelnik-Manor, Lihi
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 620 - 630