ATZSL: Defensive Zero-Shot Recognition in the Presence of Adversaries

Authors
Zhang, Xingxing [1]
Gui, Shupeng [2]
Jin, Jian [3]
Zhu, Zhenfeng [4, 5]
Zhao, Yao [4, 5]
Affiliations
[1] Tsinghua Univ, Beijing 100084, Peoples R China
[2] Meta, Menlo Pk, CA 94025 USA
[3] Nanyang Technol Univ, Singapore 639798, Singapore
[4] Beijing Jiaotong Univ, Beijing 100044, Peoples R China
[5] Beijing Jiaotong Univ, Beijing Key Lab Adv Informat Sci & Network Techno, Beijing 100044, Peoples R China
Keywords
Defensive zero-shot learning; adversarial attacks; min-max optimization; relation prediction;
DOI
10.1109/TMM.2023.3258624
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Zero-shot learning (ZSL) has recently received extensive attention, especially in fine-grained object recognition, retrieval, and image captioning. Owing to the complete lack of training samples and the high requirement on defense transferability, a learned ZSL model is particularly vulnerable to adversarial attacks. Recent work has also shown that adversarially robust generalization requires more data, which may significantly affect the robustness of ZSL. However, very few efforts have been devoted to this direction. In this paper, we make an initial attempt and propose a generic formulation that provides a systematic solution (named ATZSL) for learning a defensive ZSL model. By casting ZSL as a min-max optimization problem, it achieves better generalization when recognizing various adversarial objects while losing only negligible performance on clean images of unseen classes. To solve this problem, we design a defensive relation prediction network that bridges the seen and unseen class domains via attributes, so that both the prediction and the defense strategy generalize. Additionally, our framework can be extended to handle the scenario in which unseen class attributes are poisoned. Extensive experiments demonstrate that ATZSL obtains a remarkably more favorable trade-off between model transferability and robustness than currently available alternatives under various settings.
Pages: 15-27
Number of pages: 13
Related papers
50 in total
  • [31] Zero-Shot Visual Emotion Recognition by Exploiting BERT
    Kang, Hyunwook
    Hazarika, Devamanyu
    Kim, Dongho
    Kim, Jihie
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2023, 543 : 485 - 494
  • [32] Discriminative Learning of Latent Features for Zero-Shot Recognition
    Li, Yan
    Zhang, Junge
    Zhang, Jianguo
    Huang, Kaiqi
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7463 - 7471
  • [33] Zero-Shot Object Recognition by Semantic Manifold Distance
    Fu, Zhenyong
    Xiang, Tao
    Kodirov, Elyor
    Gong, Shaogang
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2635 - 2644
  • [34] Zero-shot recognition with latent visual attributes learning
    Xie, Yurui
    He, Xiaohai
    Zhang, Jing
    Luo, Xiaodong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (37-38) : 27321 - 27335
  • [35] A ZERO-SHOT ARCHITECTURE FOR ACTION RECOGNITION IN STILL IMAGES
    Safaei, Marjaneh
    Foroosh, Hassan
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 460 - 464
  • [36] Variational Autoencoder for Zero-Shot Recognition of Bai Characters
    Lin, Weiwei
    Ma, Tai
    Zhang, Zeqing
    Li, Xiaofan
    Xue, Xingsi
    Wireless Communications and Mobile Computing, 2022, 2022
  • [37] Deconfounding Causal Inference for Zero-Shot Action Recognition
    Wang, Junyan
    Jiang, Yiqi
    Long, Yang
    Sun, Xiuyu
    Pagnucco, Maurice
    Song, Yang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3976 - 3986
  • [38] Hyperbolic Visual Embedding Learning for Zero-Shot Recognition
    Liu, Shaoteng
    Chen, Jingjing
    Pan, Liangming
    Ngo, Chong-Wah
    Chua, Tat-Seng
    Jiang, Yu-Gang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9270 - 9278
  • [39] Extreme Reverse Projection Learning for Zero-Shot Recognition
    Guan, Jiechao
    Zhao, An
    Lu, Zhiwu
    COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 : 125 - 141
  • [40] Global Semantic Descriptors for Zero-Shot Action Recognition
    Estevam, Valter
    Laroca, Rayson
    Pedrini, Helio
    Menotti, David
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1843 - 1847