Adversarial Robustness through Disentangled Representations

被引:0
|
作者
Yang, Shuo [1 ]
Guo, Tianyu [2 ]
Wang, Yunhe [3 ]
Xu, Chang [1 ]
机构
[1] Univ Sydney, Fac Engn, Sch Comp Sci, Sydney, NSW, Australia
[2] Peking Univ, Sch EECS, Key Lab Machine Percept MOE, CMIC, Beijing, Peoples R China
[3] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
基金
澳大利亚研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the remarkable empirical performance of deep learning models, their vulnerability to adversarial examples has been revealed in many studies. They are prone to make a susceptible prediction to the input with imperceptible adversarial perturbation. Although recent works have remarkably improved the model's robustness under the adversarial training strategy, an evident gap between the natural accuracy and adversarial robustness inevitably exists. In order to mitigate this problem, in this paper, we assume that the robust and non-robust representations are two basic ingredients entangled in the integral representation. For achieving adversarial robustness, the robust representations of natural and adversarial examples should be disentangled from the non-robust part and the alignment of the robust representations can bridge the gap between accuracy and robustness. Inspired by this motivation, we propose a novel defence method called Deep Robust Representation Disentanglement Network (DRRDN). Specifically, DRRDN employs a disentangler to extract and align the robust representations from both adversarial and natural examples. Theoretical analysis guarantees the mitigation of the trade-off between robustness and accuracy with good disentanglement and alignment performance. Experimental results on benchmark datasets finally demonstrate the empirical superiority of our method.
引用
收藏
页码:3145 / 3153
页数:9
相关论文
共 50 条
  • [1] Achieving Robustness in the Wild via Adversarial Mixing with Disentangled Representations
    Gowal, Sven
    Qin, Chongli
    Huang, Po-Sen
    Cemgil, Taylan
    Dvijotham, Krishnamurthy
    Mann, Timothy
    Kohli, Pushmeet
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1208 - 1217
  • [2] Learning Interpretable Disentangled Representations Using Adversarial VAEs
    Sarhan, Mhd Hasan
    Eslami, Abouzar
    Navab, Nassir
    Albarqouni, Shadi
    [J]. DOMAIN ADAPTATION AND REPRESENTATION TRANSFER AND MEDICAL IMAGE LEARNING WITH LESS LABELS AND IMPERFECT DATA, DART 2019, MIL3ID 2019, 2019, 11795 : 37 - 44
  • [3] Adversarial Learning of Disentangled and Generalizable Representations of Visual Attributes
    Oldfield, James
    Panagakis, Yannis
    Nicolaou, Mihalis A.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3498 - 3509
  • [4] Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations
    Benaroya, Laurent
    Obin, Nicolas
    Roebel, Axel
    [J]. ENTROPY, 2023, 25 (02)
  • [5] An Adversarial Neuro-Tensorial Approach for Learning Disentangled Representations
    Wang, Mengjiao
    Shu, Zhixin
    Cheng, Shiyang
    Panagakis, Yannis
    Samaras, Dimitris
    Zafeiriou, Stefanos
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (6-7) : 743 - 762
  • [6] An Adversarial Neuro-Tensorial Approach for Learning Disentangled Representations
    Mengjiao Wang
    Zhixin Shu
    Shiyang Cheng
    Yannis Panagakis
    Dimitris Samaras
    Stefanos Zafeiriou
    [J]. International Journal of Computer Vision, 2019, 127 : 743 - 762
  • [7] LEARNING DISENTANGLED FEATURE REPRESENTATIONS FOR SPEECH ENHANCEMENT VIA ADVERSARIAL TRAINING
    Hou, Nana
    Xu, Chenglin
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 666 - 670
  • [8] Robustly Disentangled Causal Mechanisms: Validating Deep Representations for Interventional Robustness
    Suter, Raphael
    Miladinovic, Dorde
    Schoelkopf, Bernhard
    Bauer, Stefan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [9] Disentangled Text Representation Learning With Information-Theoretic Perspective for Adversarial Robustness
    Zhao, Jiahao
    Mao, Wenji
    Zeng, Daniel Dajun
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1237 - 1247
  • [10] TOWARDS ADVERSARIAL ROBUSTNESS VIA COMPACT FEATURE REPRESENTATIONS
    Shah, Muhammad A.
    Olivier, Raphael
    Raj, Bhiksha
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3845 - 3849