DLR: Adversarial examples detection and label recovery for deep neural networks

Cited by: 0

Authors:

Han, Keji [1,2]

Ge, Yao [1,2]

Wang, Ruchuan [1,3]

Li, Yun [1,2]

Affiliations:

[1] Nanjing Univ Posts & Telecommun, Wenyuan Rd 9, Nanjing 210046, Jiangsu, Peoples R China

[2] Jiangsu Key Lab Big Data Secur & Intelligent Proc, Wenyuan Rd 9, Nanjing 210046, Jiangsu, Peoples R China

[3] Jiangsu High Technol Res Key Lab Wireless Sensor N, Wenyuan Rd 9, Nanjing 210046, Jiangsu, Peoples R China

Source:

PATTERN RECOGNITION LETTERS | 2025 / Volume 188

Funding:

National Natural Science Foundation of China;

Keywords:

Deep neural network; Generative classifier; Adversarial example; Anomaly detection;

DOI:

10.1016/j.patrec.2024.12.009

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Deep neural networks (DNNs) have been shown to be vulnerable to adversarial examples crafted by adversaries to deceive the target model. Two popular approaches to mitigate this issue are adversarial training and adversarial example detection. Adversarial training aims to enable the target model to accurately recognize adversarial examples in image classification tasks; however, it often lacks generalizability. Conversely, adversarial detection demonstrates good generalization but does not assist the target model in recognizing adversarial examples. In this paper, we first define the label recovery task to address the adversarial challenges faced by DNNs. We then propose a novel generative classifier specifically for the adversarial example label recovery task. This method is termed Detection and Label Recovery (DLR), which comprises two components: Detector and Recover. The Detector processes both legitimate and adversarial examples, while the Recover component seeks to ascertain the ground-truth label of the detected adversarial example. DLR effectively combines the strengths of adversarial training and adversarial example detection. Experimental results demonstrate that our method outperforms several state-of-the-art approaches.

Pages: 133-139

Number of pages: 7
