Blacklight: Scalable Defense for Neural Networks against Query-Based Black-Box Attacks

Cited by: 0

Authors:

Li, Huiying [1]

Shan, Shawn [1]

Wenger, Emily [1]

Zhang, Jiayun [2, 3]

Zheng, Haitao [1]

Zhao, Ben Y. [1]

Affiliations:

[1] Univ Chicago, Chicago, IL 60637 USA

[2] Fudan Univ, Shanghai, Peoples R China

[3] Univ Chicago, SAND Lab, Chicago, IL 60637 USA

Source:

PROCEEDINGS OF THE 31ST USENIX SECURITY SYMPOSIUM | 2022

Keywords:

DOI:

Not available

CLC classification number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Deep learning systems are known to be vulnerable to adversarial examples. In particular, query-based black-box attacks do not require knowledge of the deep learning model, but can compute adversarial examples over the network by submitting queries and inspecting returns. Recent work largely improves the efficiency of those attacks, demonstrating their practicality on today's ML-as-a-service platforms. We propose Blacklight, a new defense against query-based black-box adversarial attacks. Blacklight is driven by a fundamental insight: to compute adversarial examples, these attacks perform iterative optimization over the network, producing queries highly similar in the input space. Thus Blacklight detects query-based black-box attacks by detecting highly similar queries, using an efficient similarity engine operating on probabilistic content fingerprints. We evaluate Blacklight against eight state-of-the-art attacks, across a variety of models and image classification tasks. Blacklight identifies them all, often after only a handful of queries. By rejecting all detected queries, Blacklight prevents any attack from completing, even when persistent attackers continue to submit queries after banned accounts or rejected queries. Blacklight is also robust against several powerful countermeasures, including an optimal black-box attack that approximates white-box attacks in efficiency. Finally, we illustrate how Blacklight generalizes to other domains like text classification.
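
The abstract's core idea, flagging a query when its content fingerprint overlaps heavily with that of an earlier query, can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from this record: the quantization step, the sliding-window size, the use of SHA-256, keeping the `top_k` largest hashes, the match `threshold`, and the hypothetical names `fingerprint` and `SimilarityDetector`. It is a toy under those assumptions, not the authors' implementation.

```python
import hashlib


def fingerprint(pixels, window=4, quant=16, top_k=8):
    """Return a small set of hashes summarizing a query (assumed scheme)."""
    # Quantize so that very small perturbations map to the same values.
    q = bytes(p // quant for p in pixels)
    # Hash every sliding window of the quantized input.
    hashes = {
        hashlib.sha256(q[i:i + window]).hexdigest()
        for i in range(len(q) - window + 1)
    }
    # Keep only the top_k largest hashes as a compact, sampled fingerprint.
    return set(sorted(hashes, reverse=True)[:top_k])


class SimilarityDetector:
    """Flags a query whose fingerprint overlaps an earlier query's fingerprint."""

    def __init__(self, threshold=4):
        self.threshold = threshold  # minimum shared hashes to flag (assumed)
        self.seen = []              # fingerprints of all earlier queries

    def is_attack_query(self, pixels):
        fp = fingerprint(pixels)
        flagged = any(len(fp & old) >= self.threshold for old in self.seen)
        self.seen.append(fp)
        return flagged


# Toy 64-value "images" with pixel values in 0..255.
base = [(i * 7) % 128 for i in range(64)]
near = list(base)
near[10] += 1  # tiny perturbation, as in an iterative attack step
far = [128 + (i * 11) % 128 for i in range(64)]  # unrelated input

detector = SimilarityDetector()
print(detector.is_attack_query(base))  # first query, nothing to compare against
print(detector.is_attack_query(near))  # near-duplicate of the first query
print(detector.is_attack_query(far))   # shares no window with earlier queries
```

In this toy setup the near-duplicate query is flagged while the first and the unrelated queries are not. A linear scan over `seen` is used only for brevity; a deployment at scale would presumably need an indexed lookup.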

Citation

Pages: 2117-2134

Page count: 18
