A Survey of Adversarial Defenses and Robustness in NLP

Cited by: 21
Authors
Goyal, Shreya [1 ]
Doddapaneni, Sumanth [1 ]
Khapra, Mitesh M. [1 ]
Ravindran, Balaraman [1 ]
Affiliations
[1] Indian Inst Technol Madras, Bhupat & Jyoti Mehta Sch Biosci, Robert Bosch Ctr Data Sci & AI, Chennai 600036, Tamil Nadu, India
Keywords
Adversarial attacks; adversarial defenses; perturbations; NLP; deep neural networks; computer vision; attacks
DOI
10.1145/3593042
CLC number
TP301 [Theory and Methods]
Subject classification code
081202
Abstract
In the past few years, it has become increasingly evident that deep neural networks are not resilient enough to withstand adversarial perturbations in input data, leaving them vulnerable to attack. Various authors have proposed strong adversarial attacks for computer vision and Natural Language Processing (NLP) tasks. In response, many defense mechanisms have been proposed to prevent these networks from failing. The significance of defending neural networks against adversarial attacks lies in ensuring that the model's predictions remain unchanged even when the input data is perturbed. Several methods for adversarial defense in NLP have been proposed, catering to different NLP tasks such as text classification, named entity recognition, and natural language inference. Some of these methods not only defend neural networks against adversarial attacks but also act as a regularization mechanism during training, which helps prevent overfitting. This survey reviews the various methods proposed for adversarial defenses in NLP over the past few years by introducing a novel taxonomy. It also highlights the fragility of advanced deep neural networks in NLP and the challenges involved in defending them.
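To make the regularization point above concrete, below is a minimal sketch of one common defense of this kind: FGM-style adversarial training, which perturbs word embeddings along the loss gradient and trains on the clean plus adversarial loss. This is an illustrative assumption, not the survey's own implementation; the model, function names, and hyperparameters (TinyTextClassifier, adversarial_training_step, epsilon) are all hypothetical.

```python
# Sketch: FGM-style adversarial training on word embeddings (PyTorch).
# Everything here (model, sizes, epsilon) is illustrative, not from the survey.
import torch
import torch.nn as nn

class TinyTextClassifier(nn.Module):
    """A deliberately small classifier: embed -> mean-pool -> linear."""
    def __init__(self, vocab_size=1000, embed_dim=64, num_classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.fc = nn.Linear(embed_dim, num_classes)

    def forward(self, token_ids=None, embeddings=None):
        # Accept raw token ids, or precomputed (possibly perturbed) embeddings.
        if embeddings is None:
            embeddings = self.embed(token_ids)
        return self.fc(embeddings.mean(dim=1))

def adversarial_training_step(model, optimizer, token_ids, labels, epsilon=0.1):
    loss_fn = nn.CrossEntropyLoss()
    optimizer.zero_grad()

    # Clean pass; retain the embedding activations so we can read their gradient.
    embeddings = model.embed(token_ids)
    embeddings.retain_grad()
    clean_loss = loss_fn(model(embeddings=embeddings), labels)
    clean_loss.backward()  # fills parameter grads and embeddings.grad

    # Perturb the embeddings along the normalized loss gradient.
    grad = embeddings.grad
    delta = epsilon * grad / (grad.norm(dim=-1, keepdim=True) + 1e-8)

    # Adversarial pass; its gradients accumulate onto the clean ones, so the
    # adversarial loss acts as an extra regularization term during training.
    # (In this simplified sketch the embedding table only receives the clean
    # gradient; full implementations typically perturb the weight matrix itself.)
    adv_loss = loss_fn(model(embeddings=embeddings.detach() + delta), labels)
    adv_loss.backward()
    optimizer.step()
    return clean_loss.item(), adv_loss.item()

if __name__ == "__main__":
    torch.manual_seed(0)
    model = TinyTextClassifier()
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    token_ids = torch.randint(0, 1000, (8, 16))  # batch of 8, sequence length 16
    labels = torch.randint(0, 2, (8,))
    print(adversarial_training_step(model, optimizer, token_ids, labels))
```

Normalizing the gradient before scaling keeps the perturbation magnitude governed by epsilon alone, which is what lets the same adversarial term serve as a tunable regularizer on top of the clean training objective.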
Pages: 39