Generating Natural Language Adversarial Examples on a Large Scale with Generative Models

Cited by: 5

Authors:

Ren, Yankun [1]

Lin, Jianbin [1]

Tang, Siliang [2]

Zhou, Jun [1]

Yang, Shuang [1]

Qi, Yuan [1]

Ren, Xiang [3]

Affiliations:

[1] Ant Financial Serv Grp, Hangzhou, Peoples R China

[2] Zhejiang Univ, Hangzhou, Peoples R China

[3] Univ Southern Calif, Los Angeles, CA 90007 USA

Source:

ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2020 / Vol. 325

DOI:

10.3233/FAIA200340

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Today, text classification models are widely used. However, these classifiers are easily fooled by adversarial examples. Fortunately, standard attack methods generate adversarial texts in a pair-wise way; that is, an adversarial text can only be created from a real-world text by replacing a few words. In many applications, these texts are limited in number, so their corresponding adversarial examples are often not diverse enough and sometimes hard to read; they can therefore be easily detected by humans and cannot create chaos at a large scale. In this paper, we propose an end-to-end solution to efficiently generate adversarial texts from scratch using generative models, which are not restricted to perturbing the given texts. We call it unrestricted adversarial text generation. Specifically, we train a conditional variational autoencoder (VAE) with an additional adversarial loss to guide the generation of adversarial examples. Moreover, to improve the validity of adversarial texts, we utilize discriminators and the training framework of generative adversarial networks (GANs) to make adversarial texts consistent with real data. Experimental results on sentiment analysis demonstrate the scalability and efficiency of our method. It can attack text classification models with a higher success rate than existing methods while providing acceptable quality for humans.
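
The abstract names the ingredients of the training objective (a VAE loss plus an additional adversarial loss) but does not give a formula. The following is only a minimal sketch of how such terms might be combined, assuming a Gaussian latent, a precomputed reconstruction negative log-likelihood, and a weighted cross-entropy term against a victim classifier. The function names, the `adv_weight` parameter, and the exact form of the adversarial term are illustrative assumptions, not the authors' formulation, and the GAN discriminator term is omitted.

```python
import math


def kl_to_standard_normal(mu, log_var):
    """KL divergence from N(mu, diag(exp(log_var))) to N(0, I) for one latent vector."""
    return 0.5 * sum(
        math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, log_var)
    )


def cross_entropy(probs, label):
    """Negative log-probability that `probs` assigns to `label`."""
    return -math.log(probs[label])


def combined_loss(recon_nll, mu, log_var, victim_probs, target_label, adv_weight=1.0):
    """Hypothetical objective: VAE loss plus a weighted adversarial term.

    recon_nll     -- reconstruction negative log-likelihood of the text (assumed given)
    victim_probs  -- class probabilities the victim classifier assigns to the generated text
    target_label  -- the (wrong) label the attacker wants the victim to predict
    adv_weight    -- assumed trade-off weight, not taken from the source
    """
    vae_loss = recon_nll + kl_to_standard_normal(mu, log_var)
    # The adversarial term is small when the victim is fooled into target_label.
    adv_loss = cross_entropy(victim_probs, target_label)
    return vae_loss + adv_weight * adv_loss


if __name__ == "__main__":
    fooled = combined_loss(2.0, [0.0], [0.0], [0.1, 0.9], target_label=1)
    not_fooled = combined_loss(2.0, [0.0], [0.0], [0.9, 0.1], target_label=1)
    print(fooled < not_fooled)
```

In this sketch, a generated text that pushes the victim classifier toward the attacker's target label yields a lower total loss, which is the sense in which an adversarial term can guide generation.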

Pages: 2156-2163

Page count: 8
