Generating Natural Language Adversarial Examples on a Large Scale with Generative Models

被引:5
|
作者
Ren, Yankun [1 ]
Lin, Jianbin [1 ]
Tang, Siliang [2 ]
Zhou, Jun [1 ]
Yang, Shuang [1 ]
Qi, Yuan [1 ]
Ren, Xiang [3 ]
机构
[1] Ant Financial Serv Grp, Hangzhou, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
[3] Univ Southern Calif, Los Angeles, CA 90007 USA
关键词
D O I
10.3233/FAIA200340
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Today text classification models have been widely used. However, these classifiers are found to be easily fooled by adversarial examples. Fortunately, standard attacking methods generate adversarial texts in a pair-wise way, that is, an adversarial text can only be created from a real-world text by replacing a few words. In many applications, these texts are limited in numbers, therefore their corresponding adversarial examples are often not diverse enough and sometimes hard to read, thus can be easily detected by humans and cannot create chaos at a large scale. In this paper, we propose an end to end solution to efficiently generate adversarial texts from scratch using generative models, which are not restricted to perturbing the given texts. We call it unrestricted adversarial text generation. Specifically, we train a conditional variational autoencoder (VAE) with an additional adversarial loss to guide the generation of adversarial examples. Moreover, to improve the validity of adversarial texts, we utilize discrimators and the training framework of generative adversarial networks (GANs) to make adversarial texts consistent with real data. Experimental results on sentiment analysis demonstrate the scalability and efficiency of our method. It can attack text classification models with a higher success rate than existing methods, and provide acceptable quality for humans in the meantime.
引用
收藏
页码:2156 / 2163
页数:8
相关论文
共 50 条
  • [41] Really natural adversarial examples
    Pedraza, Anibal
    Deniz, Oscar
    Bueno, Gloria
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (04) : 1065 - 1077
  • [42] Really natural adversarial examples
    Anibal Pedraza
    Oscar Deniz
    Gloria Bueno
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 1065 - 1077
  • [43] Encoding large-scale cosmological structure with generative adversarial networks
    Ullmo, Marion
    Decelle, Aurelien
    Aghanim, Nabila
    ASTRONOMY & ASTROPHYSICS, 2021, 651
  • [44] Adversarial Examples Detection for XSS Attacks Based on Generative Adversarial Networks
    Zhang, Xueqin
    Zhou, Yue
    Pei, Songwen
    Zhuge, Jingjing
    Chen, Jiahao
    IEEE ACCESS, 2020, 8 (08): : 10989 - 10996
  • [45] An efficient framework for generating robust adversarial examples
    Zhang, Lili
    Wang, Xiaoping
    Lu, Kai
    Peng, Shaoliang
    Wang, Xiaodong
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2020, 35 (09) : 1433 - 1449
  • [46] Generating Transferable Adversarial Examples for Speech Classification
    Kim, Hoki
    Park, Jinseong
    Lee, Jaewook
    PATTERN RECOGNITION, 2023, 137
  • [47] Generating adversarial examples with input significance indicator
    Qiu, Xiaofeng
    Zhou, Shuya
    NEUROCOMPUTING, 2020, 394 : 1 - 12
  • [48] Discovering the Syntax and Strategies of Natural Language Programming with Generative Language Models
    Jiang, Ellen
    Toh, Edwin
    Molina, Alejandra
    Olson, Kristen
    Kayacik, Claire
    Donsbach, Aaron
    Cai, Carrie J.
    Terry, Michael
    PROCEEDINGS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI' 22), 2022,
  • [49] Large Language Models and Generative AI, Oh My!
    Zyda, Michael
    COMPUTER, 2024, 57 (03) : 127 - 132
  • [50] Large language models for generative information extraction: a survey
    Xu, Derong
    Chen, Wei
    Peng, Wenjun
    Zhang, Chao
    Xu, Tong
    Zhao, Xiangyu
    Wu, Xian
    Zheng, Yefeng
    Wang, Yang
    Chen, Enhong
    Frontiers of Computer Science, 2024, 18 (06)