Generating Fluent Chinese Adversarial Examples for Sentiment Classification

被引:0
|
作者
Wang, Congyi [1 ,2 ]
Zeng, Jianping [1 ,2 ]
Wu, Chengrong [1 ,2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai 200433, Peoples R China
基金
国家重点研发计划;
关键词
Adversarial examples; Chinese natural language; Sentiment classification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Highly accurate classifiers can be trained by existing machine learning models, however, most of these classifiers do not consider the adversarial attack. This makes these classifiers vulnerable to adversarial examples. In order to improve the ability of sentiment classifiers to resist the adversarial attack, it is very important to generate high-quality adversarial examples. Most of the existing methods that generate natural language adversarial examples aim at English text with relatively simple strategies, but a single transformation strategy is easily detected by the defender. In this paper, we propose a new method to generate Chinese natural language adversarial examples, which is called AD-ER (Adversarial Examples with Readability). The first step is to select the important words in the text, which have great impact on the sentiment classifier. Then we proposed four variant strategies to replace the important words and the best candidate word is selected heuristically under the constraints of its readability and maximum entropy model. The simulation results on a real shopping review dataset verify that the examples generated by our method can produce large attack disturbance to the classifiers. Different from other examples, our examples have good readability and diversity, which are more fluent and harder to be detected.
引用
收藏
页码:149 / +
页数:6
相关论文
共 50 条
  • [1] Generating Fluent Adversarial Examples for Natural Languages
    Zhang, Huangzhao
    Zhou, Hao
    Miao, Ning
    Li, Lei
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5564 - 5569
  • [2] Generating Transferable Adversarial Examples for Speech Classification
    Kim, Hoki
    Park, Jinseong
    Lee, Jaewook
    PATTERN RECOGNITION, 2023, 137
  • [3] Generating natural adversarial examples with universal perturbations for text classification
    Gao, Haoran
    Zhang, Hua
    Yang, Xingguo
    Li, Wenmin
    Gao, Fei
    Wen, Qiaoyan
    NEUROCOMPUTING, 2022, 471 : 175 - 182
  • [4] Generating Adversarial Examples for Topic-Dependent Argument Classification
    Mayer, Tobias
    Marro, Santiago
    Cabrio, Elena
    Villata, Serena
    COMPUTATIONAL MODELS OF ARGUMENT (COMMA 2020), 2020, 326 : 33 - 44
  • [5] Generating Adversarial Examples with Adversarial Networks
    Xiao, Chaowei
    Li, Bo
    Zhu, Jun-Yan
    He, Warren
    Liu, Mingyan
    Song, Dawn
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3905 - 3911
  • [6] Adversarial Examples Generation Method for Chinese Text Classification
    Xu, En-Hui
    Zhang, Xiao-Lin
    Wang, Yong-Ping
    Zhang, Shuai
    Liu, Li-Xin
    Xu, Li
    International Journal of Network Security, 2022, 24 (04) : 587 - 596
  • [7] Survey on Generating Adversarial Examples
    Pan W.-W.
    Wang X.-Y.
    Song M.-L.
    Chen C.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (01): : 67 - 81
  • [8] WordRevert: Adversarial Examples Defence Method for Chinese Text Classification
    Xu, Enhui
    Zhang, Xiaolin
    Wang, Yongping
    Zhang, Shuai
    Lu, Lixin
    Xu, Li
    IEEE ACCESS, 2022, 10 : 28832 - 28841
  • [9] WordChange: Adversarial Examples Generation Approach for Chinese Text Classification
    Nuo, Cheng
    Chang, Guo-Qin
    Gao, Haichang
    Pei, Ge
    Zhang, Yang
    IEEE ACCESS, 2020, 8 (08): : 79561 - 79572
  • [10] Adversarial Examples Generation Approach for Tendency Classification on Chinese Texts
    Wang W.-Q.
    Wang R.
    Wang L.-N.
    Tang B.-X.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (08): : 2415 - 2427