A4NT: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation

被引:0
|
作者
Shetty, Rakshith [1 ]
Schiele, Bernt [1 ]
Fritz, Mario [1 ]
机构
[1] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text-based analysis methods enable an adversary to reveal privacy relevant author attributes such as gender, age and can identify the text's author. Such methods can compromise the privacy of an anonymous author even when the author tries to remove privacy sensitive content. In this paper, we propose an automatic method, called the Adversarial Author Attribute Anonymity Neural Translation (A(4)NT), to combat such text-based adversaries. Unlike prior works on obfuscation, we propose a system that is fully automatic and learns to perform obfuscation entirely from the data. This allows us to easily apply the A(4)NT system to obfuscate different author attributes. We propose a sequence-to-sequence language model, inspired by machine translation, and an adversarial training framework to design a system which learns to transform the input text to obfuscate the author attributes without paired data. We also propose and evaluate techniques to impose constraints on our A(4)NT model to preserve the semantics of the input text. A(4)NT learns to make minimal changes to the input to successfully fool author attribute classifiers, while preserving the meaning of the input text. Our experiments on two datasets and three settings show that the proposed method is effective in fooling the attribute classifiers and thus improves the anonymity of authors.
引用
收藏
页码:1633 / 1650
页数:18
相关论文
共 50 条
  • [1] Generative adversarial training for neural machine translation
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    [J]. NEUROCOMPUTING, 2018, 321 : 146 - 155
  • [2] Adversarial Training for Unknown Word Problems in Neural Machine Translation
    Ji, Yatu
    Hou, Hongxu
    Chen, Junjie
    Wu, Nier
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (01)
  • [3] Noise-Based Adversarial Training for Enhancing Agglutinative Neural Machine Translation
    Ji, Yatu
    Hou, Hongxu
    Chen, Junjie
    Wu, Nier
    [J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2019, 11670 : 392 - 396
  • [4] Crafting Adversarial Examples for Neural Machine Translation
    Zhang, Xinze
    Zhang, Junzhe
    Chen, Zhenhua
    He, Kun
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1967 - 1977
  • [5] Manifold Adversarial Augmentation for Neural Machine Translation
    Chen, Guandan
    Fan, Kai
    Zhang, Kaibo
    Chen, Boxing
    Huang, Zhongqiang
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3184 - 3189
  • [6] Effective Adversarial Regularization for Neural Machine Translation
    Sato, Motoki
    Suzuki, Jun
    Kiyono, Shun
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 204 - 210
  • [7] Robust Neural Machine Translation with Doubly Adversarial Inputs
    Cheng, Yong
    Jiang, Lu
    Macherey, Wolfgang
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4324 - 4333
  • [8] Adversarial Subword Regularization for Robust Neural Machine Translation
    Park, Jungsoo
    Sung, Mujeen
    Lee, Jinhyuk
    Kang, Jaewoo
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1945 - 1953
  • [9] IMPROVING ADVERSARIAL NEURAL MACHINE TRANSLATION WITH PRIOR KNOWLEDGE
    Yang, Yating
    Li, Xiao
    Jiang, Tonghai
    Kong, Jinying
    Ma, Bo
    Zhou, Xi
    Wang, Lei
    [J]. 2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1373 - 1377
  • [10] A Reinforced Generation of Adversarial Examples for Neural Machine Translation
    Zou, Wei
    Huang, Shujian
    Xie, Jun
    Dai, Xinyu
    Chen, Jiajun
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3486 - 3497