Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation

被引:0
|
作者
Staliunaite, Ieva [1 ]
Gorinski, Philip John [1 ]
Iacobacci, Ignacio [1 ]
机构
[1] Huawei Noahs Ark Lab, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining the plausibility of causal relations between clauses is a commonsense reasoning task that requires complex inference ability. The general approach to this task is to train a large pretrained language model on a specific dataset. However, the available training data for the task is often scarce, which leads to instability of model training or reliance on the shallow features of the dataset. This paper presents a number of techniques for making models more robust in the domain of causal reasoning. Firstly, we perform adversarial training by generating perturbed inputs through synonym substitution. Secondly, based on a linguistic theory of discourse connectives, we perform data augmentation using a discourse parser for detecting causally linked clauses in large text, and generating distractors with a generative language model. Both methods boost model performance on the Choice of Plausible Alternatives (COPA) dataset, as well as on a Balanced COPA dataset, which is a modified version of the original data that has been developed to avoid superficial cues, leading to a more challenging benchmark. We show a statistically significant improvement in performance and robustness on both datasets, even with only a small number of additionally generated data points.
引用
收藏
页码:13834 / 13842
页数:9
相关论文
共 50 条
  • [1] Generative Data Augmentation for Commonsense Reasoning
    Yang, Yiben
    Malaviya, Chaitanya
    Fernandez, Jared
    Swayamdipta, Swabha
    Le Bras, Ronan
    Wang, Ji-Ping
    Bhagavatula, Chandra
    Choi, Yejin
    Downe, Doug
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1008 - 1025
  • [2] Adversarial Training for Commonsense Inference
    Pereira, Lis
    Liu, Xiaodong
    Cheng, Fei
    Asahara, Masayuki
    Kobayashi, Ichiro
    [J]. 5TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2020), 2020, : 55 - 60
  • [3] XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
    Ponti, Edoardo M.
    Glaves, Goran
    Majewska, Olga
    Liu, Qianchu
    Vulic, Ivan
    Korhonen, Anna
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2362 - 2376
  • [4] Commonsense Causal Reasoning between Short Texts
    Luo, Zhiyi
    Sha, Yuchen
    Zhu, Kenny Q.
    Hwang, Seung-won
    Wang, Zhongyuan
    [J]. FIFTEENTH INTERNATIONAL CONFERENCE ON THE PRINCIPLES OF KNOWLEDGE REPRESENTATION AND REASONING, 2016, : 421 - 430
  • [5] TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP
    Morris, John X.
    Lifland, Eli
    Yoo, Jin Yong
    Grigsby, Jake
    Jin, Di
    Qi, Yanjun
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING: SYSTEM DEMONSTRATIONS, 2020, : 119 - 126
  • [6] COLA: Contextualized Commonsense Causal Reasoning from the Causal Inference Perspective
    Wang, Zhaowei
    Do, Quyet V.
    Zhang, Hongming
    Zhang, Jiayao
    Wang, Weiqi
    Fang, Tianqing
    Song, Yangqiu
    Wong, Ginny Y.
    See, Simon
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5253 - 5271
  • [7] REMOTE SENSING DATA AUGMENTATION THROUGH ADVERSARIAL TRAINING
    Lv, Ning
    Ma, Hongxiang
    Chen, Chen
    Pei, Qingqi
    Zhou, Yang
    Xiao, Fenglin
    Li, Ji
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2511 - 2514
  • [8] Remote Sensing Data Augmentation Through Adversarial Training
    Lv, Ning
    Ma, Hongxiang
    Chen, Chen
    Pei, Qingqi
    Zhou, Yang
    Xiao, Fenglin
    Li, Ji
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 9318 - 9333
  • [9] Enhancing Narrative Commonsense Reasoning With Multilevel Causal Knowledge
    Mu, Feiteng
    Li, Wenjie
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [10] Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation
    Peng, Xi
    Tang, Zhiqiang
    Yang, Fei
    Feris, Rogerio S.
    Metaxas, Dimitris
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2226 - 2234