Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation

被引:0
|
作者
Staliunaite, Ieva [1 ]
Gorinski, Philip John [1 ]
Iacobacci, Ignacio [1 ]
机构
[1] Huawei Noahs Ark Lab, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining the plausibility of causal relations between clauses is a commonsense reasoning task that requires complex inference ability. The general approach to this task is to train a large pretrained language model on a specific dataset. However, the available training data for the task is often scarce, which leads to instability of model training or reliance on the shallow features of the dataset. This paper presents a number of techniques for making models more robust in the domain of causal reasoning. Firstly, we perform adversarial training by generating perturbed inputs through synonym substitution. Secondly, based on a linguistic theory of discourse connectives, we perform data augmentation using a discourse parser for detecting causally linked clauses in large text, and generating distractors with a generative language model. Both methods boost model performance on the Choice of Plausible Alternatives (COPA) dataset, as well as on a Balanced COPA dataset, which is a modified version of the original data that has been developed to avoid superficial cues, leading to a more challenging benchmark. We show a statistically significant improvement in performance and robustness on both datasets, even with only a small number of additionally generated data points.
引用
收藏
页码:13834 / 13842
页数:9
相关论文
共 50 条
  • [41] Sliced Wasserstein adversarial training for improving adversarial robustness
    Lee, Woojin
    Lee, Sungyoon
    Kim, Hoki
    Lee, Jaewook
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (08) : 3229 - 3242
  • [42] Generative adversarial network augmentation for solving the training data imbalance problem in crop classification
    Shumilo, Leonid
    Okhrimenko, Anton
    Kussul, Nataliia
    Drozd, Sofiia
    Shkalikov, Oleh
    [J]. REMOTE SENSING LETTERS, 2023, 14 (11) : 1131 - 1140
  • [43] Visual Choice of Plausible Alternatives: An Evaluation of Image-based Commonsense Causal Reasoning
    Yeo, Jinyoung
    Lee, Gyeongbok
    Wang, Gengyu
    Choi, Seungtaek
    Cho, Hyunsouk
    Amplayo, Reinald Kim
    Hwang, Seung-won
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2009 - 2013
  • [44] Elaborating sensor data using temporal and spatial commonsense reasoning
    Morgan, Bo
    Singh, Push
    [J]. BSN 2006: INTERNATIONAL WORKSHOP ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS, PROCEEDINGS, 2006, : 187 - +
  • [45] Understanding and Improving Fast Adversarial Training
    Andriushchenko, Maksym
    Flammarion, Nicolas
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [46] Causal reasoning from longitudinal data
    Arjas, E
    Parner, J
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2004, 31 (02) : 171 - 187
  • [47] PointDrop: Improving Object Detection from Sparse Point Clouds via Adversarial Data Augmentation
    Ma, Wenxin
    Chen, Jian
    Du, Qing
    Jia, Wei
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10004 - 10009
  • [48] Improving Astronomical Time-series Classification via Data Augmentation with Generative Adversarial Networks
    Garcia-Jara, German
    Protopapas, Pavlos
    Estevez, Pablo A.
    [J]. ASTROPHYSICAL JOURNAL, 2022, 935 (01):
  • [49] Training Augmentation with Adversarial Examples for Robust Speech Recognition
    Sun, Sining
    Yeh, Ching-Feng
    Ostendorf, Mari
    Hwang, Mei-Yuh
    Xie, Lei
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2404 - 2408
  • [50] Virtual Adversarial Training and Data Augmentation for Acoustic Event Detection with Gated Recurrent Neural Networks
    Zoehrer, Matthias
    Pernkopf, Franz
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 493 - 497