Joint Training with Semantic Role Labeling for Better Generalization in Natural Language Inference

Authors
Cengiz, Cemil [1]
Yuret, Deniz [1]
Affiliations
[1] Koc Univ, KUIS AI Lab, Istanbul, Turkey
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
End-to-end models trained on natural language inference (NLI) datasets generalize poorly to out-of-distribution evaluation sets. The models tend to learn shallow heuristics due to dataset biases, and their performance decreases dramatically on diagnostic sets that measure compositionality or robustness against simple heuristics. Existing solutions to this problem employ dataset augmentation, which has two drawbacks: it applies only to a limited set of adversaries, and at worst it hurts model performance on adversaries not included in the augmentation set. Our proposed solution is to improve sentence understanding (and hence out-of-distribution generalization) through joint learning of explicit semantics. We show that a BERT-based model trained jointly on English semantic role labeling (SRL) and NLI achieves significantly higher performance on external evaluation sets that measure generalization.
Pages: 78-88
Page count: 11