Joint Training with Semantic Role Labeling for Better Generalization in Natural Language Inference

被引:0
|
作者
Cengiz, Cemil [1 ]
Yuret, Deniz [1 ]
机构
[1] Koc Univ, KUIS AI Lab, Istanbul, Turkey
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
End-to-end models trained on natural language inference (NLI) datasets show low generalization on out-of-distribution evaluation sets. The models tend to learn shallow heuristics due to dataset biases. The performance decreases dramatically on diagnostic sets measuring compositionality or robustness against simple heuristics. Existing solutions for this problem employ dataset augmentation which has the drawbacks of being applicable to only a limited set of adversaries and at worst hurting the model performance on other adversaries not included in the augmentation set. Our proposed solution is to improve sentence understanding (hence out-of-distribution generalization) with joint learning of explicit semantics. We show that a BERT based model trained jointly on English semantic role labeling (SRL) and NLI achieves significantly higher performance on external evaluation sets measuring generalization performance.
引用
收藏
页码:78 / 88
页数:11
相关论文
共 50 条
  • [31] Co-training for Low Resource Scientific Natural Language Inference
    Sadat, Mobashir
    Caragea, Cornelia
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 2538 - 2550
  • [32] Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both?
    Shaw, Peter
    Chang, Ming-Wei
    Pasupat, Panupong
    Toutanova, Kristina
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 922 - 938
  • [33] SRTube: Video-Language Pre-Training with Action-Centric Video Tube Features and Semantic Role Labeling
    Lee, Ju-Hee
    Kang, Je-Won
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 13689 - 13699
  • [34] Natural language spoken interface control using data-driven semantic inference
    Bellegarda, JR
    Silverman, KEA
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (03): : 267 - 277
  • [35] Semi-Supervised Semantic Role Labeling with Cross-View Training
    Cai, Rui
    Lapata, Mirella
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1018 - 1027
  • [36] Span-Based Semantic Role Labeling with Argument Pruning and Second-Order Inference
    Jia, Zixia
    Yan, Zhaohui
    Wu, Haoyi
    Tu, Kewei
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10822 - 10830
  • [37] Joint Labeling of Syntactic Function and Semantic Role Using Probabilistic Finite State Automata
    Salama, Amr Rekaby
    Menzel, Wolfgang
    INTELLIGENT SYSTEMS AND APPLICATIONS, INTELLISYS, VOL 2, 2019, 869 : 588 - 605
  • [38] Semantic role labeling for Arabic language using case-based reasoning approach
    Meguehout H.
    Bouhadada T.
    Laskri M.T.
    Meguehout, Hamza (meguehout.hamza@gmail.com), 1600, Springer Science and Business Media, LLC (20): : 363 - 372
  • [39] Domain Adaptation in Semantic Role Labeling Using a Neural Language Model and Linguistic Resources
    Quynh Thi Ngoc Do
    Bethard, Steven
    Moens, Marie-Francine
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1812 - 1823
  • [40] Tree Kernel-Based Semantic Role Labeling in Chinese Language for Nominal Predicates
    Wang, Bu-Kang
    Wang, Hong-Ling
    Yuan, Xiao-Hong
    Zhou, Guo-Dong
    11TH CHINESE LEXICAL SEMANTICS WORKSHOP (CKSW2010), 2010, : 425 - 431