Connective Prediction for Implicit Discourse Relation Recognition via Knowledge Distillation

被引:0
|
作者
Wu, Hongyi [1 ]
Zhou, Hao [1 ]
Lan, Man [1 ,2 ,3 ]
Wu, Yuanbin [1 ]
Zhang, Yadong [1 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
[2] East China Normal Univ, Shanghai Inst AI Educ, Shanghai, Peoples R China
[3] Lingang Lab, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1 | 2023年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Implicit discourse relation recognition (IDRR) remains a challenging task in discourse analysis due to the absence of connectives. Most existing methods utilize one-hot labels as the sole optimization target, ignoring the internal association among connectives. Besides, these approaches spend lots of effort on template construction, negatively affecting the generalization capability. To address these problems, we propose a novel Connective Prediction via Knowledge Distillation (CP-KD) approach to instruct large-scale pre-trained language models (PLMs) mining the latent correlations between connectives and discourse relations, which is meaningful for IDRR. Experimental results on the PDTB 2.0/3.0 and CoNLL 2016 datasets show that our method significantly outperforms the state-of-the-art models on coarse-grained and fine-grained discourse relations. Moreover, our approach can be transferred to explicit discourse relation recognition (EDRR) and achieve acceptable performance. Our code is released in https://github.com/cubenlp/CP_KD- for-IDRR.
引用
收藏
页码:5908 / 5923
页数:16
相关论文
共 50 条
  • [31] TransS-Driven Joint Learning Architecture for Implicit Discourse Relation Recognition
    He, Ruifang
    Wang, Jian
    Guo, Fengyu
    Han, Yugui
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 139 - 148
  • [32] Hierarchical multi-task learning with CRF for implicit discourse relation recognition
    Wu, Changxing
    Hu, Chaowen
    Li, Ruochen
    Lin, Hongyu
    Su, Jinsong
    KNOWLEDGE-BASED SYSTEMS, 2020, 195
  • [33] Meta-learning based instance manipulation for implicit discourse relation recognition
    Zeng, Jiali
    Xie, Binbin
    Wu, Changxing
    Yin, Yongjing
    Zeng, Hualin
    Su, Jinsong
    KNOWLEDGE-BASED SYSTEMS, 2023, 267
  • [34] Using Active Learning to Expand Training Data for Implicit Discourse Relation Recognition
    Xu, Yang
    Hong, Yu
    Ruan, Huibin
    Yao, Jianmin
    Zhang, Min
    Zhou, Guodong
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 725 - 731
  • [35] Implicit Discourse Relation Recognition Integrating Feature Coding and Phrase Interaction Perception
    Wang X.-L.
    Jin F.-Y.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (04): : 1377 - 1388
  • [36] Implicit Discourse Relation Recognition Based on Contextual Interaction Perception and Pattern Filtering
    Guo F.-Y.
    He R.-F.
    Dang J.-W.
    Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (05): : 901 - 915
  • [37] Prompt-based Logical Semantics Enhancement for Implicit Discourse Relation Recognition
    Wang, Chenxu
    Jian, Ping
    Huang, Mu
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 687 - 699
  • [38] Cross-lingual implicit discourse relation recognition with co-training
    Lu, Yao-jie
    Xu, Mu
    Wu, Chang-xing
    Xiong, De-yi
    Wang, Hong-ji
    Su, Jin-song
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (05) : 651 - 661
  • [39] Encoding and Fusing Semantic Connection and Linguistic Evidence for Implicit Discourse Relation Recognition
    Xiang, Wei
    Wang, Bang
    Dai, Lu
    Mo, Yijun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3247 - 3257
  • [40] Implicit Discourse Relation Recognition via a BiLSTM-CNN Architecture With Dynamic Chunk-Based Max Pooling
    Guo, Fengyu
    He, Ruifang
    Dang, Jianwu
    IEEE ACCESS, 2019, 7 : 169281 - 169292