MiDTD: A Simple and Effective Distillation Framework for Distantly Supervised Relation Extraction

被引:6
|
作者
Li, Rui [1 ]
Yang, Cheng [1 ]
Li, Tingwei [1 ]
Su, Sen [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Comp Sci, Natl Pilot Software Engn Sch, 10 Xi Tucheng Rd, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Natural language processing; NLP; knowledge distillation; distant supervision; neural network; multi-instance learning; label softening;
D O I
10.1145/3503917
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Relation extraction (RE), an important information extraction task, faced the great challenge brought by limited annotation data. To this end, distant supervision was proposed to automatically label RE data, and thus largely increased the number of annotated instances. Unfortunately, lots of noise relation annotations brought by automatic labeling become a new obstacle. Some recent studies have shown that the teacher-student framework of knowledge distillation can alleviate the interference of noise relation annotations via label softening. Nevertheless, we find that they still suffer from two problems: propagation of inaccurate dark knowledge and constraint of a unified distillation temperature. In this article, we propose a simple and effective Multi-instance Dynamic Temperature Distillation (MiDTD) framework, which is model-agnostic and mainly involves two modules: multi-instance target fusion (MiTF) and dynamic temperature regulation (DTR). MiTF combines the teacher's predictions for multiple sentences with the same entity pair to amend the inaccurate dark knowledge in each student's target. DTR allocates alterable distillation temperatures to different training instances to enable the softness of most student's targets to be regulated to a moderate range. In experiments, we construct three concrete MiDTD instantiations with BERT, PCNN, and BiLSTM-based RE models, and the distilled students significantly outperform their teachers and the state-of-the-art (SOTA) methods.
引用
收藏
页数:32
相关论文
共 50 条
  • [21] Hierarchical Knowledge Transfer Network for Distantly Supervised Relation Extraction
    Song, Wei
    Gu, Weishuai
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [22] Denoising by Markov Random Filed in Distantly Supervised Relation Extraction
    Li, Yameng
    Liu, Ruifang
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1525 - 1529
  • [23] Exploring Long Tail Data in Distantly Supervised Relation Extraction
    Gui, Yaocheng
    Liu, Qian
    Zhu, Man
    Gao, Zhiqiang
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 514 - 522
  • [24] Knowledge-embodied attention for distantly supervised relation extraction
    Deng, Kejun
    Zhang, Xuemiao
    Ye, Songtao
    Liu, Junfei
    INTELLIGENT DATA ANALYSIS, 2020, 24 (02) : 445 - 457
  • [25] Improving Distantly Supervised Relation Extraction by Natural Language Inference
    Zhou, Kang
    Qiao, Qiao
    Li, Yuepei
    Li, Qi
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 14047 - 14055
  • [26] ReadsRE: Retrieval-Augmented Distantly Supervised Relation Extraction
    Zhang, Yue
    Fei, Hongliang
    Li, Ping
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2257 - 2262
  • [27] Interaction-and-Response Network for Distantly Supervised Relation Extraction
    Song, Wei
    Gu, Weishuai
    Zhu, Fuxin
    Park, Soon Cheol
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9523 - 9537
  • [28] Empower Distantly Supervised Relation Extraction with Collaborative Adversarial Training
    Chen, Tao
    Shi, Haochen
    Liu, Liyuan
    Tang, Siliang
    Shao, Jian
    Chen, Zhigang
    Zhuang, Yueting
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 12675 - 12682
  • [29] Integrating External Entity Knowledge for Distantly Supervised Relation Extraction
    Gao J.
    Wan H.
    Lin Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (12): : 2794 - 2802
  • [30] Distantly Supervised Relation Extraction via Contextual Information Interaction and Relation Embeddings
    Yin, Huixin
    Liu, Shengquan
    Jian, Zhaorui
    SYMMETRY-BASEL, 2023, 15 (09):