Deep purified feature mining model for joint named entity recognition and relation extraction

被引:10
|
作者
Wang, Youwei [1 ]
Wang, Ying [1 ]
Sun, Zhongchuan [1 ]
Li, Yinghao [2 ]
Hu, Shizhe [1 ]
Ye, Yangdong [1 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
[2] Zhengzhou Univ, Sch Cyber Sci & Engn, Zhengzhou 450001, Peoples R China
基金
中国国家自然科学基金;
关键词
Named entity recognition Relation extraction Purified features Information bottleneck; NETWORK;
D O I
10.1016/j.ipm.2023.103511
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Table filling based joint named entity recognition and relation extraction task aims to share representation of subtasks in a table to extract structured knowledge. However, most of existing studies need additional labels and dedicated deep neural networks to learn shared representation, imposing heavy burdens to decoders. More seriously, almost all these models suffer from feature confusion problem, failing to capture purified task-specific features from shared representation to perform subtasks. To address these challenging problems, in this paper we propose a novel and effective Deep puRified fEAture Mining (DREAM) model for joint named entity recognition and relation extraction task, which can automatically capture purified task-specific features to improve the classification performance of subtasks. Specifically, unlike introducing additional labels or dedicated network architectures, we design a new lightweight shared representation learning (LSRL) module by the plainest labels of joint task and thus encodes context by the hybrid convolutional neural networks. Afterwards, a task -aware information bottleneck (TIB) module is proposed to explore the relation between the mutual information of the joint distribution of each subtask and its task-specific features. With the above two modules well obtain shared representation and purified task-specific features, the satisfactory classification results of both subtasks can be guaranteed. Experiment results show that the proposed model is highly effective, obtaining the promising results on three different benchmarks: CoNNL04 (general text), ADE (biomedical text) and SciERC (scientific text). For example, DREAM respectively achieves F1-scores of 78.18%, 80.28% and 44.60% in performing the relation extraction subtask on the CoNNL04, ADE and SciERC datasets. The promising performance indicates that the proposed model can be applied to many practical applications such as biomedical information extraction. The source code is publicly available at https://github.com/SWT-AITeam/DREAM.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Deep Span Representations for Named Entity Recognition
    Zhu, Enwei
    Liu, Yiyang
    Li, Jinpeng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10565 - 10582
  • [42] Turkish Named Entity Recognition with Deep Learning
    Gunes, Asim
    Tantug, A. Cuneyd
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [43] Improved Deep Persian Named Entity Recognition
    Bokaei, Mohammad Hadi
    Mahmoudi, Maryam
    2018 9TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2018, : 381 - 386
  • [44] Deep learning for named entity recognition: a survey
    Hu Z.
    Hou W.
    Liu X.
    Neural Comput. Appl., 16 (8995-9022): : 8995 - 9022
  • [45] A Deep Learning Solution to Named Entity Recognition
    Murthy, V. Rudra
    Bhattacharyya, Pushpak
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 427 - 438
  • [46] Named Entity Recognition and Relation Extraction with Graph Neural Networks in Semi Structured Documents
    Carbonell, Manuel
    Riba, Pau
    Villegas, Mauricio
    Fornes, Alicia
    Llados, Josep
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9622 - 9627
  • [47] A novel feature integration method for named entity recognition model in product titles
    Sun, Shiqi
    Li, Jingyuan
    Zhang, Kun
    Sun, Xinghang
    Cen, Jianhe
    Wang, Yuanzhuo
    COMPUTATIONAL INTELLIGENCE, 2024, 40 (03)
  • [48] BiodivNERE: Gold standard corpora for named entity recognition and relation extraction in the biodiversity domain
    Abdelmageed, Nora
    Loeffler, Felicitas
    Feddoul, Leila
    Algergawy, Alsayed
    Samuel, Sheeba
    Gaikwad, Jitendra
    Kazem, Anahita
    Koenig-Ries, Birgitta
    BIODIVERSITY DATA JOURNAL, 2022, 10
  • [49] A French Corpus and Annotation Schema for Named Entity Recognition and Relation Extraction of Financial News
    Jabbari, Ali
    Sauvage, Olivier
    Zeine, Hamada
    Chergui, Hamza
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2293 - 2299
  • [50] Annotation Scheme for Named Entity Recognition and Relation Extraction Tasks in the Domain of People with Dementia
    Suravee, Sumaiya
    Stoev, Teodor
    Schindler, David
    Hochgraeber, Iris
    Pinkert, Christiane
    Holle, Bernhard
    Halek, Margareta
    Krueger, Frank
    Yordanova, Kristina
    2022 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2022,