Deep purified feature mining model for joint named entity recognition and relation extraction

被引:10
|
作者
Wang, Youwei [1 ]
Wang, Ying [1 ]
Sun, Zhongchuan [1 ]
Li, Yinghao [2 ]
Hu, Shizhe [1 ]
Ye, Yangdong [1 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
[2] Zhengzhou Univ, Sch Cyber Sci & Engn, Zhengzhou 450001, Peoples R China
基金
中国国家自然科学基金;
关键词
Named entity recognition Relation extraction Purified features Information bottleneck; NETWORK;
D O I
10.1016/j.ipm.2023.103511
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Table filling based joint named entity recognition and relation extraction task aims to share representation of subtasks in a table to extract structured knowledge. However, most of existing studies need additional labels and dedicated deep neural networks to learn shared representation, imposing heavy burdens to decoders. More seriously, almost all these models suffer from feature confusion problem, failing to capture purified task-specific features from shared representation to perform subtasks. To address these challenging problems, in this paper we propose a novel and effective Deep puRified fEAture Mining (DREAM) model for joint named entity recognition and relation extraction task, which can automatically capture purified task-specific features to improve the classification performance of subtasks. Specifically, unlike introducing additional labels or dedicated network architectures, we design a new lightweight shared representation learning (LSRL) module by the plainest labels of joint task and thus encodes context by the hybrid convolutional neural networks. Afterwards, a task -aware information bottleneck (TIB) module is proposed to explore the relation between the mutual information of the joint distribution of each subtask and its task-specific features. With the above two modules well obtain shared representation and purified task-specific features, the satisfactory classification results of both subtasks can be guaranteed. Experiment results show that the proposed model is highly effective, obtaining the promising results on three different benchmarks: CoNNL04 (general text), ADE (biomedical text) and SciERC (scientific text). For example, DREAM respectively achieves F1-scores of 78.18%, 80.28% and 44.60% in performing the relation extraction subtask on the CoNNL04, ADE and SciERC datasets. The promising performance indicates that the proposed model can be applied to many practical applications such as biomedical information extraction. The source code is publicly available at https://github.com/SWT-AITeam/DREAM.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Named Entity Relation Mining Using Wikipedia
    Iftene, Adrian
    Balahur-Dobrescu, Alexandra
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 763 - 766
  • [22] Character Feature Learning for Named Entity Recognition
    Zeng, Ping
    Tan, Qingping
    Zhang, Haoyu
    Meng, Xiankai
    Zhang, Zhuo
    Xu, Jianjun
    Lei, Yan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (07) : 1811 - 1815
  • [23] Better Feature Integration for Named Entity Recognition
    Xu, Lu
    Jie, Zhanming
    Lu, Wei
    Bing, Lidong
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3457 - 3469
  • [24] Feature Importance for Biomedical Named Entity Recognition
    Huggard, Hamish
    Zhang, Aaron
    Zhang, Edmond
    Koh, Yun Sing
    AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 406 - 417
  • [25] Multimodal Named Entity Recognition and Relation Extraction with Retrieval-Augmented Strategy
    Hu, Xuming
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3488 - 3488
  • [26] A Joint Model for Named Entity Recognition With Sentence-Level Entity Type Attentions
    Qian, Tao
    Zhang, Meishan
    Lou, Yinxia
    Hua, Daiwen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1438 - 1448
  • [27] An ERNIE-Based Joint Model for Chinese Named Entity Recognition
    Wang, Yu
    Sun, Yining
    Ma, Zuchang
    Gao, Lisheng
    Xu, Yang
    APPLIED SCIENCES-BASEL, 2020, 10 (16):
  • [28] Joint Speech Translation and Named Entity Recognition
    Gaido, Marco
    Papi, Sara
    Negri, Matteo
    Turchi, Marco
    INTERSPEECH 2023, 2023, : 47 - 51
  • [29] Chinese Data Extraction and Named Entity Recognition
    Yang, Tingwei
    Jiang, Daguang
    Shi, Shenghui
    Than, Siyan
    Zhuo, Lin
    Yin, Yukang
    Liang, Zheng
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (IEEE ICBDA 2020), 2020, : 105 - 109
  • [30] A Federated Named Entity Recognition Model with Explicit Relation for Power Grid
    Luo, Jingtang
    Yao, Shiying
    Zhao, Changming
    Xu, Jie
    Feng, Jim
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (02): : 4207 - 4216