A multi-head adjacent attention-based pyramid layered model for nested named entity recognition

被引:3
|
作者
Cui, Shengmin [1 ]
Joe, Inwhee [1 ]
机构
[1] Hanyang Univ, Dept Comp Sci, 222 Wangsimni Ro, Seoul 04763, South Korea
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 03期
关键词
Nested named entity recognition; Named entity recognition; Attention; Pyramid; Natural language processing; EXTRACTION;
D O I
10.1007/s00521-022-07747-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition (NER) is one of the widely studied natural language processing tasks in recent years. Conventional solutions treat the NER as a sequence labeling problem, but these approaches cannot handle nested NER. This is due to the fact that nested NER refers to the case where one entity contains another entity and it is not feasible to tag each token with a single tag. The pyramid model stacks L flat NER layers for prediction, which subtly enumerates all spans with length less than or equal to L. However, the original model introduces a block consisting of a convolutional layer and a bidirectional long short-term memory (Bi-LSTM) layer as the decoder, which does not consider the dependency between adjacent inputs and the Bi-LSTM cannot perform parallel computation on sequential inputs. For the purpose of improving performance and reducing the forward computation, we propose a Multi-Head Adjacent Attention-based Pyramid Layered model. In addition, when constructing a pyramid structure for span representation, the information of the intermediate words has more proportion than words on the two sides. To address this imbalance in the span representation, we fuse the output of the attention layer with the features of head and tail words when doing classification. We conducted experiments on nested NER datasets such as GENIA, SciERC, and ADE to validate the effectiveness of our proposed model.
引用
收藏
页码:2561 / 2574
页数:14
相关论文
共 50 条
  • [1] A multi-head adjacent attention-based pyramid layered model for nested named entity recognition
    Shengmin Cui
    Inwhee Joe
    Neural Computing and Applications, 2023, 35 : 2561 - 2574
  • [2] A Supervised Multi-Head Self-Attention Network for Nested Named Entity Recognition
    Xu, Yongxiu
    Huang, Heyan
    Feng, Chong
    Hu, Yue
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14185 - 14193
  • [3] Fast Neural Chinese Named Entity Recognition with Multi-head Self-attention
    Qi, Tao
    Wu, Chuhan
    Wu, Fangzhao
    Ge, Suyu
    Liu, Junxin
    Huang, Yongfeng
    Xie, Xing
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING, 2019, 1134 : 98 - 110
  • [4] Adversarial Transfer Learning for Named Entity Recognition Based on Multi-Head Attention Mechanism and Feature Fusion
    Zhao, Dandan
    Zhang, Pan
    Meng, Jiana
    Wu, Yue
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 272 - 284
  • [5] Attention-based Multi-level Feature Fusion for Named Entity Recognition
    Yang, Zhiwei
    Chen, Hechang
    Zhang, Jiawei
    Ma, Jing
    Chang, Yi
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3594 - 3600
  • [6] AERNs: Attention-Based Entity Region Networks for Multi-Grained Named Entity Recognition
    Dai, Jianghai
    Feng, Chong
    Bai, Xuefeng
    Dai, Jinming
    Zhang, Huanhuan
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 408 - 415
  • [7] A Named Entity Recognition Model for Chinese Electricity Violation Descriptions Based on Word-Character Fusion and Multi-Head Attention Mechanisms
    Meng, Lingwen
    Wang, Yulin
    Huang, Yuanjun
    Ma, Dingli
    Zhu, Xinshan
    Zhang, Shumei
    Energies, 2025, 18 (02)
  • [8] A Controlled Attention for Nested Named Entity Recognition
    Chen, Yanping
    Huang, Rong
    Pan, Lijun
    Huang, Ruizhang
    Zheng, Qinghua
    Chen, Ping
    COGNITIVE COMPUTATION, 2023, 15 (01) : 132 - 145
  • [9] A Controlled Attention for Nested Named Entity Recognition
    Yanping Chen
    Rong Huang
    Lijun Pan
    Ruizhang Huang
    Qinghua Zheng
    Ping Chen
    Cognitive Computation, 2023, 15 : 132 - 145
  • [10] Named Entity Recognition in Electronic Medical Records Incorporating Pre-trained and Multi-Head Attention
    Yang, Haotian
    Wang, Li
    Yang, Yanpeng
    IAENG International Journal of Computer Science, 2024, 51 (04) : 401 - 408