A multi-head adjacent attention-based pyramid layered model for nested named entity recognition

被引:4
|
作者
Cui, Shengmin [1 ]
Joe, Inwhee [1 ]
机构
[1] Hanyang Univ, Dept Comp Sci, 222 Wangsimni Ro, Seoul 04763, South Korea
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 03期
关键词
Nested named entity recognition; Named entity recognition; Attention; Pyramid; Natural language processing; EXTRACTION;
D O I
10.1007/s00521-022-07747-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition (NER) is one of the widely studied natural language processing tasks in recent years. Conventional solutions treat the NER as a sequence labeling problem, but these approaches cannot handle nested NER. This is due to the fact that nested NER refers to the case where one entity contains another entity and it is not feasible to tag each token with a single tag. The pyramid model stacks L flat NER layers for prediction, which subtly enumerates all spans with length less than or equal to L. However, the original model introduces a block consisting of a convolutional layer and a bidirectional long short-term memory (Bi-LSTM) layer as the decoder, which does not consider the dependency between adjacent inputs and the Bi-LSTM cannot perform parallel computation on sequential inputs. For the purpose of improving performance and reducing the forward computation, we propose a Multi-Head Adjacent Attention-based Pyramid Layered model. In addition, when constructing a pyramid structure for span representation, the information of the intermediate words has more proportion than words on the two sides. To address this imbalance in the span representation, we fuse the output of the attention layer with the features of head and tail words when doing classification. We conducted experiments on nested NER datasets such as GENIA, SciERC, and ADE to validate the effectiveness of our proposed model.
引用
收藏
页码:2561 / 2574
页数:14
相关论文
共 50 条
  • [41] Named Entity Recognition Using EHealth-BiLSTM-CRF Combine with Multi-head Self-attention for Chinese Medical Information
    Wang, Bin
    Jiang, Fangjiao
    WEB INFORMATION SYSTEMS AND APPLICATIONS, WISA 2024, 2024, 14883 : 451 - 462
  • [42] An Attention-Based ID-CNNs-CRF Model for Named Entity Recognition on Clinical Electronic Medical Records
    Gao, Ming
    Xiao, Qifeng
    Wu, Shaochun
    Deng, Kun
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 231 - 242
  • [43] A Model for Sea Ice Segmentation based on Feature Pyramid Network and Multi-head Self-attention
    Xu, Yuanxiang
    Feng, Yuan
    Song, Shengyu
    Liu, Jiahao
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 97 - 102
  • [44] Deep Learning Based Mobilenet and Multi-Head Attention Model for Facial Expression Recognition
    Nouisser, Aicha
    Zouari, Ramzi
    Kherallah, Monji
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 485 - 491
  • [45] Temporal Residual Network Based Multi-Head Attention Model for Arabic Handwriting Recognition
    Zouari, Ramzi
    Othmen, Dalila
    Boubaker, Houcine
    Kherallah, Monji
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 469 - 476
  • [46] HTLinker: A Head-to-Tail Linker for Nested Named Entity Recognition
    Li, Xiang
    Yang, Junan
    Liu, Hui
    Hu, Pengjiang
    SYMMETRY-BASEL, 2021, 13 (09):
  • [47] Intelligent Bearing Fault Diagnosis Using Multi-Head Attention-Based CNN
    Wang, Hui
    Xu, Jiawen
    Yan, Ruqiang
    Sun, Chuang
    Chen, Xuefeng
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON THROUGH-LIFE ENGINEERING SERVICES (TESCONF 2019), 2020, 49 : 112 - 118
  • [48] Multi-Head Structural Attention-Based Vision Transformer with Sequential Views for 3D Object Recognition
    Bao, Jianjun
    Luo, Ke
    Kou, Qiqi
    He, Liang
    Zhao, Guo
    APPLIED SCIENCES-BASEL, 2025, 15 (06):
  • [49] A segment enhanced span-based model for nested named entity recognition
    Li, Fei
    Wang, Zheng
    Hui, Siu Cheung
    Liao, Lejian
    Zhu, Xinhua
    Huang, Heyan
    NEUROCOMPUTING, 2021, 465 : 26 - 37
  • [50] Span-Based Nested Named Entity Recognition with Pretrained Language Model
    Liu, Chenxu
    Fan, Hongjie
    Liu, Junfei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 : 620 - 628