A Supervised Multi-Head Self-Attention Network for Nested Named Entity Recognition

Citations: 0
Authors
Xu, Yongxiu [1 ,2 ]
Huang, Heyan [3 ]
Feng, Chong [3 ]
Hu, Yue [1 ,2 ]
Affiliations
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
DOI: not available
Chinese Library Classification: TP18 [Artificial Intelligence Theory]
Discipline codes: 081104; 0812; 0835; 1405
Abstract
In recent years, researchers have shown increased interest in recognizing overlapping entities with nested structures. However, most existing models ignore the semantic correlations between words under different entity types. Since words in a sentence play different roles under different entity types, we argue that the correlation intensities of pairwise words should be modeled separately for each entity type. In this paper, we treat named entity recognition as a multi-class classification of word pairs and design a simple neural model to handle this issue. Our model applies a supervised multi-head self-attention mechanism, where each head corresponds to one entity type, to construct type-specific word-level correlations. The model can then flexibly predict a span's type from the correlation intensity between its head and tail words under the corresponding type. In addition, we fuse entity boundary detection and entity classification in a multi-task learning framework, which captures the dependencies between the two tasks. To verify the performance of our model, we conduct extensive experiments on both nested and flat datasets. The experimental results show that our model outperforms previous state-of-the-art methods on multiple tasks without any extra NLP tools or human annotations.
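The core idea of the abstract, one attention head per entity type scoring every (head word, tail word) pair, can be sketched as follows. This is a minimal NumPy illustration, not the paper's actual implementation: the function name, weight shapes, and scaled dot-product scoring are our own assumptions about how such a type-specific head could be realized.

```python
import numpy as np

def type_specific_span_scores(hidden, W_q, W_k):
    """Hypothetical sketch of supervised multi-head self-attention for span typing.

    Each entity type t gets its own attention head. The score for type t and
    word pair (i, j) is a scaled dot product between a type-specific query
    projection of word i (span head) and key projection of word j (span tail).

    hidden: (seq_len, d_model)        contextual word representations
    W_q, W_k: (num_types, d_model, d_head)  per-type projection weights
    returns: (num_types, seq_len, seq_len)  correlation intensities
    """
    q = np.einsum('sd,tdh->tsh', hidden, W_q)  # per-type queries
    k = np.einsum('sd,tdh->tsh', hidden, W_k)  # per-type keys
    d_head = W_q.shape[-1]
    # score[t, i, j]: how strongly words i and j form a type-t entity span
    return np.einsum('tih,tjh->tij', q, k) / np.sqrt(d_head)

# Toy usage with random weights (illustration only)
rng = np.random.default_rng(0)
seq_len, d_model, d_head, num_types = 6, 16, 8, 4
hidden = rng.normal(size=(seq_len, d_model))
W_q = rng.normal(size=(num_types, d_model, d_head))
W_k = rng.normal(size=(num_types, d_model, d_head))
scores = type_specific_span_scores(hidden, W_q, W_k)
print(scores.shape)  # one seq_len x seq_len score matrix per entity type
```

Because every type has a full pairwise score matrix, a span (i, j) can be assigned to whichever type's head gives it the highest intensity, which is what allows nested spans sharing words to receive different labels.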
Pages: 14185-14193 (9 pages)