Multi-Level Attention Based Coreference Resolution With Gated Recurrent Unit and Convolutional Neural Networks

被引:1
|
作者
Bianbadroma [1 ,2 ]
Ngodrup [1 ]
Zhao, Erping [1 ]
Wang, Yuhao [1 ]
Zhang, Yakun [1 ]
机构
[1] Xizang Minzu Univ, Coll Informat Engn, Xianyang, Peoples R China
[2] Tibet Net Cloud Scitech Co Ltd, Lhasa 850001, Xizang, Peoples R China
来源
IEEE ACCESS | 2023年 / 11卷
基金
中国国家自然科学基金;
关键词
Feature extraction; Semantics; Neural networks; Task analysis; Context modeling; Bit error rate; Convolutional neural networks; Mention diversity; coreference resolution; attention mechanism; GRU-CNN; data enhancement;
D O I
10.1109/ACCESS.2023.3234433
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Aiming at the diversity of the entity mentions in the field of animal husbandry in Tibet, the reference resolution model Att-GRU-CNN based on multi-level attention mechanism is proposed. The model uses GRU network for global semantic feature learning and knowledge memory, and uses CNN network to further extract local high-level semantic features. A word-level attention layer is added on the GRU hidden layer, the feature vector of entity mention does dot product operation with the feature vector of each word, and the result is normalized and used as the weight value distribution of the words, thus, the prior knowledge that the mention is the most important in the context is given to the network; A sentence-level attention layer is added on the CNN convolution layer, the convolutional layer output and entity name library matrix do association operation, and the result is used as the weight value distribution of the sentence, so as to strengthen the relation between the sentence where the mention is located and the entity scientific name. At the same time, the data enhancement technology is used to improve the generalization ability of the model. Finally, the ablation experiment on Tibetan animal husbandry dataset verified the effectiveness of each component of the model and the superposition effect after combination; the performance comparison experiment is carried out on public datasets MUC, B3 and CEAF(?4). The experimental results show that this model has a significant improvement over other models in the accuracy rate, recall rate and F1 score.
引用
收藏
页码:4895 / 4904
页数:10
相关论文
共 50 条
  • [41] New GRU from Convolutional Neural Network and Gated Recurrent Unit
    Atassi, A.
    El Azami, I.
    Sadiq, A.
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE, E-LEARNING AND INFORMATION SYSTEMS 2018 (DATA'18), 2018,
  • [42] Time series-based groundwater level forecasting using gated recurrent unit deep neural networks
    Lin, Haiping
    Gharehbaghi, Amin
    Zhang, Qian
    Band, Shahab S.
    Pai, Hao Ting
    Chau, Kwok-Wing
    Mosavi, Amir
    [J]. ENGINEERING APPLICATIONS OF COMPUTATIONAL FLUID MECHANICS, 2022, 16 (01) : 1655 - 1672
  • [43] Attention-Based Gated Convolutional Neural Networks for Distant Supervised Relation Extraction
    Li, Xingya
    Chen, Yufeng
    Xu, Jinan
    Zhang, Yujie
    [J]. CHINESE COMPUTATIONAL LINGUISTICS, CCL 2019, 2019, 11856 : 246 - 257
  • [44] Attention-Based Phonetic Convolutional Recurrent Neural Networks for Language Identification
    Gundluru, Ramesh
    Venkatesh, Vayyavuru
    Murty, K. Sri Rama
    [J]. 2021 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2021, : 475 - 480
  • [45] An Attention-Based Convolutional Recurrent Neural Networks for Scene Text Recognition
    Alshawi, Adil Abdullah Abdulhussein
    Tanha, Jafar
    Balafar, Mohammad Ali
    [J]. IEEE ACCESS, 2024, 12 : 8123 - 8134
  • [46] Emotion recognition based on convolutional gated recurrent units with attention
    Ye, Zhu
    Jing, Yuan
    Wang, Qinghua
    Li, Pengrui
    Liu, Zhihong
    Yan, Mingjing
    Zhang, Yongqing
    Gao, Dongrui
    [J]. CONNECTION SCIENCE, 2023, 35 (01)
  • [47] Convolutional Neural Network-Based Bidirectional Gated Recurrent Unit-Additive Attention Mechanism Hybrid Deep Neural Networks for Short-Term Traffic Flow Prediction
    Liu, Song
    Lin, Wenting
    Wang, Yue
    Yu, Dennis Z.
    Peng, Yong
    Ma, Xianting
    [J]. SUSTAINABILITY, 2024, 16 (05)
  • [48] Stock prediction based on bidirectional gated recurrent unit with convolutional neural network and feature selection
    Zhou, Qihang
    Zhou, Changjun
    Wang, Xiao
    [J]. PLOS ONE, 2022, 17 (02):
  • [49] Multi-Level Feature Abstraction from Convolutional Neural Networks for Multimodal Biometric Identification
    Soleymani, Sobhan
    Dabouei, Ali
    Kazemi, Hadi
    Dawson, Jeremy
    Nasrabadi, Nasser M.
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3469 - 3476
  • [50] Brain MRI detection and classification: Harnessing convolutional neural networks and multi-level thresholding
    Kamireddy, Rasool Reddy
    Kandala, Rajesh N. V. P. S.
    Dhuli, Ravindra
    Polinati, Srinivasu
    Sonti, Kamesh
    Tadeusiewicz, Ryszard
    Plawiak, Pawel
    [J]. PLOS ONE, 2024, 19 (08):