A Chinese Nested Named Entity Recognition Model for Chicken Disease Based on Multiple Fine-Grained Feature Fusion and Efficient Global Pointer

Cited by: 0
Authors
Wang, Xiajun [1, 2]
Peng, Cheng [1, 3, 4]
Li, Qifeng [1, 3, 4]
Yu, Qinyang [1, 3, 4]
Lin, Liqun [2]
Li, Pingping [1, 2]
Gao, Ronghua [1, 3, 4]
Wu, Wenbiao [1, 3, 4]
Jiang, Ruixiang [1, 3, 4]
Yu, Ligen [1, 3, 4]
Ding, Luyu [1, 3, 4]
Zhu, Lei [1, 3, 4]
Affiliations
[1] Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing 100097, Peoples R China
[2] Hubei Univ, Fac Resources & Environm Sci, Wuhan 430061, Peoples R China
[3] Natl Innovat Ctr Digital Technol Anim Husb, Beijing 100097, Peoples R China
[4] Natl Engn Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 18
Keywords
nested named entity recognition; chicken disease; multiple fine-grained feature fusion; RoBERTa; efficient global pointer; NETWORK
DOI
10.3390/app14188495
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Featured Application: This study proposes a multiple fine-grained nested named entity recognition model, which provides a solution for other specialized fields and lays the foundation for subsequent knowledge graph construction and intelligent inquiry system construction.

Abstract: Extracting entities from large volumes of chicken epidemic texts is crucial for knowledge sharing, integration, and application. However, named entity recognition (NER) faces significant challenges in this domain, particularly the prevalence of nested entities and domain-specific named entities, coupled with a scarcity of labeled data. To address these challenges, we compiled a corpus from 50 books on chicken diseases, covering 28 different disease types. Using this corpus, we constructed the CDNER dataset and developed a nested NER model, MFGFF-BiLSTM-EGP. This model integrates the multiple fine-grained feature fusion (MFGFF) module with a BiLSTM neural network and employs an efficient global pointer (EGP) to predict the entity location encoding. In the MFGFF module, we designed three encoders: the character encoder, word encoder, and sentence encoder. This design effectively captured fine-grained features and improved the recognition accuracy of nested entities. Experimental results showed that the model performed robustly, with F1 scores of 91.98%, 73.32%, and 82.54% on the CDNER, CMeEE V2, and CLUENER datasets, respectively, outperforming other commonly used NER models. Specifically, on the CDNER dataset, the model achieved an F1 score of 79.68% for nested entity recognition. This research not only advances the development of a knowledge graph and intelligent question-answering system for chicken diseases, but also provides a viable solution for extracting disease information that can be applied to other livestock species.
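The abstract says the EGP head predicts the entity location encoding, which is what lets one span sit inside another. The following is a minimal sketch of how a global-pointer-style span score table could be decoded into nested entities. It is not the authors' implementation: the function name `decode_spans`, the label names `disease` and `pathogen`, the example phrase, the hand-written scores, and the decision threshold of 0 are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# (entity type, start index, end index), with the end index inclusive
Span = Tuple[str, int, int]


def decode_spans(
    scores: Dict[str, List[List[float]]], threshold: float = 0.0
) -> List[Span]:
    """Return every (type, start, end) whose score exceeds the threshold.

    scores[label][start][end] is a hypothetical span score for one entity
    type. Only the upper triangle (start <= end) is inspected, and each
    cell is judged independently, so overlapping or nested spans can all
    be returned at once.
    """
    spans: List[Span] = []
    for label, matrix in scores.items():
        n = len(matrix)
        for start in range(n):
            for end in range(start, n):
                if matrix[start][end] > threshold:
                    spans.append((label, start, end))
    # Sort for a stable, readable order: by start, then end, then label.
    return sorted(spans, key=lambda s: (s[1], s[2], s[0]))


def empty_matrix(n: int, fill: float = -1.0) -> List[List[float]]:
    """Build an n x n table where every span starts below the threshold."""
    return [[fill] * n for _ in range(n)]


# Illustrative phrase (an assumption, not taken from the CDNER dataset).
tokens = list("鸡新城疫病毒")  # characters at indices 0..5

# Hand-written scores standing in for model output.
scores = {"disease": empty_matrix(len(tokens)), "pathogen": empty_matrix(len(tokens))}
scores["disease"][1][3] = 2.5   # "新城疫"
scores["pathogen"][1][5] = 3.1  # "新城疫病毒", which contains the disease span

spans = decode_spans(scores)
entities = [(label, "".join(tokens[s : e + 1])) for label, s, e in spans]
print(entities)
```

Because each (start, end) cell is scored separately per entity type, the inner span and the outer span are both recovered. A sequence-labeling scheme that assigns one tag per character cannot represent that overlap directly.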
Pages: 23
Related papers
49 in total
  • [31] Named Entity Recognition of Chinese Electronic Medical Records Based on Multi-Feature Fusion
    Sun, Zhen
    Li, Xinfu
    Computer Engineering and Applications, 2023, 59 (23) : 136 - 144
  • [32] Constructing fine-grained entity recognition corpora based on clinical records of traditional Chinese medicine
    Tingting Zhang
    Yaqiang Wang
    Xiaofeng Wang
    Yafei Yang
    Ying Ye
    BMC Medical Informatics and Decision Making, 20
  • [33] Named Entity Recognition Model of Power Equipment Based on Multi-feature Fusion
    Wu, Yun
    Ma, Xiangwen
    Yang, Jieming
    Wang, Anping
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2022, 13630 : 255 - 267
  • [34] Constructing fine-grained entity recognition corpora based on clinical records of traditional Chinese medicine
    Zhang, Tingting
    Wang, Yaqiang
    Wang, Xiaofeng
    Yang, Yafei
    Ye, Ying
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)
  • [35] Fine-Grained Car Recognition Model Based on Semantic DCNN Features Fusion
    Yang J.
    Cao H.
    Wang R.
    Xue L.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (01) : 141 - 157
  • [36] A Chinese Named Entity Recognition Model of Maintenance Records for Power Primary Equipment Based on Progressive Multitype Feature Fusion
    He, Lanfei
    Zhang, Xuefei
    Li, Zhiwei
    Xiao, Peng
    Wei, Ziming
    Cheng, Xu
    Qu, Shaocheng
    COMPLEXITY, 2022, 2022
  • [37] IBNNER: A Biaffine Model-Based Chinese Nested Named Entity Recognition Method for Medical Texts
    Lu, Ping
    Shao, Chongkun
    Deng, Shan
    Zeng, Jiaying
    Lin, Kaibiao
    IAENG International Journal of Computer Science, 2024, 51 (11) : 1686 - 1699
  • [38] The Feature Selection Based on CRFs Model for Chinese Named Entity Recognition in Micro-blog
    Li, Fang
    Du, Ya-Jun
    Zhao, Hong-Yuan
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMMUNICATION ENGINEERING (CSCE 2015), 2015 : 987 - 993
  • [39] An efficient fine-grained vehicle recognition method based on part-level feature optimization
    Lu, Lei
    Cai, Yancheng
    Huang, Hua
    Wang, Ping
    NEUROCOMPUTING, 2023, 536 : 40 - 49
  • [40] Chinese Medical Named Entity Recognition Based on Fusion of Global Features and Multi-Local Features
    Sun, Huarong
    Wang, Jianfeng
    Li, Bo
    Cao, Xiyuan
    Zang, Junbin
    Xue, Chenyang
    Zhang, Zhidong
    IEEE ACCESS, 2023, 11 : 137506 - 137520