A Chinese Nested Named Entity Recognition Model for Chicken Disease Based on Multiple Fine-Grained Feature Fusion and Efficient Global Pointer

被引:0
|
作者
Wang, Xiajun [1 ,2 ]
Peng, Cheng [1 ,3 ,4 ]
Li, Qifeng [1 ,3 ,4 ]
Yu, Qinyang [1 ,3 ,4 ]
Lin, Liqun [2 ]
Li, Pingping [1 ,2 ]
Gao, Ronghua [1 ,3 ,4 ]
Wu, Wenbiao [1 ,3 ,4 ]
Jiang, Ruixiang [1 ,3 ,4 ]
Yu, Ligen [1 ,3 ,4 ]
Ding, Luyu [1 ,3 ,4 ]
Zhu, Lei [1 ,3 ,4 ]
机构
[1] Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing 100097, Peoples R China
[2] Hubei Univ, Fac Resources & Environm Sci, Wuhan 430061, Peoples R China
[3] Natl Innovat Ctr Digital Technol Anim Husb, Beijing 100097, Peoples R China
[4] Natl Engn Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 18期
关键词
nested named entity recognition; chicken disease; multiple fine-grained feature fusion; RoBERTa; efficient global pointer; NETWORK;
D O I
10.3390/app14188495
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application This study proposes a multiple fine-grained nested named entity recognition model, which provides a solution for other specialized fields and lays the foundation for subsequent knowledge graph construction and intelligent inquiry system construction.Abstract Extracting entities from large volumes of chicken epidemic texts is crucial for knowledge sharing, integration, and application. However, named entity recognition (NER) encounters significant challenges in this domain, particularly due to the prevalence of nested entities and domain-specific named entities, coupled with a scarcity of labeled data. To address these challenges, we compiled a corpus from 50 books on chicken diseases, covering 28 different disease types. Utilizing this corpus, we constructed the CDNER dataset and developed a nested NER model, MFGFF-BiLSTM-EGP. This model integrates the multiple fine-grained feature fusion (MFGFF) module with a BiLSTM neural network and employs an efficient global pointer (EGP) to predict the entity location encoding. In the MFGFF module, we designed three encoders: the character encoder, word encoder, and sentence encoder. This design effectively captured fine-grained features and improved the recognition accuracy of nested entities. Experimental results showed that the model performed robustly, with F1 scores of 91.98%, 73.32%, and 82.54% on the CDNER, CMeEE V2, and CLUENER datasets, respectively, outperforming other commonly used NER models. Specifically, on the CDNER dataset, the model achieved an F1 score of 79.68% for nested entity recognition. This research not only advances the development of a knowledge graph and intelligent question-answering system for chicken diseases, but also provides a viable solution for extracting disease information that can be applied to other livestock species.
引用
收藏
页数:23
相关论文
共 49 条
  • [1] Chinese Fine-Grained Named Entity Recognition Based on BILTAR and GlobalPointer Modules
    Li, Weijun
    Liu, Jintong
    Gao, Yuxiao
    Zhang, Xinyong
    Gu, Jianlai
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [2] A Model for Chinese Named Entity Recognition Based on Global Pointer and Adversarial Learning
    ZHANG Yangsen
    LI Jianlong
    XIN Yonghui
    ZHAO Xiquan
    LIU Yang
    Chinese Journal of Electronics, 2023, 32 (04) : 854 - 867
  • [3] A Model for Chinese Named Entity Recognition Based on Global Pointer and Adversarial Learning
    Zhang Yangsen
    Li Jianlong
    Xin Yonghui
    Zhao Xiquan
    Liu Yang
    CHINESE JOURNAL OF ELECTRONICS, 2023, 32 (04) : 854 - 867
  • [4] Chinese Fine-Grained Geological Named Entity Recognition With Rules and FLAT
    Chen, Siying
    Hua, Weihua
    Liu, Xiuguo
    Deng, Xiaotong
    Zeng, Xinling
    Duan, Jianchao
    EARTH AND SPACE SCIENCE, 2022, 9 (12)
  • [5] Fine-Grained Chinese Named Entity Recognition Based on MacBERT-Attn-BiLSTM-CRF Model
    Wang, Jueyang
    Li, Shuzhen
    Agyemang-Duah, Edward
    Feng, Xingyu
    Xu, Chun
    Ji, Yuao
    Liu, Junqiang
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 125 - 131
  • [6] RoBGP: A Chinese Nested Biomedical Named Entity Recognition ModelBased on RoBERTa and Global Pointer
    Cui, Xiaohui
    Song, Chao
    Li, Dongmei
    Qu, Xiaolong
    Long, Jiao
    Yang, Yu
    Zhang, Hanchao
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (03): : 3603 - 3618
  • [7] Named entity recognition for Chinese based on global pointer and adversarial training
    Li, Hongjun
    Cheng, Mingzhe
    Yang, Zelin
    Yang, Liqun
    Chua, Yansong
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [8] Named entity recognition for Chinese based on global pointer and adversarial training
    Hongjun Li
    Mingzhe Cheng
    Zelin Yang
    Liqun Yang
    Yansong Chua
    Scientific Reports, 13
  • [9] Named Entity Recognition Model Based on Feature Fusion
    Sun, Zhen
    Li, Xinfu
    INFORMATION, 2023, 14 (02)
  • [10] Noun-based attention mechanism for Fine-grained Named Entity Recognition
    Rodriguez, Alejandro Jesus Castaneira
    Castro, Daniel Castro
    Herold Garcia, Silena
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193