A New Chinese Named Entity Recognition Method for Pig Disease Domain Based on Lexicon-Enhanced BERT and Contrastive Learning

被引:0
|
作者
Peng, Cheng [1 ,2 ,3 ]
Wang, Xiajun [1 ,4 ]
Li, Qifeng [1 ,2 ,3 ]
Yu, Qinyang [1 ,2 ,3 ]
Jiang, Ruixiang [1 ,2 ,3 ]
Ma, Weihong [1 ,2 ,3 ]
Wu, Wenbiao [1 ,2 ,3 ]
Meng, Rui [1 ,2 ,3 ]
Li, Haiyan [1 ,2 ,3 ]
Huai, Heju [1 ,2 ,3 ]
Wang, Shuyan [1 ,2 ,3 ]
He, Longjuan [5 ]
机构
[1] Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing 100097, Peoples R China
[2] Natl Innovat Ctr Digital Technol Anim Husb, Beijing 100097, Peoples R China
[3] Natl Engn Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China
[4] Hubei Univ, Fac Resources & Environm Sci, Wuhan 430061, Peoples R China
[5] Chinese Acad Agr Sci, Inst Agr Econ & Dev, Beijing 100081, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 16期
关键词
pig disease; Chinese named entity recognition; lexicon-enhanced BERT; contrastive learning; small sample;
D O I
10.3390/app14166944
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application Our work provides reliable technical support for the information extraction of pig diseases in Chinese . It can be applied to other domain - specific fields, thereby facilitating seamless adaptation for named entity identification across diverse contexts .Abstract Named Entity Recognition (NER) is a fundamental and pivotal stage in the development of various knowledge-based support systems, including knowledge retrieval and question-answering systems. In the domain of pig diseases, Chinese NER models encounter several challenges, such as the scarcity of annotated data, domain-specific vocabulary, diverse entity categories, and ambiguous entity boundaries. To address these challenges, we propose PDCNER, a Pig Disease Chinese Named Entity Recognition method leveraging lexicon-enhanced BERT and contrastive learning. Firstly, we construct a domain-specific lexicon and pre-train word embeddings in the pig disease domain. Secondly, we integrate lexicon information of pig diseases into the lower layers of BERT using a Lexicon Adapter layer, which employs char-word pair sequences. Thirdly, to enhance feature representation, we propose a lexicon-enhanced contrastive loss layer on top of BERT. Finally, a Conditional Random Field (CRF) layer is employed as the model's decoder. Experimental results show that our proposed model demonstrates superior performance over several mainstream models, achieving a precision of 87.76%, a recall of 86.97%, and an F1-score of 87.36%. The proposed model outperforms BERT-BiLSTM-CRF and LEBERT by 14.05% and 6.8%, respectively, with only 10% of the samples available, showcasing its robustness in data scarcity scenarios. Furthermore, the model exhibits generalizability across publicly available datasets. Our work provides reliable technical support for the information extraction of pig diseases in Chinese and can be easily extended to other domains, thereby facilitating seamless adaptation for named entity identification across diverse contexts.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Chinese clinical named entity recognition with variant neural structures based on BERT methods
    Li, Xiangyang
    Zhang, Huan
    Zhou, Xiao-Hua
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 107
  • [32] A Chinese BERT-Based Dual-Channel Named Entity Recognition Method for Solid Rocket Engines
    Zheng, Zhiqiang
    Liu, Minghao
    Weng, Zhi
    ELECTRONICS, 2023, 12 (03)
  • [33] A Chinese named entity recognition method for landslide geological disasters based on deep learning
    Yang, Banghui
    Zhou, Chunlei
    Li, Suju
    Wang, Yuzhu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 139
  • [34] A Radical-Based Method for Chinese Named Entity Recognition
    Wu, Yuefei
    Wei, Xiao
    Qin, Yongbin
    Chen, Yanping
    PROCEEDINGS OF 2019 2ND INTERNATIONAL CONFERENCE ON BIG DATA TECHNOLOGIES (ICBDT 2019), 2019, : 125 - 130
  • [35] Leveraging Integrated Learning for Open-Domain Chinese Named Entity Recognition
    Diao J.
    Zhou Z.
    Shi G.
    International Journal of Crowd Science, 2022, 6 (02) : 74 - 79
  • [36] Enhanced Chinese named entity recognition with multi-granularity BERT adapter and efficient global pointer
    Lei Zhang
    Pengfei Xia
    Xiaoxuan Ma
    Chengwei Yang
    Xin Ding
    Complex & Intelligent Systems, 2024, 10 : 4473 - 4491
  • [37] Enhanced Chinese named entity recognition with multi-granularity BERT adapter and efficient global pointer
    Zhang, Lei
    Xia, Pengfei
    Ma, Xiaoxuan
    Yang, Chengwei
    Ding, Xin
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 4473 - 4491
  • [38] Research on Chinese Semantic Named Entity Recognition in Marine Engine Room Systems Based on BERT
    Shen, Henglong
    Cao, Hui
    Sun, Guangxi
    Chen, Dong
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (07)
  • [39] Named Entity Recognition Method for Power Equipment Based on BERT-BiLSTM-CRF
    Hu, Jiangyi
    Yang, Wenqing
    Yang, Huafei
    Wei, Shanming
    Sun, Zhen
    2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, : 694 - 699
  • [40] Chinese Medical Named Entity Recognition based on Expert Knowledge and Fine-tuning Bert
    Zhang, Bofeng
    Yao, Xiuhong
    Li, Haiyan
    Aini, Mirensha
    2023 IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH, ICKG, 2023, : 84 - 90