Research on Named Entity Recognition Methods in Chinese Forest Disease Texts

被引:1
|
作者
Wang, Qi [1 ]
Su, Xiyou [1 ]
机构
[1] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 100083, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 08期
基金
中国国家自然科学基金;
关键词
disease; named entity recognition; multi-feature; transformer; bi-gated recurrent unit; CRF;
D O I
10.3390/app12083885
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Named entity recognition of forest diseases plays a key role in knowledge extraction in the field of forestry. The aim of this paper is to propose a named entity recognition method based on multi-feature embedding, a transformer encoder, a bi-gated recurrent unit (BiGRU), and conditional random fields (CRF). According to the characteristics of the forest disease corpus, several features are introduced here to improve the method's accuracy. In this paper, we analyze the characteristics of forest disease texts; carry out pre-processing, labeling, and extraction of multiple features; and construct forest disease texts. In the input representation layer, the method integrates multi-features, such as characters, radicals, word boundaries, and parts of speech. Then, implicit features (e.g., sentence context features) are captured through the transformer's encoding layer. The obtained features are transmitted to the BiGRU layer for further deep feature extraction. Finally, the CRF model is used to learn constraints and output the optimal annotation of disease names, damage sites, and drug entities in the forest disease texts. The experimental results on the self-built data set of forest disease texts show that the precision of the proposed method for entity recognition reached more than 93%, indicating that it can effectively solve the task of named entity recognition in forest disease texts.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [1] Survey of Chinese Named Entity Recognition Research
    Zhao, Jigui
    Qian, Yurong
    Wang, Kui
    Hou, Shuxiang
    Chen, Jiaying
    Computer Engineering and Applications, 2024, 60 (01) : 15 - 27
  • [2] Chinese Named Entity Recognition Methods Combined with Entity Boundary Cues
    Huang, Rong
    Chen, Yanping
    Hu, Ying
    Huang, Ruizhang
    Qin, Yongbin
    Computer Engineering and Applications, 2024, 60 (06) : 199 - 206
  • [3] Research on Chinese Named Entity Recognition in the Marine Field
    Cao, Xiaojuan
    Yang, Yongquan
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [4] Research on Chinese Named Entity Recognition Based on Ontology
    Chang, Weili
    Luo, Fang
    Qian, Jilai
    MECHANICAL ENGINEERING AND INTELLIGENT SYSTEMS, PTS 1 AND 2, 2012, 195-196 : 1180 - 1185
  • [5] Research of Chinese Named Entity Recognition Using GATE
    Cheng, Chen
    Cheng, Xianyi
    Hua, Jin
    BIOTECHNOLOGY, CHEMICAL AND MATERIALS ENGINEERING, PTS 1-3, 2012, 393-395 : 262 - 264
  • [6] Named Entity Recognition Experiments on Turkish Texts
    Kuecuek, Dilek
    Yazici, Adnan
    FLEXIBLE QUERY ANSWERING SYSTEMS: 8TH INTERNATIONAL CONFERENCE, FQAS 2009, 2009, 5822 : 524 - 535
  • [7] Analysis of Neural Network Modules for Named Entity Recognition of Chinese Medical Texts
    Yufeng D.
    Guoxiu H.
    Data Analysis and Knowledge Discovery, 2023, 7 (02) : 26 - 37
  • [8] EduNER: a Chinese named entity recognition dataset for education research
    Xu Li
    Chengkun Wei
    Zhuoren Jiang
    Wenlong Meng
    Fan Ouyang
    Zihui Zhang
    Wenzhi Chen
    Neural Computing and Applications, 2023, 35 : 17717 - 17731
  • [9] Named Entity Recognition for Digitised Historical Texts
    Grover, Claire
    Givon, Sharon
    Tobin, Richard
    Ball, Julian
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1343 - 1346
  • [10] EduNER: a Chinese named entity recognition dataset for education research
    Li, Xu
    Wei, Chengkun
    Jiang, Zhuoren
    Meng, Wenlong
    Ouyang, Fan
    Zhang, Zihui
    Chen, Wenzhi
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (24): : 17717 - 17731