Research on Named Entity Recognition Methods in Chinese Forest Disease Texts

被引:1
|
作者
Wang, Qi [1 ]
Su, Xiyou [1 ]
机构
[1] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 100083, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 08期
基金
中国国家自然科学基金;
关键词
disease; named entity recognition; multi-feature; transformer; bi-gated recurrent unit; CRF;
D O I
10.3390/app12083885
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Named entity recognition of forest diseases plays a key role in knowledge extraction in the field of forestry. The aim of this paper is to propose a named entity recognition method based on multi-feature embedding, a transformer encoder, a bi-gated recurrent unit (BiGRU), and conditional random fields (CRF). According to the characteristics of the forest disease corpus, several features are introduced here to improve the method's accuracy. In this paper, we analyze the characteristics of forest disease texts; carry out pre-processing, labeling, and extraction of multiple features; and construct forest disease texts. In the input representation layer, the method integrates multi-features, such as characters, radicals, word boundaries, and parts of speech. Then, implicit features (e.g., sentence context features) are captured through the transformer's encoding layer. The obtained features are transmitted to the BiGRU layer for further deep feature extraction. Finally, the CRF model is used to learn constraints and output the optimal annotation of disease names, damage sites, and drug entities in the forest disease texts. The experimental results on the self-built data set of forest disease texts show that the precision of the proposed method for entity recognition reached more than 93%, indicating that it can effectively solve the task of named entity recognition in forest disease texts.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] CLASSIFICATION ATTENTION FOR CHINESE NAMED ENTITY RECOGNITION
    Cong, Kai
    Wang, Yunpeng
    Li, Tao
    Xu, Yanbin
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2021, 22 (09) : 1675 - 1686
  • [32] Chinese named entity recognition: The state of the art
    Liu, Pan
    Guo, Yanming
    Wang, Fenglei
    Li, Guohui
    NEUROCOMPUTING, 2022, 473 : 37 - 53
  • [33] MoGCN: Mixture of Gated Convolutional Neural Network for Named Entity Recognition of Chinese Historical Texts
    Yan, Chengxi
    Su, Qi
    Wang, Jun
    IEEE ACCESS, 2020, 8 : 181629 - 181639
  • [34] A Named Entity Recognition Method Based on Knowledge Distillation and Efficient GlobalPointer for Chinese Medical Texts
    Zhai, Zhengwei
    Fan, Rongli
    Huang, Jie
    Xiong, Neal N.
    Zhang, Lijuan
    Wan, Jian
    Zhang, Lei
    IEEE ACCESS, 2024, 12 : 83563 - 83574
  • [35] Research on Chinese Named Entity Recognition Based on Lexical Information and Spatial Features
    Zhang, Zhipeng
    Liu, Shengquan
    Jian, Zhaorui
    Yin, Huixin
    APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [36] Research on Open Domain Named Entity Recognition Based on Chinese Query Logs
    Di, Yanxing
    WeiSong
    HanshiWang
    Liu, Lizhen
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 40 - 44
  • [37] Named entity recognition from Greek texts: The GIE project
    Karkaletsis, V
    Spyropoulos, CD
    Petasis, G
    ADVANCES IN INTELLIGENT SYSTEMS: CONCEPTS, TOOLS AND APPLICATIONS, 1999, 21 : 131 - 142
  • [38] Impact of translation on named-entity recognition in radiology texts
    Campos, Luis
    Pedro, Vasco
    Couto, Francisco
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2017,
  • [39] Named-Entity Recognition from Greek and English Texts
    Vangelis Karkaletsis
    Georgios Paliouras
    Georgios Petasis
    Natasa Manousopoulou
    Constantine D. Spyropoulos
    Journal of Intelligent and Robotic Systems, 1999, 26 : 123 - 135
  • [40] Named-entity recognition from Greek and English texts
    Karkaletsis, V
    Paliouras, G
    Petasis, G
    Manousopoulou, N
    Spyropoulos, CD
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 1999, 26 (02) : 123 - 135