Research on Named Entity Recognition Methods in Chinese Forest Disease Texts

被引:1
|
作者
Wang, Qi [1 ]
Su, Xiyou [1 ]
机构
[1] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 100083, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 08期
基金
中国国家自然科学基金;
关键词
disease; named entity recognition; multi-feature; transformer; bi-gated recurrent unit; CRF;
D O I
10.3390/app12083885
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Named entity recognition of forest diseases plays a key role in knowledge extraction in the field of forestry. The aim of this paper is to propose a named entity recognition method based on multi-feature embedding, a transformer encoder, a bi-gated recurrent unit (BiGRU), and conditional random fields (CRF). According to the characteristics of the forest disease corpus, several features are introduced here to improve the method's accuracy. In this paper, we analyze the characteristics of forest disease texts; carry out pre-processing, labeling, and extraction of multiple features; and construct forest disease texts. In the input representation layer, the method integrates multi-features, such as characters, radicals, word boundaries, and parts of speech. Then, implicit features (e.g., sentence context features) are captured through the transformer's encoding layer. The obtained features are transmitted to the BiGRU layer for further deep feature extraction. Finally, the CRF model is used to learn constraints and output the optimal annotation of disease names, damage sites, and drug entities in the forest disease texts. The experimental results on the self-built data set of forest disease texts show that the precision of the proposed method for entity recognition reached more than 93%, indicating that it can effectively solve the task of named entity recognition in forest disease texts.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [21] Named Entity Recognition to Detect Criminal Texts on the Web
    Skorzewski, Pawel
    Pieniowski, Mikolaj
    Demenko, Grazyna
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6223 - 6231
  • [22] An integrative approach to Chinese Named Entity recognition
    Huang, Degen
    Sun, Xiao
    ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 171 - +
  • [23] A hybrid approach for Chinese named entity recognition
    Fang, XS
    Sheng, HY
    DISCOVERY SCIENCE, PROCEEDINGS, 2002, 2534 : 297 - 301
  • [24] Product named entity recognition in Chinese text
    Jun Zhao
    Feifan Liu
    Language Resources and Evaluation, 2008, 42 : 197 - 217
  • [25] A hybrid model for Chinese named entity recognition
    Sun, Xiao
    Huang, Degen
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 232 - 237
  • [26] Chinese Data Extraction and Named Entity Recognition
    Yang, Tingwei
    Jiang, Daguang
    Shi, Shenghui
    Than, Siyan
    Zhuo, Lin
    Yin, Yukang
    Liang, Zheng
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (IEEE ICBDA 2020), 2020, : 105 - 109
  • [27] Bag of Tricks for Chinese Named Entity Recognition
    Xiao, Yao
    Peng, Jingbo
    Fu, Luoyi
    Zhang, Haisong
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [28] Multitask Learning for Chinese Named Entity Recognition
    Zhang, Qun
    Li, Zhenzhen
    Feng, Dawei
    Li, Dongsheng
    Huang, Zhen
    Peng, Yuxing
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 653 - 662
  • [29] Chinese named entity recognition: The state of the art
    Liu, Pan
    Guo, Yanming
    Wang, Fenglei
    Li, Guohui
    Neurocomputing, 2022, 473 : 37 - 53
  • [30] Product named entity recognition in Chinese text
    Zhao, Jun
    Liu, Feifan
    LANGUAGE RESOURCES AND EVALUATION, 2008, 42 (02) : 197 - 217