Research on Named Entity Recognition Methods in Chinese Forest Disease Texts

被引:1
|
作者
Wang, Qi [1 ]
Su, Xiyou [1 ]
机构
[1] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 100083, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 08期
基金
中国国家自然科学基金;
关键词
disease; named entity recognition; multi-feature; transformer; bi-gated recurrent unit; CRF;
D O I
10.3390/app12083885
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Named entity recognition of forest diseases plays a key role in knowledge extraction in the field of forestry. The aim of this paper is to propose a named entity recognition method based on multi-feature embedding, a transformer encoder, a bi-gated recurrent unit (BiGRU), and conditional random fields (CRF). According to the characteristics of the forest disease corpus, several features are introduced here to improve the method's accuracy. In this paper, we analyze the characteristics of forest disease texts; carry out pre-processing, labeling, and extraction of multiple features; and construct forest disease texts. In the input representation layer, the method integrates multi-features, such as characters, radicals, word boundaries, and parts of speech. Then, implicit features (e.g., sentence context features) are captured through the transformer's encoding layer. The obtained features are transmitted to the BiGRU layer for further deep feature extraction. Finally, the CRF model is used to learn constraints and output the optimal annotation of disease names, damage sites, and drug entities in the forest disease texts. The experimental results on the self-built data set of forest disease texts show that the precision of the proposed method for entity recognition reached more than 93%, indicating that it can effectively solve the task of named entity recognition in forest disease texts.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Named Entity Recognition for Social Media Texts with Semantic Augmentation
    Nie, Yuyang
    Tian, Yuanhe
    Wan, Xiang
    Yan Song
    Bo Dai
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1383 - 1391
  • [42] Named-entity recognition from Greek and English texts
    Karkaletsis, Vangelis
    Paliouras, Georgios
    Petasis, Georgios
    Manousopoulou, Natasa
    Spyropoulos, Constantine D.
    Journal of Intelligent and Robotic Systems: Theory and Applications, 1999, 26 (02): : 123 - 135
  • [43] Efficient methods for biomedical named entity recognition
    Chan, Shing-Kit
    Lam, Wai
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 729 - 735
  • [44] Research on Named Entity Recognition for Science and Technology Terms in Chinese Based on Dependent Entity Word Vector
    Lan, Yu
    Xu, Hongguang
    Xu, Ke
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2020, : 25 - 30
  • [45] Enhancing Entity Boundary Detection for Better Chinese Named Entity Recognition
    Chen, Chun
    Kong, Fang
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 20 - 25
  • [46] IBNNER: A Biaffine Model-Based Chinese Nested Named Entity Recognition Method for Medical Texts
    Lu, Ping
    Shao, Chongkun
    Deng, Shan
    Zeng, Jiaying
    Lin, Kaibiao
    IAENG International Journal of Computer Science, 2024, 51 (11) : 1686 - 1699
  • [47] Chinese clinical named entity recognition with variant neural structures based on BERT methods
    Li, Xiangyang
    Zhang, Huan
    Zhou, Xiao-Hua
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 107
  • [48] Application of Data Encryption in Chinese Named Entity Recognition
    Dong, Jikun
    Long, Kaifang
    Yu, Hui
    Xu, Weizhi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 99 - 111
  • [49] Chinese Named Entity Recognition Augmented with Lexicon Memory
    Zhou, Yi
    Zheng, Xiao-Qing
    Huang, Xuan-Jing
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (05) : 1021 - 1035
  • [50] Chinese Named Entity Recognition and Disambiguation Based on Wikipedia
    Yu Miao
    Lv Yajuan
    Liu Qun
    Su Jinsong
    Xiong Hao
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, 2012, 333 : 272 - 283