Medical Named Entity Recognition Fusing Part-of-Speech and Stroke Features

被引:4
|
作者
Yi, Fen [1 ]
Liu, Hong [1 ]
Wang, You [2 ]
Wu, Sheng [3 ]
Sun, Cheng [4 ]
Feng, Peng [3 ]
Zhang, Jin [1 ,3 ]
机构
[1] Hunan Normal Univ, Coll Informat Sci & Engn, Changsha 410081, Peoples R China
[2] Zhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Hangzhou 310027, Peoples R China
[3] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China
[4] Hunan Normal Univ, Sch Math & Stat, Changsha 410081, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 15期
基金
中国国家自然科学基金;
关键词
entity recognition; BERT; BiLSTM; multiple features; CRF;
D O I
10.3390/app13158913
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
It is highly significant from a research standpoint and a valuable practice to identify diseases, symptoms, drugs, examinations, and other medical entities in medical text data to support knowledge maps, question and answer systems, and other downstream tasks that can provide the public with knowledgeable answers. However, when contrasted with other languages like English, Chinese words lack a distinct dividing line, and medical entities have problems such as long length and multiple entity types nesting. Therefore, to address these issues, this study suggests a medical named entity recognition (NER) approach that combines part-of-speech and stroke features. First, the text is fed into the BERT pre-training model to get the semantic representation of the text, while the part-of-speech feature vector is obtained using the part-of-speech dictionary, and the stroke feature of the text is extracted through a convolution neural network (CNN). The word vector is then joined with the part-of-speech and stroke feature vectors, respectively, and input into the BiLSTM and CRF layer for training. Additionally, to balance the disparity in data volume across several types of entities, the class-weighted loss function is included in the loss function. According to the experimental findings, our model's F1 score on the CCKS2019 dataset reaches 78.65%, and the recognition performance exceeds many existing algorithms.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Leveraging Part-of-Speech Tagging Features and a Novel Regularization Strategy for Chinese Medical Named Entity Recognition
    Jiang, Miao
    Zhang, Xin
    Chen, Chonghao
    Shao, Taihua
    Chen, Honghui
    MATHEMATICS, 2022, 10 (09)
  • [2] Named Entity Recognition Based On A Hidden Markov Model in Part-Of-Speech Tagging
    Ageishi, Ryohei
    Miura, Takao
    2008 FIRST INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES, VOLS 1 AND 2, 2008, : 404 - 409
  • [3] On development of multimodal named entity recognition using part-of-speech and mixture of experts
    Chen, Jianying
    Xue, Yun
    Zhang, Haolan
    Ding, Weiping
    Zhang, Zhengxuan
    Chen, Jiehai
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (06) : 2181 - 2192
  • [4] On development of multimodal named entity recognition using part-of-speech and mixture of experts
    Jianying Chen
    Yun Xue
    Haolan Zhang
    Weiping Ding
    Zhengxuan Zhang
    Jiehai Chen
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 2181 - 2192
  • [5] Joint Part-of-Speech Tagging and Named Entity Recognition Using Factor Graphs
    Mora, Gyoergy
    Vincze, Veronika
    TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 232 - 239
  • [6] Named entity recognition for the Indonesian language: Combining contextual, morphological and part-of-speech features into a knowledge engineering approach
    Budi, I
    Bressan, S
    Wahyudi, G
    Hasibuan, ZA
    Nazief, BAA
    DISCOVERY SCIENCE, PROCEEDINGS, 2005, 3735 : 57 - 69
  • [7] Correcting word segmentation and part-of-speech tagging errors for Chinese named entity recognition
    Yao, TF
    Wei, D
    Erbach, G
    INTERNET CHALLENGE: TECHNOLOGY AND APPLICATIONS, 2002, : 29 - 36
  • [8] BLAC: A Named Entity Recognition Model Incorporating Part-of-Speech Attention in Irregular Short Text
    Zhu, Ming
    Li, Huakang
    Sun, Xiaoyu
    Yang, Zhuo
    2020 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE-RCAR 2020), 2020, : 56 - 61
  • [9] Korean-Vietnamese Neural Machine Translation with Named Entity Recognition and Part-of-Speech Tags
    Van-Hai Vu
    Quang-Phuoc Nguyen
    Kiem-Hieu Nguyen
    Shin, Joon-Choul
    Ock, Cheol-Young
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (04) : 866 - 873
  • [10] A Multi-domain Named Entity Recognition Method Based on Part-of-Speech Attention Mechanism
    Zhang, Shun
    Sheng, Ying
    Gao, Jiangfan
    Chen, Jianhui
    Huang, Jiajin
    Lin, Shaofu
    COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2019, 2019, 1042 : 631 - 644