Language model based on deep learning network for biomedical named entity recognition

Cited by: 2
Authors
Hou, Guan [1]
Jian, Yuhao [1]
Zhao, Qingqing [1]
Quan, Xiongwen [1]
Zhang, Han [1]
Affiliations
[1] Nankai Univ, Coll Artificial Intelligence, Tianjin, Peoples R China
Keywords
Biomedical named entity recognition; Deep learning; Language model; Multi-task learning;
DOI
10.1016/j.ymeth.2024.04.013
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
Biomedical named entity recognition (BioNER) is one of the most basic tasks in biomedical text mining; it aims to automatically identify and classify biomedical entities in text. Recently, deep learning-based methods have been applied to BioNER and have shown encouraging results. However, many biological entities are polysemous and ambiguous, which is one of the main obstacles to the task. Deep learning methods also require large amounts of training data, so a lack of data further degrades recognition performance. To address polysemous words and insufficient data, we propose a multi-task learning framework fused with a language model and built on the BiLSTM-CRF architecture. Our model uses a language model to produce a differential encoding of the context, yielding dynamic word vectors that distinguish words in different datasets. Moreover, we use multi-task learning to share the dynamic word vectors across different entity types, improving the recognition performance for each type. Experimental results show that our model reduces the false positives caused by polysemous words through differentiated encoding, and improves the performance of each subtask by sharing information between different entity data. Compared with other state-of-the-art methods, our model achieved superior results on four typical training sets and the best F1 values.
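The abstract reports results as F1 values and attributes part of the gain to fewer false positives. As a rough illustration of how such a score is commonly computed for NER, the sketch below counts exact-match entity spans. It assumes BIO-style tags and exact-span matching, neither of which the abstract specifies, and the names `bio_to_spans` and `entity_f1` are hypothetical helpers, not the authors' evaluation code.

```python
from typing import List, Set, Tuple

# (start index, end index exclusive, entity type)
Span = Tuple[int, int, str]


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from a BIO tag sequence (assumed tagging scheme)."""
    spans: Set[Span] = set()
    start, label = None, None
    # A trailing "O" sentinel closes any entity still open at the end.
    for i, tag in enumerate(tags + ["O"]):
        # An I- tag continues the open entity only if the type matches.
        continues = tag.startswith("I-") and label == tag[2:]
        if not continues:
            if label is not None:
                spans.add((start, i, label))
            # B- opens a new entity; a stray I- is treated as an opening too.
            if tag.startswith(("B-", "I-")):
                start, label = i, tag[2:]
            else:
                start, label = None, None
    return spans


def entity_f1(gold: List[str], pred: List[str]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) over exact-match entity spans."""
    gold_spans, pred_spans = bio_to_spans(gold), bio_to_spans(pred)
    tp = len(gold_spans & pred_spans)   # predicted spans that match gold
    fp = len(pred_spans - gold_spans)   # predicted spans with no gold match
    fn = len(gold_spans - pred_spans)   # gold spans that were missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Toy tags, invented for illustration only.
    gold = ["B-Gene", "I-Gene", "O", "B-Disease", "O"]
    pred = ["B-Gene", "I-Gene", "O", "O", "B-Gene"]
    print(entity_f1(gold, pred))
```

In the toy example, the spurious `Gene` span at the last token is a false positive, the kind of error the abstract says differentiated encoding reduces; removing it raises precision while leaving recall unchanged.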
Pages: 71-77
Page count: 7