Contextual Word Embedding for Biomedical Knowledge Extraction: a Rapid Review and Case Study

被引:0
|
作者
Dinithi Vithanage
Ping Yu
Lei Wang
Chao Deng
机构
[1] University of Wollongong,School of Computing and Information Technology
[2] University of Wollongong,School of Medical, Indigenous and Health Sciences
关键词
Knowledge extraction; Natural language processing; Contextual word embedding models; Biomedical text; Electronic health records;
D O I
暂无
中图分类号
学科分类号
摘要
Recent advancements in natural language processing (NLP), particularly contextual word embedding models, have improved knowledge extraction from biomedical and healthcare texts. However, limited comprehensive research compares these models. This study conducts a scoping review and compares the performance of the major contextual word embedding models for biomedical knowledge extraction. From 26 articles identified from Scopus, PubMed, PubMed Central, and Google Scholar between 2017 and 2021, 18 notable contextual word embedding models were identified. These include ELMo, BERT, BioBERT, BlueBERT, CancerBERT, DDS-BERT, RuBERT, LABSE, EhrBERT, MedBERT, Clinical BERT, Clinical BioBERT, Discharge Summary BERT, Discharge Summary BioBERT, GPT, GPT-2, GPT-3, and GPT2-Bio-Pt. A case study compared the performance of six representative models—ELMo, BERT, BioBERT, BlueBERT, Clinical BioBERT, and GPT-3—across text classification, named entity recognition, and question answering. The evaluation utilized datasets comprising biomedical text from tweets, NCBI, PubMed, and clinical notes sourced from two electronic health record datasets. Performance metrics, including accuracy and F1 score, were used. The results of this case study reveal that BioBERT performs the best in analyzing biomedical text, while Clinical BioBERT excels in analyzing clinical notes. These findings offer crucial insights into word embedding models for researchers, practitioners, and stakeholders utilizing NLP in biomedical and clinical document analysis.
引用
收藏
页码:158 / 179
页数:21
相关论文
共 50 条
  • [1] Contextual Word Embedding for Biomedical Knowledge Extraction: a Rapid Review and Case Study
    Vithanage, Dinithi
    Yu, Ping
    Wang, Lei
    Deng, Chao
    [J]. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH, 2024, 8 (01) : 158 - 179
  • [2] Impact analysis of keyword extraction using contextual word embedding
    Khan, Muhammad Qasim
    Shahid, Abdul
    Uddin, M. Irfan
    Roman, Muhammad
    Alharbi, Abdullah
    Alosaimi, Wael
    Almalki, Jameel
    Alshahrani, Saeed M.
    [J]. PEERJ COMPUTER SCIENCE, 2022, 8
  • [3] Contextual Word Embedding: A Case Study in Clustering Tweets about Emergency Situations
    Ganguly, Debasis
    Ghosh, Kripabandhu
    [J]. COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 73 - 74
  • [4] Integrating Extra Knowledge into Word Embedding Models for Biomedical NLP Tasks
    Ling, Yuan
    An, Yuan
    Liu, Mengwen
    Hasan, Sadid A.
    Fan, Yetian
    Hu, Xiaohua
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 968 - 975
  • [5] Contextual embedding and model weighting by fusing domain knowledge on Biomedical Question Answering
    Lu, Yuxuan
    Yan, Jingya
    Qi, Zhixuan
    Ge, Zhongzheng
    Du, Yongping
    [J]. 13TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, BCB 2022, 2022,
  • [6] Comparative Study of Word Embedding Methods in Biomedical Named Entities Recognition
    Derbel, Houssemeddine
    Habacha Chaibi, Anja
    Benabdelkader, Chiraz
    Hajjami Ben Ghezala, Henda
    [J]. VISION 2025: EDUCATION EXCELLENCE AND MANAGEMENT OF INNOVATIONS THROUGH SUSTAINABLE ECONOMIC COMPETITIVE ADVANTAGE, 2019, : 6356 - 6367
  • [7] Information Retrieval Based on Knowledge-Enhanced Word Embedding Through Dialog: A Case Study
    Ren, Jin
    Wang, Hengsheng
    Liu, Tong
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 275 - 290
  • [8] Information Retrieval Based on Knowledge-Enhanced Word Embedding Through Dialog: A Case Study
    Jin Ren
    Hengsheng Wang
    Tong Liu
    [J]. International Journal of Computational Intelligence Systems, 2020, 13 : 275 - 290
  • [9] An information extraction and representation system for rapid review of the biomedical literature
    Revere, D
    Fuller, S
    Bugni, PF
    Martin, GM
    [J]. MEDINFO 2004: PROCEEDINGS OF THE 11TH WORLD CONGRESS ON MEDICAL INFORMATICS, PT 1 AND 2, 2004, 107 : 788 - 792
  • [10] Event causality extraction through external event knowledge learning and polyhedral word embedding
    Wei, Xiao
    Huang, Chenyang
    Zhu, Nengjun
    [J]. MACHINE LEARNING, 2024, 113 (08) : 1 - 20