Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data

被引:0
|
作者
Chen, Yuhao [1 ]
Wang, Zhimu [1 ]
Zulkernine, Farhana [1 ]
机构
[1] Queens Univ, Sch Comp, Kingston, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Biomedical summarization; Large Language Model; Generative Model;
D O I
10.1109/ICDH62654.2024.00030
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Unstructured text in medical notes and dialogues contains rich information. Recent advancements in Large Language Models (LLMs) have demonstrated superior performance in question answering and summarization tasks on unstructured text data, outperforming traditional text analysis approaches. However, there is a lack of scientific studies in the literature that methodically evaluate and report on the performance of different LLMs, specifically for domain-specific data such as medical chart notes. We propose an evaluation approach to analyze the performance of open-source LLMs such as Llama2 and Mistral for medical summarization tasks, using GPT-4 as an assessor. Our innovative approach to quantitative evaluation of LLMs can enable quality control, support the selection of effective LLMs for specific tasks, and advance knowledge discovery in digital health.
引用
收藏
页码:126 / 128
页数:3
相关论文
共 50 条
  • [1] Open-source large language models in medical education: Balancing promise and challenges
    Ray, Partha Pratim
    [J]. ANATOMICAL SCIENCES EDUCATION, 2024, 17 (06) : 1361 - 1362
  • [2] Enhancing Code Security Through Open-Source Large Language Models: A Comparative Study
    Ridley, Norah
    Branca, Enrico
    Kimber, Jadyn
    Stakhanova, Natalia
    [J]. FOUNDATIONS AND PRACTICE OF SECURITY, PT I, FPS 2023, 2024, 14551 : 233 - 249
  • [3] Servicing open-source large language models for oncology
    Ray, Partha Pratim
    [J]. ONCOLOGIST, 2024,
  • [4] An open-source data manager for network models
    Knox, Stephen
    Tomlinson, James
    Harou, Julien J.
    Meier, Philipp
    Rosenberg, David E.
    Lund, Jay R.
    Rheinheimer, David E.
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2019, 122
  • [5] Convergence Between Migrant Smuggling and Trafficking of Goods: Text Analysis of Open-Source Data
    Aziani, Alberto
    Jofre, Maria
    Mancuso, Marina
    [J]. INTERNATIONAL MIGRATION REVIEW, 2023,
  • [6] A tutorial on open-source large language models for behavioral science
    Hussain, Zak
    Binz, Marcel
    Mata, Rui
    Wulff, Dirk U.
    [J]. BEHAVIOR RESEARCH METHODS, 2024,
  • [7] TACIT: An open-source text analysis, crawling, and interpretation tool
    Dehghani, Morteza
    Johnson, Kate M.
    Garten, Justin
    Boghrati, Reihane
    Hoover, Joe
    Balasubramanian, Vijayan
    Singh, Anurag
    Shankar, Yuvarani
    Pulickal, Linda
    Rajkumar, Aswin
    Parmar, Niki Jitendra
    [J]. BEHAVIOR RESEARCH METHODS, 2017, 49 (02) : 538 - 547
  • [8] TACIT: An open-source text analysis, crawling, and interpretation tool
    Morteza Dehghani
    Kate M. Johnson
    Justin Garten
    Reihane Boghrati
    Joe Hoover
    Vijayan Balasubramanian
    Anurag Singh
    Yuvarani Shankar
    Linda Pulickal
    Aswin Rajkumar
    Niki Jitendra Parmar
    [J]. Behavior Research Methods, 2017, 49 : 538 - 547
  • [9] An open-source platform for Sensitivity Analysis of QSP models
    Packirisamy, Prakash
    Kumar, Rukmini
    [J]. JOURNAL OF PHARMACOKINETICS AND PHARMACODYNAMICS, 2016, 43 : S113 - S113
  • [10] Preliminary Systematic Review of Open-Source Large Language Models in Education
    Lin, Michael Pin-Chuan
    Chang, Daniel
    Hall, Sarah
    Jhajj, Gaganpreet
    [J]. GENERATIVE INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, PT I, ITS 2024, 2024, 14798 : 68 - 77