Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data

Cited by: 0
Authors
Chen, Yuhao [1]
Wang, Zhimu [1]
Zulkernine, Farhana [1]
Affiliations
[1] Queens Univ, Sch Comp, Kingston, ON, Canada
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
Biomedical summarization; Large Language Model; Generative Model
DOI
10.1109/ICDH62654.2024.00030
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Unstructured text in medical notes and dialogues contains rich information. Recent advancements in Large Language Models (LLMs) have demonstrated superior performance in question answering and summarization tasks on unstructured text data, outperforming traditional text analysis approaches. However, there is a lack of scientific studies in the literature that methodically evaluate and report on the performance of different LLMs, specifically for domain-specific data such as medical chart notes. We propose an evaluation approach to analyze the performance of open-source LLMs such as Llama2 and Mistral for medical summarization tasks, using GPT-4 as an assessor. Our innovative approach to quantitative evaluation of LLMs can enable quality control, support the selection of effective LLMs for specific tasks, and advance knowledge discovery in digital health.
Pages: 126-128
Page count: 3
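
Illustrative sketch (not from the paper): the abstract describes scoring summaries from several open-source LLMs with a separate assessor model. The record gives no prompts, rubric, or scoring scale, so everything below is an assumption made for illustration: the 1-to-5 scale, the `Judge` callable signature, the `rank_models` helper, and the word-overlap `toy_judge` that stands in for a real LLM assessor such as GPT-4.

```python
from statistics import mean
from typing import Callable, Dict, List, Tuple

# Hypothetical assessor signature: (source_note, summary) -> score in [1, 5].
# In a real setup this would wrap a call to an assessor LLM.
Judge = Callable[[str, str], float]


def rank_models(
    notes: List[str],
    summaries: Dict[str, List[str]],
    judge: Judge,
) -> List[Tuple[str, float]]:
    """Return (model, mean score) pairs sorted from best to worst."""
    results: Dict[str, float] = {}
    for model, outputs in summaries.items():
        if len(outputs) != len(notes):
            raise ValueError(f"{model}: expected {len(notes)} summaries")
        # Score each summary against its source note, then average.
        results[model] = mean(judge(n, s) for n, s in zip(notes, outputs))
    return sorted(results.items(), key=lambda kv: kv[1], reverse=True)


def toy_judge(note: str, summary: str) -> float:
    """Stand-in assessor: share of summary words found in the note, mapped to 1-5."""
    note_words = set(note.lower().split())
    words = summary.lower().split()
    if not words:
        return 1.0
    overlap = sum(w in note_words for w in words) / len(words)
    return 1.0 + 4.0 * overlap


if __name__ == "__main__":
    notes = ["patient reports chest pain"]
    # Placeholder model names; these are not results from the paper.
    summaries = {"model_a": ["chest pain"], "model_b": ["broken leg"]}
    print(rank_models(notes, summaries, toy_judge))
```

Keeping the assessor behind a plain callable means the ranking logic can be checked with a deterministic stub before any model API is involved.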