Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data

被引:0
|
作者
Chen, Yuhao [1 ]
Wang, Zhimu [1 ]
Zulkernine, Farhana [1 ]
机构
[1] Queens Univ, Sch Comp, Kingston, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Biomedical summarization; Large Language Model; Generative Model;
D O I
10.1109/ICDH62654.2024.00030
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Unstructured text in medical notes and dialogues contains rich information. Recent advancements in Large Language Models (LLMs) have demonstrated superior performance in question answering and summarization tasks on unstructured text data, outperforming traditional text analysis approaches. However, there is a lack of scientific studies in the literature that methodically evaluate and report on the performance of different LLMs, specifically for domain-specific data such as medical chart notes. We propose an evaluation approach to analyze the performance of open-source LLMs such as Llama2 and Mistral for medical summarization tasks, using GPT-4 as an assessor. Our innovative approach to quantitative evaluation of LLMs can enable quality control, support the selection of effective LLMs for specific tasks, and advance knowledge discovery in digital health.
引用
收藏
页码:126 / 128
页数:3
相关论文
共 50 条
  • [21] Archetypes of open-source business models
    Estelle Duparc
    Frederik Möller
    Ilka Jussen
    Maleen Stachon
    Sükran Algac
    Boris Otto
    Electronic Markets, 2022, 32 : 727 - 745
  • [22] Archetypes of open-source business models
    Duparc, Estelle
    Moeller, Frederik
    Jussen, Ilka
    Stachon, Maleen
    Algac, Sukran
    Otto, Boris
    ELECTRONIC MARKETS, 2022, 32 (02) : 727 - 745
  • [23] PharmaLLM: A Medicine Prescriber Chatbot Exploiting Open-Source Large Language Models
    Ayesha Azam
    Zubaira Naz
    Muhammad Usman Ghani Khan
    Human-Centric Intelligent Systems, 2024, 4 (4): : 527 - 544
  • [24] Automated Essay Scoring and Revising Based on Open-Source Large Language Models
    Song, Yishen
    Zhu, Qianta
    Wang, Huaibo
    Zheng, Qinhua
    IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2024, 17 : 1920 - 1930
  • [25] Open-source large language models in action: A bioinformatics chatbot for PRIDE database
    Bai, Jingwen
    Kamatchinathan, Selvakumar
    Kundu, Deepti J.
    Bandla, Chakradhar
    Vizcaino, Juan Antonio
    Perez-Riverol, Yasset
    PROTEOMICS, 2024,
  • [26] PMC-LLaMA: toward building open-source language models for medicine
    Wu, Chaoyi
    Lin, Weixiong
    Zhang, Xiaoman
    Zhang, Ya
    Xie, Weidi
    Wang, Yanfeng
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (09) : 1833 - 1843
  • [27] Open-Source Data Alternatives and Models for Flood Risk Management in Nepal
    Thakuri, Sudeep
    Parajuli, Binod Prasad
    Shakya, Puja
    Baskota, Preshika
    Pradhan, Deepa
    Chauhan, Raju
    REMOTE SENSING, 2022, 14 (22)
  • [28] Accessible Russian Large Language Models: Open-Source Models and Instructive Datasets for Commercial Applications
    Kosenko, D. P.
    Kuratov, Yu. M.
    Zharikova, D. R.
    DOKLADY MATHEMATICS, 2023, 108 (SUPPL 2) : S393 - S398
  • [29] Accessible Russian Large Language Models: Open-Source Models and Instructive Datasets for Commercial Applications
    D. P. Kosenko
    Yu. M. Kuratov
    D. R. Zharikova
    Doklady Mathematics, 2023, 108 : S393 - S398
  • [30] OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data
    Cartella, Giuseppe
    Baldrati, Alberto
    Morelli, Davide
    Cornia, Marcella
    Bertini, Marco
    Cucchiara, Rita
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 : 245 - 256