Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data

被引：0

作者：

Chen, Yuhao ^{[1
]}

Wang, Zhimu ^{[1
]}

Zulkernine, Farhana ^{[1
]}

机构：

[1] Queens Univ, Sch Comp, Kingston, ON, Canada

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH, ICDH 2024 | 2024年

基金：

加拿大自然科学与工程研究理事会;

关键词：

Biomedical summarization; Large Language Model; Generative Model;

D O I：

10.1109/ICDH62654.2024.00030

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Unstructured text in medical notes and dialogues contains rich information. Recent advancements in Large Language Models (LLMs) have demonstrated superior performance in question answering and summarization tasks on unstructured text data, outperforming traditional text analysis approaches. However, there is a lack of scientific studies in the literature that methodically evaluate and report on the performance of different LLMs, specifically for domain-specific data such as medical chart notes. We propose an evaluation approach to analyze the performance of open-source LLMs such as Llama2 and Mistral for medical summarization tasks, using GPT-4 as an assessor. Our innovative approach to quantitative evaluation of LLMs can enable quality control, support the selection of effective LLMs for specific tasks, and advance knowledge discovery in digital health.

引用

页码：126 / 128

页数：3

共 50 条

[21] Archetypes of open-source business models
Estelle Duparc
Frederik Möller
Ilka Jussen
Maleen Stachon
Sükran Algac
Boris Otto
Electronic Markets, 2022, 32 : 727 - 745
[22] Archetypes of open-source business models
Duparc, Estelle
Moeller, Frederik
Jussen, Ilka
Stachon, Maleen
Algac, Sukran
Otto, Boris
ELECTRONIC MARKETS, 2022, 32 (02) : 727 - 745
[23] PharmaLLM: A Medicine Prescriber Chatbot Exploiting Open-Source Large Language Models
Ayesha Azam
Zubaira Naz
Muhammad Usman Ghani Khan
Human-Centric Intelligent Systems, 2024, 4 (4): : 527 - 544
[24] Automated Essay Scoring and Revising Based on Open-Source Large Language Models
Song, Yishen
Zhu, Qianta
Wang, Huaibo
Zheng, Qinhua
IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2024, 17 : 1920 - 1930
[25] Open-source large language models in action: A bioinformatics chatbot for PRIDE database
Bai, Jingwen
Kamatchinathan, Selvakumar
Kundu, Deepti J.
Bandla, Chakradhar
Vizcaino, Juan Antonio
Perez-Riverol, Yasset
PROTEOMICS, 2024,
[26] PMC-LLaMA: toward building open-source language models for medicine
Wu, Chaoyi
Lin, Weixiong
Zhang, Xiaoman
Zhang, Ya
Xie, Weidi
Wang, Yanfeng
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (09) : 1833 - 1843
[27] Open-Source Data Alternatives and Models for Flood Risk Management in Nepal
Thakuri, Sudeep
Parajuli, Binod Prasad
Shakya, Puja
Baskota, Preshika
Pradhan, Deepa
Chauhan, Raju
REMOTE SENSING, 2022, 14 (22)
[28] Accessible Russian Large Language Models: Open-Source Models and Instructive Datasets for Commercial Applications
Kosenko, D. P.
Kuratov, Yu. M.
Zharikova, D. R.
DOKLADY MATHEMATICS, 2023, 108 (SUPPL 2) : S393 - S398
[29] Accessible Russian Large Language Models: Open-Source Models and Instructive Datasets for Commercial Applications
D. P. Kosenko
Yu. M. Kuratov
D. R. Zharikova
Doklady Mathematics, 2023, 108 : S393 - S398
[30] OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data
Cartella, Giuseppe
Baldrati, Alberto
Morelli, Davide
Cornia, Marcella
Bertini, Marco
Cucchiara, Rita
IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 : 245 - 256

← 1 2 3 4 5 →