Constructing synthetic datasets with generative artificial intelligence to train large language models to classify acute renal failure from clinical notes

Cited by: 1
Authors
Litake, Onkar [1]
Park, Brian H. [1]
Tully, Jeffrey L. [1]
Gabriel, Rodney A. [1, 2]
Affiliations
[1] Univ Calif San Diego, Dept Anesthesiol, Div Perioperat Informat, 9400 Campus Point Dr, La Jolla, CA 92037 USA
[2] Univ Calif San Diego Hlth, Dept Biomed Informat, La Jolla, CA 92037 USA
Keywords
large language models; artificial intelligence; generative AI; ChatGPT
DOI
10.1093/jamia/ocae081
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Objectives: To compare the performance of a language-model-based classifier when trained on synthetic versus authentic clinical notes.
Materials and Methods: A classifier using language models was developed to identify acute renal failure. Four types of training data were compared: (1) notes from MIMIC-III; and (2, 3, and 4) synthetic notes generated by ChatGPT with lengths of 15 (GPT-15 sentences), 30 (GPT-30 sentences), and 45 (GPT-45 sentences) sentences, respectively. The area under the receiver operating characteristic curve (AUC) was calculated on a test set from MIMIC-III.
Results: With RoBERTa, the AUCs were 0.84, 0.80, 0.84, and 0.76 for the MIMIC-III, GPT-15, GPT-30, and GPT-45 sentences training sets, respectively.
Discussion: Training language models to detect acute renal failure from clinical notes resulted in similar performance when using synthetic versus authentic training data.
Conclusion: The use of training data derived from protected health information may not be needed.
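The abstract reports the AUC as its evaluation metric. As a rough illustration only, and not the authors' code, the sketch below computes AUC from its pairwise definition: the probability that a randomly chosen positive note receives a higher score than a randomly chosen negative note, with ties counted as one half. The function name `roc_auc` and the toy labels and scores are hypothetical and are not taken from the paper.

```python
def roc_auc(labels, scores):
    """Pairwise (Mann-Whitney) estimate of the area under the ROC curve.

    labels: iterable of 0/1 ground-truth labels (1 = acute renal failure).
    scores: iterable of classifier scores, higher meaning more likely positive.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, for illustration only (not results from the paper).
y_true = [1, 0, 1, 0, 1, 0]
y_score = [0.9, 0.2, 0.6, 0.7, 0.8, 0.1]
print(f"AUC = {roc_auc(y_true, y_score):.3f}")
```

The quadratic pairwise loop is chosen for readability; on a test set of realistic size, a rank-based implementation from an established library would be the usual choice.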
Pages: 1404-1410
Number of pages: 7