Comprehensive testing of large language models for extraction of structured data in pathology

被引:0
|
作者
Bastian Grothey [1 ]
Jan Odenkirchen [2 ]
Adnan Brkic [1 ]
Birgid Schömig-Markiefka [1 ]
Alexander Quaas [1 ]
Reinhard Büttner [1 ]
Yuri Tolkach [1 ]
机构
[1] University Hospital Cologne,Institute of Pathology
[2] University of Cologne,Medical Faculty
来源
关键词
D O I
10.1038/s43856-025-00808-8
中图分类号
学科分类号
摘要
Pathology departments produce many diagnostic reports as free text, which is hard to analyze or use in research and computer projects. Converting this free text into more standard organized information like test results or diagnoses, makes it easier to use. This task often requires human experts and takes time. Large language models (LLMs), which are advanced computer systems designed to understand and generate human-like text, might simplify this process. Here, we tested six LLMs, including freely available models and the commercial GPT-4 model, using 579 pathology reports in English and German. Our results show that freely available models can perform as well as commercial, providing a cheaper solution while avoiding privacy concerns. The shared dataset will support future research in pathology data processing.
引用
下载
收藏
相关论文
共 50 条
  • [21] Trend Extraction and Analysis via Large Language Models
    Soru, Tommaso
    Marshall, Jim
    18TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC 2024, 2024, : 285 - 288
  • [22] The interaction of structured data using openEHR and large Language models for clinical decision support in prostate cancer
    Philippe Kaiser
    Shan Yang
    Michael Bach
    Christian Breit
    Kirsten Mertz
    Bram Stieltjes
    Jan Ebbing
    Christian Wetterauer
    Maurice Henkel
    World Journal of Urology, 43 (1)
  • [23] Leveraging Cognitive Science for Testing Large Language Models
    Srinivasan, Ramya
    Inakoshi, Hiroya
    Uchino, Kanji
    2023 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING, AITEST, 2023, : 169 - 171
  • [24] Controlling the Extraction of Memorized Data from Large Language Models via Prompt-Tuning
    Ozdayi, Mustafa Safa
    Peris, Charith
    Fitzgerald, Jack
    Dupuy, Christophe
    Majmudar, Jimit
    Khan, Haidar
    Parikh, Rahil
    Gupta, Rahul
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1512 - 1521
  • [25] Pipelines for Social Bias Testing of Large Language Models
    Nozza, Debora
    Bianchi, Federico
    Hovy, Dirk
    PROCEEDINGS OF WORKSHOP ON CHALLENGES & PERSPECTIVES IN CREATING LARGE LANGUAGE MODELS (BIGSCIENCE EPISODE #5), 2022, : 68 - 74
  • [26] A Survey of Testing Techniques Based on Large Language Models
    Qi, Fei
    Hou, Yingnan
    Lin, Ning
    Bao, Shanshan
    Xu, Nuo
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON COMPUTER AND MULTIMEDIA TECHNOLOGY, ICCMT 2024, 2024, : 280 - 284
  • [27] Testing theory of mind in large language models and humans
    Strachan, James W. A.
    Albergo, Dalila
    Borghini, Giulia
    Pansardi, Oriana
    Scaliti, Eugenio
    Gupta, Saurabh
    Saxena, Krati
    Rufo, Alessandro
    Panzeri, Stefano
    Manzi, Guido
    Graziano, Michael S. A.
    Becchio, Cristina
    NATURE HUMAN BEHAVIOUR, 2024, 8 (07): : 1285 - 1295
  • [28] Demystifying Data Management for Large Language Models
    Miao, Xupeng
    Jia, Zhihao
    Cui, Bin
    COMPANION OF THE 2024 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2024, 2024, : 547 - 555
  • [29] A Comprehensive Analysis of Various Tokenizers for Arabic Large Language Models
    Qarah, Faisal
    Alsanoosy, Tawfeeq
    APPLIED SCIENCES-BASEL, 2024, 14 (13):
  • [30] GIST: Transforming Overwhelming Information into Structured Knowledge with Large Language Models
    Wu, Meng
    Zhou, Xinyu
    Ma, Gang
    Lu, Zhangwei
    Zhang, Liuxin
    Zhang, Yu
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 41 - 45