Learning to match patients to clinical trials using large language models

被引:2
|
作者
Rybinski, Maciej [1 ]
Kusa, Wojciech [2 ]
Karimi, Sarvnaz [1 ]
Hanbury, Allan [2 ]
机构
[1] CSIRO Data61, 26 Pembroke Rd, Marsfield, NSW 2122, Australia
[2] TU Wien, Favoritenstr 9-11, A-1040 Vienna, Austria
基金
欧盟地平线“2020”;
关键词
Clinical trials; Patient to trials matching; TCRR; TREC CT; Large language models; Information retrieval; Learning-to-rank;
D O I
10.1016/j.jbi.2024.104734
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objective: This study investigates the use of Large Language Models (LLMs) for matching patients to clinical trials (CTs) within an information retrieval pipeline. Our objective is to enhance the process of patient-trial matching by leveraging the semantic processing capabilities of LLMs, thereby improving the effectiveness of patient recruitment for clinical trials. Methods: We employed a multi-stage retrieval pipeline integrating various methodologies, including BM25 and Transformer-based rankers, along with LLM-based methods. Our primary datasets were the TREC Clinical Trials 2021-23 track collections. We compared LLM-based approaches, focusing on methods that leverage LLMs in query formulation, filtering, relevance ranking, and re-ranking of CTs. Results: Our results indicate that LLM-based systems, particularly those involving re-ranking with a fine-tuned LLM, outperform traditional methods in terms of nDCG and Precision measures. The study demonstrates that fine-tuning LLMs enhances their ability to find eligible trials. Moreover, our LLM-based approach is competitive with state-of-the-art systems in the TREC challenges. The study shows the effectiveness of LLMs in CT matching, highlighting their potential in handling complex semantic analysis and improving patient-trial matching. However, the use of LLMs increases the computational cost and reduces efficiency. We provide a detailed analysis of effectiveness-efficiency trade-offs. Conclusion: This research demonstrates the promising role of LLMs in enhancing the patient-to-clinical trial matching process, offering a significant advancement in the automation of patient recruitment. Future work should explore optimising the balance between computational cost and retrieval effectiveness in practical applications.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Matching patients to clinical trials with large language models
    Jin, Qiao
    Wang, Zifeng
    Floudas, Charalampos S.
    Chen, Fangyuan
    Gong, Changlin
    Bracken-Clarke, Dara
    Xue, Elisabetta
    Yang, Yifan
    Sun, Jimeng
    Lu, Zhiyong
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [2] Distilling large language models for matching patients to clinical trials
    Nievas, Mauro
    Basu, Aditya
    Wang, Yanshan
    Singh, Hrituraj
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (09) : 1953 - 1963
  • [3] Transforming clinical trials: the emerging roles of large language models
    Ghim, Jong-Lyul
    Ahn, Sangzin
    TRANSLATIONAL AND CLINICAL PHARMACOLOGY, 2023, 31 (03) : 131 - 138
  • [4] Assessing the Risk of Bias in Randomized Clinical Trials With Large Language Models
    Lai, Honghao
    Ge, Long
    Sun, Mingyao
    Pan, Bei
    Huang, Jiajie
    Hou, Liangying
    Yang, Qiuyu
    Liu, Jiayi
    Liu, Jianing
    Ye, Ziying
    Xia, Danni
    Zhao, Weilong
    Wang, Xiaoman
    Liu, Ming
    Talukdar, Jhalok Ronjan
    Tian, Jinhui
    Yang, Kehu
    Estill, Janne
    JAMA NETWORK OPEN, 2024, 7 (05) : E2412687
  • [5] Large language models streamline automated machine learning for clinical studies
    Arasteh, Soroosh Tayebi
    Han, Tianyu
    Lotfinia, Mahshad
    Kuhl, Christiane
    Kather, Jakob Nikolas
    Truhn, Daniel
    Nebelung, Sven
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [6] Large language models streamline automated machine learning for clinical studies
    Soroosh Tayebi Arasteh
    Tianyu Han
    Mahshad Lotfinia
    Christiane Kuhl
    Jakob Nikolas Kather
    Daniel Truhn
    Sven Nebelung
    Nature Communications, 15
  • [7] Assertion Detection in Clinical Natural Language Processing using Large Language Models
    Ji, Yuelyu
    Yu, Zeshui
    Wang, Yanshan
    2024 IEEE 12TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS, ICHI 2024, 2024, : 242 - 247
  • [8] Cohort selection for clinical trials using deep learning models
    Segura-Bedmar, Isabel
    Raez, Pablo
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2019, 26 (11) : 1181 - 1188
  • [9] New tools automatically match patients with clinical trials
    Opar, Alisa
    NATURE MEDICINE, 2013, 19 (07) : 793 - 793
  • [10] New tools automatically match patients with clinical trials
    Alisa Opar
    Nature Medicine, 2013, 19 : 793 - 793