Improving the Efficiency of Clinical Trial Recruitment Using an Ensemble Machine Learning to Assist With Eligibility Screening

被引:9
|
作者
Cai, Tianrun [1 ]
Cai, Fiona [2 ]
Dahal, Kumar P. [1 ]
Cremone, Gabrielle [1 ]
Lam, Ethan [1 ]
Golnik, Charlotte [1 ]
Seyok, Thany [1 ]
Hong, Chuan [3 ]
Cai, Tianxi [3 ]
Liao, Katherine P. [4 ,5 ]
机构
[1] Brigham & Womens Hosp, Boston, MA 02115 USA
[2] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[3] Harvard Univ, Boston, MA 02115 USA
[4] Harvard Univ, Brigham & Womens Hosp, Boston, MA 02115 USA
[5] Vet Affairs Boston Healthcare Syst, Boston, MA USA
基金
美国国家卫生研究院;
关键词
COST;
D O I
10.1002/acr2.11289
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective Efficiently identifying eligible patients is a crucial first step for a successful clinical trial. The objective of this study was to test whether an approach using electronic health record (EHR) data and an ensemble machine learning algorithm incorporating billing codes and data from clinical notes processed by natural language processing (NLP) can improve the efficiency of eligibility screening. Methods We studied patients screened for a clinical trial of rheumatoid arthritis (RA) with one or more International Classification of Diseases (ICD) code for RA and age greater than 35 years, from a tertiary care center and a community hospital. The following three groups of EHR features were considered for the algorithm: 1) structured features, 2) the counts of NLP concepts from notes, 3) health care utilization. All features were linked to dates. We applied random forest and logistic regression with least absolute shrinkage and selection operator penalty against the following two standard approaches: 1) one or more RA ICD code and no ICD codes related to exclusion criteria (Screen(RAICD1)(+EX)) and 2) two or more RA ICD codes (Screen(RAICD2)). To test the portability, we trained the algorithm at one institution and tested it at the other. Results In total, 3359 patients at Brigham and Women's Hospital (BWH) and 642 patients at Faulkner Hospital (FH) were studied, with 461 (13.7%) eligible patients at BWH and 84 (13.4%) at FH. The application of the algorithm reduced ineligible patients from chart review by 40.5% at the tertiary care center and by 57.0% at the community hospital. In contrast, Screen(RAICD2) reduced patients for chart review by 2.7% to 11.3%; Screen(RAICD1+EX) reduced patients for chart review by 63% to 65% but excluded 22% to 27% of eligible patients. Conclusion The ensemble machine learning algorithm incorporating billing codes and NLP data increased the efficiency of eligibility screening by reducing the number of patients requiring chart review while not excluding eligible patients. Moreover, this approach can be trained at one institution and applied at another for multicenter clinical trials.
引用
收藏
页码:593 / 600
页数:8
相关论文
共 50 条
  • [31] Increasing the efficiency of trial-patient matching: automated clinical trial eligibility Pre-screening for pediatric oncology patients
    Ni, Yizhao
    Wright, Jordan
    Perentesis, John
    Lingren, Todd
    Deleger, Louise
    Kaiser, Megan
    Kohane, Isaac
    Solti, Imre
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2015, 15
  • [32] Recruitment Prediction using Machine Learning
    Reddy, Jagan Mohan D.
    Regella, Sirisha
    Seelam, Srinivasa Reddy
    [J]. PROCEEDINGS OF THE 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS-2020), 2020,
  • [33] Audit of clinical trial eligibility and recruitment rates in a single speciality cancer centre
    Maughan, TS
    Branston, L
    Batt, LA
    Shankland, SH
    [J]. BRITISH JOURNAL OF CANCER, 2000, 83 : 2 - 2
  • [34] Improving Clinical Trial Efficiency in Gastroenterology
    Ma, Christopher
    Guizzetti, Leonardo
    Jairath, Vipul
    [J]. GASTROENTEROLOGY, 2019, 157 (03) : 892 - 893
  • [35] Improving clinical fetal weight estimation using machine learning
    Cohen, Gal
    Girshovitz, Irena
    Amit, Guy
    Hila, Shalev Ram
    Akiva, Pinchas
    Biron-Shental, Tal
    [J]. AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 2023, 228 (01) : S131 - S131
  • [36] NEOADJUVANT CHEMOTHERAPY IN ADVANCED OVARIAN CANCER PATIENTS: EFFICIENCY OF SCREENING BY LAPROSCOPY FOR CLINICAL TRIAL RECRUITMENT
    Rouzier, R.
    Chereau, E.
    Floquet, A.
    Selle, F.
    Fourchotte, V.
    Pomel, C.
    Follana, P.
    Martin-Francoise, S.
    Fauvet, R.
    Colombo, P. E.
    Kalbacher, E.
    Lesoin, A.
    Lecuru, F.
    Cottu, P.
    Joly, F.
    Mengui, V.
    Ghazi, Y.
    Morice, P.
    [J]. INTERNATIONAL JOURNAL OF GYNECOLOGICAL CANCER, 2014, 24 (09) : 165 - 166
  • [37] Improving the robustness of beach water quality modeling using an ensemble machine learning approach
    Wang, Leizhi
    Zhu, Zhenduo
    Sassoubre, Lauren
    Yu, Guan
    Liao, Chen
    Hu, Qingfang
    Wang, Yintang
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 765
  • [38] Improving groundwater nitrate concentration prediction using local ensemble of machine learning models
    Mahboobi, Hojjatollah
    Shakiba, Alireza
    Mirbagheri, Babak
    [J]. JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2023, 345
  • [39] Improving performance in colorectal cancer histology decomposition using deep and ensemble machine learning
    Prezja, Fabi
    Annala, Leevi
    Kiiskinen, Sampsa
    Lahtinen, Suvi
    Ojala, Timo
    Ruusuvuori, Pekka
    Kuopio, Teijo
    [J]. HELIYON, 2024, 10 (18)
  • [40] Improving Diagnosis Efficiency via Machine Learning
    Huang, Qicheng
    Fang, Chenlei
    Mittal, Soumya
    Blanton, R. D.
    [J]. 2018 IEEE INTERNATIONAL TEST CONFERENCE (ITC), 2018,