Improving the Efficiency of Clinical Trial Recruitment Using an Ensemble Machine Learning to Assist With Eligibility Screening

被引:9
|
作者
Cai, Tianrun [1 ]
Cai, Fiona [2 ]
Dahal, Kumar P. [1 ]
Cremone, Gabrielle [1 ]
Lam, Ethan [1 ]
Golnik, Charlotte [1 ]
Seyok, Thany [1 ]
Hong, Chuan [3 ]
Cai, Tianxi [3 ]
Liao, Katherine P. [4 ,5 ]
机构
[1] Brigham & Womens Hosp, Boston, MA 02115 USA
[2] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[3] Harvard Univ, Boston, MA 02115 USA
[4] Harvard Univ, Brigham & Womens Hosp, Boston, MA 02115 USA
[5] Vet Affairs Boston Healthcare Syst, Boston, MA USA
基金
美国国家卫生研究院;
关键词
COST;
D O I
10.1002/acr2.11289
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective Efficiently identifying eligible patients is a crucial first step for a successful clinical trial. The objective of this study was to test whether an approach using electronic health record (EHR) data and an ensemble machine learning algorithm incorporating billing codes and data from clinical notes processed by natural language processing (NLP) can improve the efficiency of eligibility screening. Methods We studied patients screened for a clinical trial of rheumatoid arthritis (RA) with one or more International Classification of Diseases (ICD) code for RA and age greater than 35 years, from a tertiary care center and a community hospital. The following three groups of EHR features were considered for the algorithm: 1) structured features, 2) the counts of NLP concepts from notes, 3) health care utilization. All features were linked to dates. We applied random forest and logistic regression with least absolute shrinkage and selection operator penalty against the following two standard approaches: 1) one or more RA ICD code and no ICD codes related to exclusion criteria (Screen(RAICD1)(+EX)) and 2) two or more RA ICD codes (Screen(RAICD2)). To test the portability, we trained the algorithm at one institution and tested it at the other. Results In total, 3359 patients at Brigham and Women's Hospital (BWH) and 642 patients at Faulkner Hospital (FH) were studied, with 461 (13.7%) eligible patients at BWH and 84 (13.4%) at FH. The application of the algorithm reduced ineligible patients from chart review by 40.5% at the tertiary care center and by 57.0% at the community hospital. In contrast, Screen(RAICD2) reduced patients for chart review by 2.7% to 11.3%; Screen(RAICD1+EX) reduced patients for chart review by 63% to 65% but excluded 22% to 27% of eligible patients. Conclusion The ensemble machine learning algorithm incorporating billing codes and NLP data increased the efficiency of eligibility screening by reducing the number of patients requiring chart review while not excluding eligible patients. Moreover, this approach can be trained at one institution and applied at another for multicenter clinical trials.
引用
收藏
页码:593 / 600
页数:8
相关论文
共 50 条
  • [41] Increasing the Efficiency of ALS Clinical Trials Using Machine Learning
    Ennist, David
    Taylor, Albert
    Beaulieu, Danielle
    Cuerdo, Jonavelle
    Conklin, Andrew
    Keymer, Mike
    [J]. NEUROLOGY, 2019, 92 (15)
  • [42] Osteoporosis Pre-Screening Using Ensemble Machine Learning in Postmenopausal Korean Women
    Kwon, Youngihn
    Lee, Juyeon
    Park, Joo Hee
    Kim, Yoo Mee
    Kim, Se Hwa
    Won, Young Jun
    Kim, Hyung-Yong
    [J]. HEALTHCARE, 2022, 10 (06)
  • [43] Assessing the Efficiency of Foreign Investment in a Certification Procedure Using an Ensemble Machine Learning Model
    Kemives, Aleksandar
    Barjaktarovic, Lidija
    Randelovic, Milan
    Cabarkapa, Milan
    Randelovic, Dragan
    [J]. MATHEMATICS, 2024, 12 (07)
  • [44] Predictors of Enrollment in a Smoking Cessation Clinical Trial After Eligibility Screening
    Dahm, Jamie Lyn
    Cook, Elaina
    Baugh, Kaylene
    Wileyto, E. Paul
    Pinto, Angela
    Leone, Frank
    Halbert, Chanita Hughes
    Schnoll, Robert A.
    [J]. JOURNAL OF THE NATIONAL MEDICAL ASSOCIATION, 2009, 101 (05) : 450 - 455
  • [45] RACIAL DISPARITIES IN SCREENING AND ELIGIBILITY DETERMINATION IN A SMOKING CESSATION CLINICAL TRIAL
    Matthews, Alicia K.
    Cao, Dingcai
    Southard, Catherine
    King, Andrea
    [J]. ANNALS OF BEHAVIORAL MEDICINE, 2011, 41 : S258 - S258
  • [46] Using Machine Learning to Assist Crime Prevention
    Lin, Ying-Lung
    Chen, Tenge-Yang
    Yu, Liang-Chih
    [J]. 2017 6TH IIAI INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS (IIAI-AAI), 2017, : 1029 - 1030
  • [47] A smart secured framework for detecting and averting online recruitment fraud using ensemble machine learning techniques
    Ullah, Zahid
    Jamjoom, Mona
    [J]. PEERJ COMPUTER SCIENCE, 2023, 9
  • [48] Performance of EHR classifiers for patient eligibility in a clinical trial of precision screening
    Alexander, Nicholas V. J.
    Brunette, Charles A.
    Guardino, Eric T.
    Yi, Thomas
    Kerman, Benjamin J.
    MacIsaac, Katharine
    Harris, Elizabeth J.
    Antwi, Ashley A.
    Vassy, Jason L.
    [J]. CONTEMPORARY CLINICAL TRIALS, 2022, 121
  • [49] Improving clinical trial recruitment for breast cancer patient using web and APP solutions
    Schinkoethe, T.
    Mika, M.
    Soethe, C.
    Wallwiener, M.
    [J]. ONKOLOGIE, 2012, 35 : 70 - 70
  • [50] Improving Test and Diagnosis Efficiency through Ensemble Reduction and Learning
    Wang, Hongfei
    He, Kun
    [J]. ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2019, 24 (05)