A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data

被引:6
|
作者
Smith, Hayley [1 ]
Sweeting, Michael [1 ,2 ]
Morris, Tim [3 ]
Crowther, Michael [4 ]
机构
[1] Univ Leicester, Dept Hlth Sci, Leicester LE1 7RH, England
[2] Stat Innovat, Oncol Biometr, Oncol R&D, AstraZeneca, Cambridge, England
[3] UCL, MRC Clin Trials Unit, 90 High Holborn, London WC1V 6LJ, England
[4] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
关键词
Machine learning; Prognostic modelling; Clinical risk prediction; Survival analysis; Simulation studies; CENSORED SURVIVAL-DATA; FRACTIONAL POLYNOMIALS; CARDIOVASCULAR RISK; COX REGRESSION; MODELS; PROGNOSIS; DESIGN;
D O I
10.1186/s41512-022-00124-y
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background There is substantial interest in the adaptation and application of so-called machine learning approaches to prognostic modelling of censored time-to-event data. These methods must be compared and evaluated against existing methods in a variety of scenarios to determine their predictive performance. A scoping review of how machine learning methods have been compared to traditional survival models is important to identify the comparisons that have been made and issues where they are lacking, biased towards one approach or misleading.Methods We conducted a scoping review of research articles published between 1 January 2000 and 2 December 2020 using PubMed. Eligible articles were those that used simulation studies to compare statistical and machine learning methods for risk prediction with a time-to-event outcome in a medical/healthcare setting. We focus on data-generating mechanisms (DGMs), the methods that have been compared, the estimands of the simulation studies, and the performance measures used to evaluate them.Results A total of ten articles were identified as eligible for the review. Six of the articles evaluated a method that was developed by the authors, four of which were machine learning methods, and the results almost always stated that this developed method's performance was equivalent to or better than the other methods compared. Comparisons were often biased towards the novel approach, with the majority only comparing against a basic Cox proportional hazards model, and in scenarios where it is clear it would not perform well. In many of the articles reviewed, key information was unclear, such as the number of simulation repetitions and how performance measures were calculated.Conclusion It is vital that method comparisons are unbiased and comprehensive, and this should be the goal even if realising it is difficult. Fully assessing how newly developed methods perform and how they compare to a variety of traditional statistical methods for prognostic modelling is imperative as these methods are already being applied in clinical contexts. Evaluations of the performance and usefulness of recently developed methods for risk prediction should be continued and reporting standards improved as these methods become increasingly popular.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Prediction of Cardiovascular Disease in Patients with Systemic Lupus Erythematosus Using a Machine Learning Algorithm for Time-to-Event Outcomes: Random Survival Forest
    Liu, Hsin Yen
    Su, Jiandong
    Bonilla, Dennisse
    Duaibes, Sara
    Martinez, Juan Pablo Diaz
    Touma, Zahi
    [J]. ARTHRITIS & RHEUMATOLOGY, 2023, 75 : 2897 - 2900
  • [42] Individual participant data meta-analysis of intervention studies with time-to-event outcomes: A review of the methodology and an applied example
    de Jong, Valentijn M. T.
    Moons, Karel G. M.
    Riley, Richard D.
    Tudur Smith, Catrin
    Marson, Anthony G.
    Eijkemans, Marinus J. C.
    Debray, Thomas P. A.
    [J]. RESEARCH SYNTHESIS METHODS, 2020, 11 (02) : 148 - 168
  • [43] Machine Learning in Colorectal Cancer Risk Prediction from Routinely Collected Data: A Review
    Burnett, Bruce
    Zhou, Shang-Ming
    Brophy, Sinead
    Davies, Phil
    Ellis, Paul
    Kennedy, Jonathan
    Bandyopadhyay, Amrita
    Parker, Michael
    Lyons, Ronan A.
    [J]. DIAGNOSTICS, 2023, 13 (02)
  • [44] Biomarker discovery studies for patient stratification using machine learning analysis of omics data: a scoping review
    Glaab, Enrico
    Rauschenberger, Armin
    Banzi, Rita
    Gerardi, Chiara
    Garcia, Paula
    Demotes, Jacques
    [J]. BMJ OPEN, 2021, 11 (12):
  • [45] Investigation of a Data Split Strategy Involving the Time Axis in Adverse Event Prediction Using Machine Learning
    Morita, Katsuhisa
    Mizuno, Tadahaya
    Kusuhara, Hiroyuki
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (17) : 3982 - 3992
  • [46] Machine-learning-based adverse drug event prediction from observational health data: A review
    Denck, Jonas
    Ozkirimli, Elif
    Wang, Ken
    [J]. DRUG DISCOVERY TODAY, 2023, 28 (09)
  • [47] A review of statistical and machine learning methods for modeling cancer risk using structured clinical data
    Richter, Aaron N.
    Khoshgoftaar, Taghi M.
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2018, 90 : 1 - 14
  • [48] AutoScore-Survival: Developing interpretable machine learning-based time-to-event scores with right-censored survival data
    Xie, Feng
    Ning, Yilin
    Yuan, Han
    Goldstein, Benjamin Alan
    Ong, Marcus Eng Hock
    Liu, Nan
    Chakraborty, Bibhas
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 125
  • [49] Disease Prediction Using Graph Machine Learning Based on Electronic Health Data: A Review of Approaches and Trends
    Lu, Haohui
    Uddin, Shahadat
    [J]. HEALTHCARE, 2023, 11 (07)
  • [50] Machine learning for the life-time risk prediction of Alzheimer's disease: a systematic review
    Rowe, Thomas W.
    Katzourou, Ioanna K.
    Stevenson-Hoare, Joshua O.
    Bracher-Smith, Matthew R.
    Ivanov, Dobril K.
    Escott-Price, Valentina
    [J]. BRAIN COMMUNICATIONS, 2021, 3 (04)