Predicting the time to get back to work using statistical models and machine learning approaches

被引:0
|
作者
Bouliotis, George [1 ]
Underwood, M. [1 ]
Froud, R. [2 ]
机构
[1] Univ Warwick, Warwick Clin Trials Unit, Coventry, England
[2] Hoyskolen Kristiania, Oslo, Norway
关键词
Machine Learning; Survival analysis; Statistical methods; Return to work; Socioeconomic deprivation; EXTERNAL VALIDATION; REGULARIZATION; IMPUTATION;
D O I
10.1186/s12874-024-02390-4
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BackgroundWhether machine learning approaches are superior to classical statistical models for survival analyses, especially in the case of lack of proportionality, is unknown.ObjectivesTo compare model performance and predictive accuracy of classic regressions and machine learning approaches using data from the Inspiring Families programme.MethodsThe Inspiring Families programme aims to support members of families with complex issues to return to work. We explored predictors of time to return to work with proportional hazards (Semi-Parametric Cox in Stata) and (Flexible Parametric Parmar-Royston in Stata) against the Survival penalised regression with Elastic Net penalty (scikit-survival), (conditional) Survival Forest algorithm (pySurvival), and (kernel) Survival Support Vector Machine (pySurvival).ResultsAt baseline we obtained data on 61 binary variables from all 3161 participants. No model appeared superior, with a low predictive power (concordance index between 0.51 and 0.61). The median time for finding the first job was about 254 days. The top five contributing variables were 'family issues and additional barriers', 'restriction of hours', 'available CV', 'self-employment considered' and 'education'. The Harrell's Concordance index was range from 0.60 (Cox model) to 0.71 (Random Survival Forest) suggesting a better fit for the machine learning approaches. However, the comparison for predicting median time on a selected scenario based showed only minor differences.ConclusionImplementing a series of survival models with and without proportional hazards background provides a useful insight as well as better interpretation of the coefficients affected by non-linearities. However, that better fit does not translate to substantially higher predictive power and accuracy from using machine learning approaches. Further tuning of the machine learning algorithms may provide improved results.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Exploring and Comparing Machine Learning Approaches for Predicting Mood Over Time
    van Breda, Ward
    Pastor, Johnno
    Hoogendoorn, Mark
    Ruwaard, Jeroen
    Asselbergs, Joost
    Riper, Heleen
    INNOVATION IN MEDICINE AND HEALTHCARE 2016, 2016, 60 : 37 - 47
  • [22] Comparison of machine learning techniques with classical statistical models in predicting health outcomes
    Song, XW
    Mitnitski, A
    Cox, J
    Rockwood, K
    MEDINFO 2004: PROCEEDINGS OF THE 11TH WORLD CONGRESS ON MEDICAL INFORMATICS, PT 1 AND 2, 2004, 107 : 736 - 740
  • [23] Comparing Machine Learning Models for Predicting Operative Time in Neurosurgical Procedures: Institutional Versus Nationwide Dataset Approaches
    Nathani, Karim Rizwan
    Bhandarkar, Archis R.
    Machlab, Laura
    von Kentzinsky, Hendrik
    Ibrahim, Sufyan
    Katsos, Konstantinos
    Ali, Rushna
    Tawk, Rabih G.
    Freedman, Brett
    Bydon, Mohamad
    NEUROSURGERY, 2025, 71 : 44 - 44
  • [24] ITS SEPTEMBER, TIME TO GET BACK TO WORK, TOGETHER
    TAYLOR, R
    CANADIAN VETERINARY JOURNAL-REVUE VETERINAIRE CANADIENNE, 1990, 31 (09): : 609 - 610
  • [25] It is time for some deep learning: a statistical commentary on machine learning for clinical prediction models using imbalanced datasets
    Stonko, David
    Jarman, Molly P.
    Byrne, James P.
    TRAUMA SURGERY & ACUTE CARE OPEN, 2024, 9 (01)
  • [26] Predicting the Outcome of Soccer Matches using Machine Learning and Statistical Analysis
    Elmiligi, Haytham
    Saad, Sherif
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 50 - 57
  • [27] Predicting osteoarthritis in adults using statistical data mining and machine learning
    Bertoncelli, Carlo M.
    Altamura, Paola
    Bagui, Sikha
    Bagui, Subhash
    Vieira, Edgar Ramos
    Costantini, Stefania
    Monticone, Marco
    Solla, Federico
    Bertoncelli, Domenico
    THERAPEUTIC ADVANCES IN MUSCULOSKELETAL DISEASE, 2022, 14
  • [28] Designing bioinspired green nanosilicas using statistical and machine learning approaches
    Dewulf, Luc
    Chiacchia, Mauro
    Yeardley, Aaron S.
    Milton, Robert A.
    Brown, Solomon F.
    Patwardhan, Siddharth, V
    MOLECULAR SYSTEMS DESIGN & ENGINEERING, 2021, 6 (04) : 293 - 307
  • [29] Using statistical and machine learning approaches to describe estuarine tidal dynamics
    Lauer, Franziska
    Koesters, Frank
    JOURNAL OF HYDROINFORMATICS, 2024, 26 (04) : 853 - 868
  • [30] Predicting the Occurrence of Metabolic Syndrome Using Machine Learning Models
    Trigka, Maria
    Dritsas, Elias
    Lahoz-Beltra, Rafael
    Zhang, Yudong
    COMPUTATION, 2023, 11 (09)