Predicting the time to get back to work using statistical models and machine learning approaches

被引:0
|
作者
Bouliotis, George [1 ]
Underwood, M. [1 ]
Froud, R. [2 ]
机构
[1] Univ Warwick, Warwick Clin Trials Unit, Coventry, England
[2] Hoyskolen Kristiania, Oslo, Norway
关键词
Machine Learning; Survival analysis; Statistical methods; Return to work; Socioeconomic deprivation; EXTERNAL VALIDATION; REGULARIZATION; IMPUTATION;
D O I
10.1186/s12874-024-02390-4
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BackgroundWhether machine learning approaches are superior to classical statistical models for survival analyses, especially in the case of lack of proportionality, is unknown.ObjectivesTo compare model performance and predictive accuracy of classic regressions and machine learning approaches using data from the Inspiring Families programme.MethodsThe Inspiring Families programme aims to support members of families with complex issues to return to work. We explored predictors of time to return to work with proportional hazards (Semi-Parametric Cox in Stata) and (Flexible Parametric Parmar-Royston in Stata) against the Survival penalised regression with Elastic Net penalty (scikit-survival), (conditional) Survival Forest algorithm (pySurvival), and (kernel) Survival Support Vector Machine (pySurvival).ResultsAt baseline we obtained data on 61 binary variables from all 3161 participants. No model appeared superior, with a low predictive power (concordance index between 0.51 and 0.61). The median time for finding the first job was about 254 days. The top five contributing variables were 'family issues and additional barriers', 'restriction of hours', 'available CV', 'self-employment considered' and 'education'. The Harrell's Concordance index was range from 0.60 (Cox model) to 0.71 (Random Survival Forest) suggesting a better fit for the machine learning approaches. However, the comparison for predicting median time on a selected scenario based showed only minor differences.ConclusionImplementing a series of survival models with and without proportional hazards background provides a useful insight as well as better interpretation of the coefficients affected by non-linearities. However, that better fit does not translate to substantially higher predictive power and accuracy from using machine learning approaches. Further tuning of the machine learning algorithms may provide improved results.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Time to Get Back to Work
    Loi D.
    Miller M.
    Spence J.
    Sullivan K.
    Interactions (N.Y.), 2024, 31 (01) : 14 - 16
  • [2] Statistical and machine learning models for predicting spalling in CRCP
    Al-Khateeb, Ghazi G.
    Alnaqbi, Ali
    Zeiada, Waleed
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [3] Predicting the Dynamic Behaviour of a Concrete Dam using Statistical and Machine Learning Models
    Pereira, Sérgio
    Mata, Juan
    Magalhães, Filipe
    Gomes, Jorge
    Cunha, Álvaro
    e-Journal of Nondestructive Testing, 2024, 29 (07):
  • [4] PREDICTING RETURN TO WORK AFTER CARDIAC REHABILITATION USING MACHINE LEARNING MODELS
    Yuan, Choo Jia
    Varathan, Kasturi Dewi
    Suhaimi, Anwar
    Ling, Lee Wan
    JOURNAL OF REHABILITATION MEDICINE, 2022, 55
  • [5] Machine learning and statistical models for predicting indoor air quality
    Wei, Wenjuan
    Ramalho, Olivier
    Malingre, Laeticia
    Sivanantham, Sutharsini
    Little, John C.
    Mandin, Corinne
    INDOOR AIR, 2019, 29 (05) : 704 - 726
  • [6] Statistical and machine learning approaches to predicting protein-ligand interactions
    Colwell, Lucy J.
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2018, 49 : 123 - 128
  • [7] Systematic approaches to machine learning models for predicting pesticide toxicity
    Anandhi, Ganesan
    Iyapparaja, M.
    HELIYON, 2024, 10 (07)
  • [8] Using Stacking Approaches for Machine Learning Models
    Pavlyshenko, Bohdan
    2018 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA STREAM MINING & PROCESSING (DSMP), 2018, : 255 - 258
  • [9] Predicting Time to Dialysis and Unplanned Dialysis Start Using Machine Learning Models
    Shukla, Mahesh
    Garrett, Brendan C.
    Azari, Ali
    Kipping, Emily
    Culleton, Bruce F.
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2021, 32 (10): : 284 - 284
  • [10] Predicting Employee Attrition Using Machine Learning Approaches
    Raza, Ali
    Munir, Kashif
    Almutairi, Mubarak
    Younas, Faizan
    Fareed, Mian Muhammad Sadiq
    APPLIED SCIENCES-BASEL, 2022, 12 (13):