A comparison of time to event analysis methods, using weight status and breast cancer as a case study

被引:5
|
作者
Aivaliotis, Georgios [1 ,2 ,3 ]
Palczewski, Jan [1 ,2 ]
Atkinson, Rebecca [2 ]
Cade, Janet E. [4 ]
Morris, Michelle A. [2 ,3 ,5 ]
机构
[1] Univ Leeds, Sch Math, Leeds LS2 9JT, W Yorkshire, England
[2] Univ Leeds, Leeds Inst Data Analyt, Leeds LS2 9JT, W Yorkshire, England
[3] British Lib, Alan Turing Inst, London NW1 2DB, England
[4] Univ Leeds, Sch Food Sci & Nutr, Nutr Epidemiol Grp, Leeds LS2 9JT, W Yorkshire, England
[5] Univ Leeds, Sch Med, Leeds, W Yorkshire, England
基金
英国经济与社会研究理事会; 英国工程与自然科学研究理事会; 英国医学研究理事会;
关键词
PHYSICAL-ACTIVITY; RANDOM FORESTS; RISK; SURVIVAL; CONSUMPTION; COHORT; WOMEN;
D O I
10.1038/s41598-021-92944-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Survival analysis with cohort study data has been traditionally performed using Cox proportional hazards models. Random survival forests (RSFs), a machine learning method, now present an alternative method. Using the UK Women's Cohort Study (n=34,493) we evaluate two methods: a Cox model and an RSF, to investigate the association between Body Mass Index and time to breast cancer incidence. Robustness of the models were assessed by cross validation and bootstraping. Histograms of bootstrap coefficients are reported. C-Indices and Integrated Brier Scores are reported for all models. In post-menopausal women, the Cox model Hazard Ratios (HR) for Overweight (OW) and Obese (O) were 1.25 (1.04, 1.51) and 1.28 (0.98, 1.68) respectively and the RSF Odds Ratios (OR) with partial dependence on menopause for OW and O were 1.34 (1.31, 1.70) and 1.45 (1.42, 1.48). HR are non-significant results. Only the RSF appears confident about the effect of weight status on time to event. Bootstrapping demonstrated Cox model coefficients can vary significantly, weakening interpretation potential. An RSF was used to produce partial dependence plots (PDPs) showing OW and O weight status increase the probability of breast cancer incidence in post-menopausal women. All models have relatively low C-Index and high Integrated Brier Score. The RSF overfits the data. In our study, RSF can identify complex non-proportional hazard type patterns in the data, and allow more complicated relationships to be investigated using PDPs, but it overfits limiting extrapolation of results to new instances. Moreover, it is less easily interpreted than Cox models. The value of survival analysis remains paramount and therefore machine learning techniques like RSF should be considered as another method for analysis.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] A comparison of time to event analysis methods, using weight status and breast cancer as a case study
    Georgios Aivaliotis
    Jan Palczewski
    Rebecca Atkinson
    Janet E. Cade
    Michelle A. Morris
    Scientific Reports, 11
  • [2] Empirical comparison of methods for analyzing multiple time-to-event outcomes in a non-inferiority trial: a breast cancer study
    Sameer Parpia
    Lehana Thabane
    Jim A Julian
    Timothy J Whelan
    Mark N Levine
    BMC Medical Research Methodology, 13
  • [3] Empirical comparison of methods for analyzing multiple time-to-event outcomes in a non-inferiority trial: a breast cancer study
    Parpia, Sameer
    Thabane, Lehana
    Julian, Jim A.
    Whelan, Timothy J.
    Levine, Mark N.
    BMC MEDICAL RESEARCH METHODOLOGY, 2013, 13
  • [4] Weight status and breast cancer incidence in the UK Women's Cohort Study: a survival analysis
    Morris, Michelle A.
    Hulme, Claire
    Clarke, Graham P.
    Edwards, Kimberley L.
    Cade, Janet E.
    LANCET, 2014, 384 : 53 - 53
  • [5] Comparison of Dimension Reduction Methods for Time Series Factor Analysis: A Case Study
    Lee, Daesu
    Song, Seongjoo
    KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (04) : 597 - 607
  • [6] HER-2/neu status in breast cancer by real-time quantitative PCR:: A comparison of methods
    Bernard, PS
    Wittwer, CT
    Holden, J
    Layfield, L
    LABORATORY INVESTIGATION, 2001, 81 (01) : 22A - 22A
  • [7] HER-2/neu status in breast cancer by real-time quantitative PCR:: A comparison of methods
    Bernard, PS
    Wittwer, CT
    Holden, J
    Layfield, L
    MODERN PATHOLOGY, 2001, 14 (01) : 22A - 22A
  • [8] Analysis of weight trends over time in female breast cancer survivors
    Hansra, Damien Mikael
    Rollins, Rebecca
    Rados, Karen
    Johnson, Anita
    Ramey, Johnathan
    Panell, Rachel
    Daniels, Cynthia
    Hogeland, Cynthia
    Ninan, Mary
    McKnight, John Edward
    Hauser, Robert Sean
    Alvarez, Ricardo H.
    JOURNAL OF CLINICAL ONCOLOGY, 2018, 36 (15)
  • [9] A comparison of dynamic event tree methods - Case study on a chemical batch reactor
    Karanki, D. R.
    Dang, V. N.
    MacMillan, M. T.
    Podofillini, L.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2018, 169 : 542 - 553
  • [10] Aggregation methods and comparative study in time-to-event analysis models
    Fernandez, Camila
    Chen, Chung Shue
    Gaillard, Pierre
    Silva, Alonso
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,