Empirical power comparison of statistical tests in contemporary phase III randomized controlled trials with time-to-event outcomes in oncology

被引:3
|
作者
Horiguchi, Miki [1 ]
Hassett, Michael J. [1 ]
Uno, Hajime [1 ,2 ]
机构
[1] Dana Farber Canc Inst, Dept Med Oncol, 450 Brookline Ave, Boston, MA 02115 USA
[2] Dana Farber Canc Inst, Dept Data Sci, Boston, MA 02115 USA
关键词
Hazard ratio; log-rank test; restricted mean survival time; survival data analysis; weighted log-rank test; GOODNESS-OF-FIT; MEAN SURVIVAL-TIME; HAZARD RATIOS; MODEL;
D O I
10.1177/1740774520940256
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Background: More than 95% of recent cancer randomized controlled trials used the log-rank test to detect a treatment difference making it the predominant tool for comparing two survival functions. As with other tests, the log-rank test has both advantages and disadvantages. One advantage is that it offers the highest power against proportional hazards differences, which may be a major reason why alternative methods have rarely been employed in practice. The performance of statistical tests has traditionally been investigated both theoretically and numerically for several patterns of difference between two survival functions. However, to the best of our knowledge, there has been no attempt to compare the performance of various statistical tests using empirical data from past oncology randomized controlled trials. So, it is unknown whether the log-rank test offers a meaningful power advantage over alternative testing methods in contemporary cancer randomized controlled trials. Focusing on recently reported phase III cancer randomized controlled trials, we assessed whether the log-rank test gave meaningfully greater power when compared with five alternative testing methods: generalized Wilcoxon, test based on maximum of test statistics from multiple weighted log-rank tests, difference int-year event rate, and difference in restricted mean survival time with fixed and adaptive tau. Methods: Using manuscripts from cancer randomized controlled trials recently published in high-tier clinical journals, we reconstructed patient-level data for overall survival (69 trials) and progression-free survival (54 trials). For each trial endpoint, we estimated the empirical power of each test. Empirical power was measured as the proportion of trials for which a test would have identified a significant result (pvalue < .05). Results: For overall survival,t-year event rate offered the lowest (30.4%) empirical power and restricted mean survival time with fixed tau offered the highest (43.5%). The empirical power of the other types of tests was almost identical (36.2%-37.7%). For progression-free survival, the tests we investigated offered numerically equivalent empirical power (55.6%-61.1%). No single test consistently outperformed any other test. Conclusion: The empirical power assessment with the past cancer randomized controlled trials provided new insights on the performance of statistical tests. Although the log-rank test has been used in almost all trials, our study suggests that the log-rank test is not the only option from an empirical power perspective. Near universal use of the log-rank test is not supported by a meaningful difference in empirical power. Clinical trial investigators could consider alternative methods, beyond the log-rank test, for their primary analysis when designing a cancer randomized controlled trial. Factors other than power (e.g. interpretability of the estimated treatment effect) should garner greater consideration when selecting statistical tests for cancer randomized controlled trials.
引用
收藏
页码:597 / 606
页数:10
相关论文
共 50 条
  • [41] Hybridization of conditional and predictive power for futility assessment in sequential clinical trials with time-to-event outcomes: A resampling approach
    Yi, Jing
    Fang, Liang
    Su, Zheng
    [J]. CONTEMPORARY CLINICAL TRIALS, 2012, 33 (01) : 138 - 142
  • [42] Statistical power of negative randomized controlled trials presented at American Society for Clinical Oncology annual meetings
    Bedard, Philippe L.
    Krzyzanowska, Monika K.
    Pintilie, Melania
    Tannock, Ian F.
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2007, 25 (23) : 3482 - 3487
  • [43] Inadequate statistical power to detect clinically significant differences in adverse event rates in randomized controlled trials
    Tsang, Ruth
    Colley, Lindsey
    Lynd, Larry D.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2009, 62 (06) : 609 - 616
  • [44] A comparison of different population-level summary measures for randomised trials with time-to-event outcomes, with a focus on non-inferiority trials
    Quartagno, Matteo
    Morris, Tim P.
    Gilbert, Duncan C.
    Langley, Ruth E.
    Nankivell, Matthew G.
    Parmar, Mahesh K. B.
    White, Ian R.
    [J]. CLINICAL TRIALS, 2023, 20 (06) : 594 - 602
  • [45] Observer bias in randomized clinical trials with time-to-event outcomes: systematic review of trials with both blinded and non-blinded outcome assessors
    Hrobjartsson, Asbjorn
    Thomsen, Ann Sofia Skou
    Emanuelsson, Frida
    Tendal, Britta
    Rasmussen, Jeppe Vejlgaard
    Hilden, Jorgen
    Boutron, Isabelle
    Ravaud, Philippe
    Brorson, Stig
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2014, 43 (03) : 937 - 948
  • [46] Empirical comparison of methods for analyzing multiple time-to-event outcomes in a non-inferiority trial: a breast cancer study
    Parpia, Sameer
    Thabane, Lehana
    Julian, Jim A.
    Whelan, Timothy J.
    Levine, Mark N.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2013, 13
  • [47] Optimising precision and power by machine learning in randomised trials with ordinal and time-to-event outcomes with an application to COVID-19
    Williams, Nicholas
    Rosenblum, Michael
    Diaz, Ivan
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2022, 185 (04) : 2156 - 2178
  • [48] Empirical comparison of methods for analyzing multiple time-to-event outcomes in a non-inferiority trial: a breast cancer study
    Sameer Parpia
    Lehana Thabane
    Jim A Julian
    Timothy J Whelan
    Mark N Levine
    [J]. BMC Medical Research Methodology, 13
  • [49] Meta-analysis of time-to-event outcomes from randomized trials using restricted mean survival time: application to individual participant data
    Wei, Yinghui
    Royston, Patrick
    Tierney, Jayne F.
    Parmar, Mahesh K. B.
    [J]. STATISTICS IN MEDICINE, 2015, 34 (21) : 2881 - 2898
  • [50] Empirical comparison of four baseline covariate adjustment methods in analysis of continuous outcomes in randomized controlled trials
    Zhang, Shiyuan
    Paul, James
    Nantha-Aree, Manyat
    Buckley, Norman
    Shahzad, Uswa
    Cheng, Ji
    DeBeer, Justin
    Winemaker, Mitchell
    Wismer, David
    Punthakee, Dinshaw
    Avram, Victoria
    Thabane, Lehana
    [J]. CLINICAL EPIDEMIOLOGY, 2014, 6 : 227 - 235