Power and sample-size analysis for the Royston Parmar combined test in clinical trials with a time-to-event outcome

被引:6
|
作者
Royston, Patrick [1 ]
机构
[1] UCL, MRC Clin Trials Unit, London, England
来源
STATA JOURNAL | 2018年 / 18卷 / 01期
关键词
st0510; power_ct; randomized controlled trial; time-to-event outcome; restricted mean survival time; log-rank test; Cox test; combined test; treatment effect; hypothesis testing; flexible parametric model; MEAN SURVIVAL-TIME; NONPROPORTIONAL HAZARDS; REGRESSION-ANALYSIS; PSEUDO-OBSERVATIONS; MODELS; PACLITAXEL; DIFFERENCE; CANCER; UPDATE;
D O I
10.1177/1536867X1801800102
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Randomized controlled trials with a time-to-event outcome are usually designed and analyzed assuming proportional hazards (PH) of the treatment effect. The sample-size calculation is based on a log-rank test or the nearly identical Cox test, henceforth called the Cox/log-rank test. Nonproportional hazards (non-PH) has become more common in trials and is recognized as a potential threat to interpreting the trial treatment effect and the power of the log-rank test hence to the success of the trial. To address the issue, in 2016, Royston and Parmar (BMC Medical Research Methodology 16: 16) proposed a "combined test" of the global null hypothesis of identical survival curves in each trial arm. The Cox/logrank test is combined with a new test derived from the maximal standardized difference in restricted mean survival time (RMST) between the trial arms. The test statistic is based on evaluations of the between-arm difference in RMST over several preselected time points. The combined test involves the minimum p-value across the Cox/log-rank and RMST-based tests, appropriately standardized to have the correct distribution under the global null hypothesis. In this article, I introduce a new command, power_ct, that uses simulation to implement power and sample-size calculations for the combined test. power_ct supports designs with PH or non-PH of the treatment effect. I provide examples in which the power of the combined test is compared with that of the Cox/log-rank test under PH and non-PH scenarios. I conclude by offering guidance for sample-size calculations in time-to-event trials to allow for possible non-PH.
引用
收藏
页码:3 / 21
页数:19
相关论文
共 50 条