Testing non-inferiority of a new treatment in three-arm clinical trials with binary endpoints

被引:11
|
作者
Tang, Nian-Sheng [1 ]
Yu, Bin [1 ]
Tang, Man-Lai [2 ]
机构
[1] Yunnan Univ, Dept Stat, Kunming 650091, Peoples R China
[2] Hang Seng Management Coll, Dept Math & Stat, Shatin Nt, Hong Kong, Peoples R China
基金
美国国家科学基金会; 高等学校博士学科点专项科研基金;
关键词
Approximate unconditional test; Bootstrap-resampling test; Non-inferiority trial; Rate difference; Saddlepoint approximation; Three-arm design; ASSESSING NONINFERIORITY; NULL HYPOTHESIS; EQUIVALENCE; DIFFERENCE; RATIO; RISK; PROPORTIONS; INFERENCE;
D O I
10.1186/1471-2288-14-134
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: A two-arm non-inferiority trial without a placebo is usually adopted to demonstrate that an experimental treatment is not worse than a reference treatment by a small pre-specified non-inferioritymargin due to ethical concerns. Selection of the non-inferioritymargin and establishment of assay sensitivity are two major issues in the design, analysis and interpretation for two-arm non-inferiority trials. Alternatively, a three-arm non-inferiority clinical trial including a placebo is usually conducted to assess the assay sensitivity and internal validity of a trial. Recently, some large-sample approaches have been developed to assess the non-inferiority of a new treatment based on the three-arm trial design. However, these methods behave badly with small sample sizes in the three arms. This manuscript aims to develop some reliable small-sample methods to test three-arm non-inferiority. Methods: Saddlepoint approximation, exact and approximate unconditional, and bootstrap-resampling methods are developed to calculate p-values of the Wald-type, score and likelihood ratio tests. Simulation studies are conducted to evaluate their performance in terms of type I error rate and power. Results: Our empirical results show that the saddlepoint approximation method generally behaves better than the asymptotic method based on the Wald-type test statistic. For small sample sizes, approximate unconditional and bootstrap-resampling methods based on the score test statistic perform better in the sense that their corresponding type I error rates are generally closer to the prespecified nominal level than those of other test procedures. Conclusions: Both approximate unconditional and bootstrap-resampling test procedures based on the score test statistic are generally recommended for three-arm non-inferiority trials with binary outcomes.
引用
收藏
页数:9
相关论文
共 50 条