Application of the Reverse Fragility Index to Statistically Nonsignificant Randomized Clinical Trial Results

被引：64

作者：

Khan, Muhammad Shahzeb ^{[2
]}

Fonarow, Gregg C. ^{[3
]}

Friede, Tim ^{[4
]}

Lateef, Noman ^{[5
]}

Khan, Safi U. ^{[6
]}

Anker, Stefan D. ^{[7
,8
,9
]}

Harrell, Frank E., Jr. ^{[10
]}

Butler, Javed ^{[1
]}

机构：

[1] Univ Mississippi, Dept Med, Med Ctr, 2500 N State St, Jackson, MS 39216 USA

[2] Cook Cty Hlth Sci, Dept Med, Chicago, IL USA

[3] Ronald Reagan Univ Calif Los Angeles, Med Ctr, Div Cardiol, Los Angeles, CA USA

[4] Univ Med Ctr Goettingen, Dept Med Stat, Gottingen, Germany

[5] Creighton Univ, Dept Med, Omaha, NE USA

[6] West Virginia Univ, Dept Med, Morgantown, WV 26506 USA

[7] German Ctr Cardiovasc Res, Partner Site Berlin, Dept Cardiol, Ctr Regenerat Therapies, Berlin, Germany

[8] German Ctr Cardiovasc Res, Partner Site Berlin, Berlin Inst Hlth, Ctr Regenerat Therapies, Berlin, Germany

[9] Charite Univ Med Berlin, Berlin, Germany

[10] Vanderbilt Univ, Dept Biostat, Sch Med, Nashville, TN USA

来源：

JAMA NETWORK OPEN | 2020年 / 3卷 / 08期

关键词：

ARTERY-BYPASS SURGERY; P-VALUE; SAMPLE-SIZE; PART;

D O I：

10.1001/jamanetworkopen.2020.12469

中图分类号：

R5 [内科学];

学科分类号：

1002 ; 100201 ;

摘要：

Question In clinical trials with statistically nonsignificant primary end point results, what is the minimum number of events that must be changed to move the result from nonsignificant to statistically significant (ie, the reverse fragility index)? Findings In this cross-sectional study of 167 randomized clinical trials with statistically nonsignificant results, the median reverse fragility index at a threshold of P = .05 was 8. A median of 8 events were required to change to enable a nonsignificant primary end point to become statistically significant. Meaning Results of this cross-sectional study suggest that the reverse fragility index, along with effect sizes and associated 95% CIs, may provide a useful context for interpreting null clinical trial results. Importance Interpreting randomized clinical trials (RCTs) and their clinical relevance is challenging when P values are either marginally above or below the P = .05 threshold. Objective To use the concept of reverse fragility index (RFI) to provide a measure of confidence in the neutrality of RCT results when assessed from the clinical perspective. Design, Setting, and Participants In this cross-sectional study, a MEDLINE search was conducted for RCTs published from January 1, 2013, to December 31, 2018, in JAMA, the New England Journal of Medicine (NEJM), and The Lancet. Eligible studies were phase 3 and 4 trials with 1:1 randomization and statistically nonsignificant binary primary end points. Data analysis was performed from August 1, 2019, to August 31, 2019. Exposures Single vs multicenter enrollment, total number of events, private vs government funding, placebo vs active control, and time to event vs frequency data. Main Outcomes and Measures The primary outcome was the median RFI with interquartile range (IQR) at the P = .05 threshold. Secondary outcomes were the number of RCTs in which the number of participants lost to follow-up was greater than the RFI; the median RFI with IQR at different P value thresholds; the median reverse fragility quotient with IQR; and the correlation between sample sizes, number of events, and P values of the RCT and RFI. Results Of the 167 RCTs included, 76 (46%) were published in the NEJM, 50 (30%) in JAMA, and 41 (24%) in The Lancet. The median (IQR) sample size was 970 (470-3427) participants, and the median (IQR) number of events was 251 (105-570). The median (IQR) RFI at the P = .05 threshold was 8 (5-13). Fifty-seven RCTs (34%) had an RFI of 5 or lower, and in 68 RCTs (41%) the number of participants lost to follow-up was greater than the RFI. Trials with P values ranging from P = .06 to P = .10 had a median (IQR) RFI of 3 (2-4). When compared, median (IQR) RFIs were not statistically significant for single-center vs multicenter enrollment (5 [4-13] vs 8 [5-13]; P = .41), private vs government-funded studies (9 [5-13] vs 8 [5-13]; P = .34), and time-to-event primary end points vs frequency data (9 [5-14] vs 7 [4-13]; P = .43). The median (IQR) RFI at the P = .01 threshold was 12 (7-19) and at the P = .005 threshold was 14 (9-21). Conclusions and Relevance This cross-sectional study found that a relatively small number of events (median of 8) had to change to move the primary end point of an RCT from nonsignificant to statistically significant. These findings emphasize the nuance required when interpreting trial results that did not meet prespecified significance thresholds. This cross-sectional study describes application of the reverse fragility index in establishing the clinical relevance of published clinical trials with null results.

引用

页数：12

共 50 条

[11] The application of the reverse fragility index to randomized controlled trials in hand surgery
Ahdoot, Rodney
Pottepalem, Bhuvan
Benitez, Trista M.
Wang, Chien-Wei
Chung, Kevin C.
JOURNAL OF HAND SURGERY-EUROPEAN VOLUME, 2025,
[12] Reporting and Interpretation of Randomized Controlled Trials With Statistically Nonsignificant Results for Primary Outcomes
Boutron, Isabelle
Dutton, Susan
Ravaud, Philippe
Altman, Douglas G.
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2010, 303 (20): : 2058 - 2064
[13] The fragility of statistically significant results in otolaryngology randomized trials
Skinner, Mason
Tritz, Daniel
Farahani, Clayton
Ross, Andrew
Hamilton, Tom
Vassar, Matt
AMERICAN JOURNAL OF OTOLARYNGOLOGY, 2019, 40 (01) : 61 - 66
[14] Level and Prevalence of Spin in Published Cardiovascular Randomized Clinical Trial Reports With Statistically Nonsignificant Primary Outcomes A Systematic Review
Khan, Muhammad Shahzeb
Lateef, Noman
Siddiqi, Tariq Jamal
Rehman, Karim Abdur
Alnaimat, Saed
Khan, Safi U.
Riaz, Haris
Murad, M. Hassan
Mandrola, John
Doukky, Rami
Krasuski, Richard A.
JAMA NETWORK OPEN, 2019, 2 (05)
[15] THE FRAGILITY OF STATISTICALLY SIGNIFICANT RESULTS FROM RANDOMIZED TRIALS IN UROLOGY
Narayan, Vikram
Gandhi, Shreyas
Chrouser, Kristin
Evaniew, Nathan
Dahm, Philipp
JOURNAL OF UROLOGY, 2017, 197 (04): : E1131 - E1132
[16] Nonsignificant Clinical Trial Results Support Null Hypotheses
不详
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2023, 114 (03) : 497 - 497
[17] Fragility index and fragility quotient in randomized clinical trials
Fernandes Garcia, Marcos Vinicius
Ferreira, Juliana Carvalho
Caruso, Pedro
JORNAL BRASILEIRO DE PNEUMOLOGIA, 2023, 49 (01)
[18] Fragility Index and Fragility Quotient in Statistically Significant Randomized Controlled Trials in Plastic Breast Surgery
Skorochod, Ron
Gronovich, Yoav
PLASTIC AND RECONSTRUCTIVE SURGERY-GLOBAL OPEN, 2024, 12 (06)
[19] How Fragile Are the Results of a Trial? The Fragility Index
Dettori, Joseph R.
Norvell, Daniel C.
GLOBAL SPINE JOURNAL, 2020, 10 (07) : 940 - 942
[20] The statistical significance of randomized controlled trial results is frequently fragile: a case for a Fragility Index
Walsh, Michael
Srinathan, Sadeesh K.
McAuley, Daniel F.
Mrkobrada, Marko
Levine, Oren
Ribic, Christine
Molnar, Amber O.
Dattani, Neil D.
Burke, Andrew
Guyatt, Gordon
Thabane, Lehana
Walter, Stephen D.
Pogue, Janice
Devereaux, P. J.
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2014, 67 (06) : 622 - 628

← 1 2 3 4 5 →