False Discovery in A/B Testing

Cited by: 8
Authors
Berman, Ron [1]
Van den Bulte, Christophe [1]
Affiliations
[1] Univ Penn, Wharton Sch, Marketing, Philadelphia, PA 19104 USA
Keywords
statistics; design of experiments; decision analysis; inference; A/B testing; false discovery rate; STATISTICAL SIGNIFICANCE; POWER CALCULATIONS; DESIGN;
DOI
10.1287/mnsc.2021.4207
Chinese Library Classification (CLC) number
C93 [Management science];
Discipline classification codes
12 ; 1201 ; 1202 ; 120202 ;
Abstract
We investigate what fraction of all significant results in website A/B testing is actually null effects, that is, the false discovery rate (FDR). Our data consist of 4,964 effects from 2,766 experiments conducted on a commercial A/B testing platform. Using three different methods, we find that the FDR ranges between 28% and 37% for tests conducted at 10% significance and between 18% and 25% for tests at 5% significance (two-sided). These high FDRs stem mostly from the high fraction of true null effects, about 70%, rather than from low power. Using our estimates, we also assess the potential of various A/B test designs to reduce the FDR. The two main implications are that decision makers should expect one in five interventions achieving significance at 5% confidence to be ineffective when deployed in the field and that analysts should consider using two-stage designs with multiple variations rather than basic A/B tests.
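The abstract's point that the FDR is driven by the share of true nulls more than by power can be illustrated with the textbook relation FDR = pi0 * alpha / (pi0 * alpha + (1 - pi0) * power). The sketch below is not the authors' estimation method. The function name and the power values are hypothetical and chosen only for illustration. Only pi0 = 0.70 and the two significance levels come from the abstract.

```python
def false_discovery_rate(pi0: float, alpha: float, power: float) -> float:
    """Expected share of significant results that are true nulls.

    pi0   : fraction of tested effects that are truly null
    alpha : significance level (probability a true null is rejected)
    power : probability a true non-null effect is detected
    """
    false_positives = pi0 * alpha           # nulls that reach significance
    true_positives = (1.0 - pi0) * power    # real effects that reach significance
    return false_positives / (false_positives + true_positives)


if __name__ == "__main__":
    pi0 = 0.70  # approximate share of true nulls reported in the abstract
    for alpha in (0.05, 0.10):
        for power in (0.4, 0.6, 0.8):  # hypothetical power values, not from the paper
            fdr = false_discovery_rate(pi0, alpha, power)
            print(f"alpha={alpha:.2f} power={power:.1f} FDR={fdr:.3f}")
```

Under this simple model, raising power lowers the FDR only gradually while pi0 stays near 0.70, whereas a lower pi0 reduces it sharply at any fixed power.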
Pages: 6762 - 6782
Page count: 21
Related papers
50 in total
  • [31] Optimal Control of Directional False Discovery Rates in Large-Scale Testing
    Tang, Guozhu
    Kang, Yicheng
    Xiang, Dongdong
    STATISTICS IN MEDICINE, 2025, 44 (05)
  • [32] Estimation of false discovery rates in multiple testing: Application to gene microarray data
    Tsai, CA
    Hsueh, HM
    Chen, JJ
    BIOMETRICS, 2003, 59 (04) : 1071 - 1081
  • [33] Weighted False Discovery Rate Control in Large-Scale Multiple Testing
    Basu, Pallavi
    Cai, T. Tony
    Das, Kiranmoy
    Sun, Wenguang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (523) : 1172 - 1183
  • [34] The Problem of False Discovery
    Foster, Kenneth R.
    Skufca, Joseph
    IEEE PULSE, 2016, 7 (02) : 37 - 40
  • [35] Exact Integral Formulas for False Discovery Rate and the Variance of False Discovery Proportion
    Sadygov, Rovshan G.
    Zhu, Justin X.
    Deberneh, Henock M.
    JOURNAL OF PROTEOME RESEARCH, 2024, 23 (06) : 2298 - 2305
  • [36] Testing for heterogeneous treatment effects in experimental data: false discovery risks and correction procedures
    Fink, Guenther
    McConnell, Margaret
    Vollmer, Sebastian
    JOURNAL OF DEVELOPMENT EFFECTIVENESS, 2014, 6 (01) : 44 - 57
  • [37] FarmTest: Factor-Adjusted Robust Multiple Testing With Approximate False Discovery Control
    Fan, Jianqing
    Ke, Yuan
    Sun, Qiang
    Zhou, Wen-Xin
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (528) : 1880 - 1893
  • [38] Optimal False Discovery Rate Control for Large Scale Multiple Testing with Auxiliary Information
    Cao, Hongyuan
    Chen, Jun
    Zhang, Xianyang
    ANNALS OF STATISTICS, 2022, 50 (02) : 807 - 857
  • [39] Bon-EV: an improved multiple testing procedure for controlling false discovery rates
    Li, Dongmei
    Xie, Zidian
    Zand, Martin
    Fogg, Thomas
    Dye, Timothy
    BMC BIOINFORMATICS, 2017, 18
  • [40] Joint testing and false discovery rate control in high-dimensional multivariate regression
    Xia, Yin
    Cai, T. Tony
    Li, Hongzhe
    BIOMETRIKA, 2018, 105 (02) : 249 - 269