False Discovery in A/B Testing

Cited by: 8
Authors
Berman, Ron [1]
Van den Bulte, Christophe [1]
Affiliations
[1] Univ Penn, Wharton Sch, Marketing, Philadelphia, PA 19104 USA
Keywords
statistics; design of experiments; decision analysis; inference; A/B testing; false discovery rate; STATISTICAL SIGNIFICANCE; POWER CALCULATIONS; DESIGN
DOI
10.1287/mnsc.2021.4207
Chinese Library Classification number
C93 [Management Science]
Discipline classification codes
12; 1201; 1202; 120202
Abstract
We investigate what fraction of all significant results in website A/B testing is actually null effects (i.e., the false discovery rate (FDR)). Our data consist of 4,964 effects from 2,766 experiments conducted on a commercial A/B testing platform. Using three different methods, we find that the FDR ranges between 28% and 37% for tests conducted at 10% significance and between 18% and 25% for tests at 5% significance (two sided). These high FDRs stem mostly from the high fraction of true null effects, about 70%, rather than from low power. Using our estimates, we also assess the potential of various A/B test designs to reduce the FDR. The two main implications are that decision makers should expect one in five interventions achieving significance at 5% confidence to be ineffective when deployed in the field and that analysts should consider using two-stage designs with multiple variations rather than basic A/B tests.
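To see why a high share of true nulls can drive the FDR even when power is adequate, the textbook identity FDR = π0·α / (π0·α + (1 − π0)·power) can be evaluated directly. The sketch below is illustrative only and is not one of the three estimation methods used in the paper: the function name is hypothetical, the power values are assumptions chosen for illustration (the abstract does not report power), and only π0 ≈ 0.70 and α = 0.05 are taken from the abstract.

```python
def false_discovery_rate(pi0: float, alpha: float, power: float) -> float:
    """Expected share of significant results that are true nulls.

    pi0   : fraction of tested effects that are truly null
    alpha : significance level of each test
    power : probability that a true non-null effect reaches significance
            (an assumed input here; not reported in the abstract)
    """
    if not (0.0 <= pi0 <= 1.0):
        raise ValueError("pi0 must lie in [0, 1]")
    if not (0.0 < alpha < 1.0):
        raise ValueError("alpha must lie in (0, 1)")
    if not (0.0 < power <= 1.0):
        raise ValueError("power must lie in (0, 1]")

    false_positives = pi0 * alpha          # nulls that reach significance
    true_positives = (1.0 - pi0) * power   # real effects that reach significance
    return false_positives / (false_positives + true_positives)


if __name__ == "__main__":
    # pi0 and alpha follow the abstract; the power grid is hypothetical.
    for assumed_power in (0.3, 0.5, 0.8):
        fdr = false_discovery_rate(pi0=0.70, alpha=0.05, power=assumed_power)
        print(f"assumed power={assumed_power:.1f} -> FDR={fdr:.3f}")
```

Under these assumptions, lowering π0 reduces the FDR far more than raising power does, which is consistent with the abstract's attribution of the high FDR mainly to the fraction of true nulls.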
Pages: 6762-6782
Number of pages: 21