Computing highly accurate or exact P-values using importance sampling

被引:6
|
作者
Lloyd, Chris J. [1 ]
机构
[1] Univ Melbourne, Carlton, Vic 3053, Australia
关键词
Bootstrap; Exact tests; Logistic regression; MODELS;
D O I
10.1016/j.csda.2011.11.003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Especially for discrete data, standard first order P-values can suffer from poor accuracy, even for quite large sample sizes. Moreover, different test statistics can give practically different results. There are several approaches to computing P-values which do not suffer these defects, such as parametric bootstrap P-values or the partially maximised P-values of Berger and Boos (1994). Both these methods require computing the exact tail probability of the approximate P-value as a function of the nuisance parameter/s, known as the significance profile. For most practical problems, this is not computationally feasible. I develop an importance sampling approach to this problem. A major advantage is that significance can be simultaneously estimated at a grid of nuisance parameter values, without the need for smoothing away the simulation noise. The theory is fully developed for generalised linear models. The importance distribution is selected from the same generalised linear model family but with parameters biased towards an optimal point on the boundary of the tail-set. For logistic regression at least, standard guidelines for selecting the importance distribution can fail quite badly and a conceptually simple alternative algorithm for selecting these parameters is developed. This may have application to importance sampling more generally. (c) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1784 / 1794
页数:11
相关论文
共 50 条
  • [2] Computing exact P-values for community detection
    He, Zengyou
    Liang, Hao
    Chen, Zheng
    Zhao, Can
    Liu, Yan
    DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (03) : 833 - 869
  • [3] Computing exact P-values for DNA motifs
    Zhang, Jing
    Jiang, Bo
    Li, Ming
    Tromp, John
    Zhang, Xuegong
    Zhang, Michael Q.
    BIOINFORMATICS, 2007, 23 (05) : 531 - 537
  • [4] Computing exact P-values for community detection
    Zengyou He
    Hao Liang
    Zheng Chen
    Can Zhao
    Yan Liu
    Data Mining and Knowledge Discovery, 2020, 34 : 833 - 869
  • [5] Computing exact permutation p-values for association rules
    Wu, Jun
    He, Zengyou
    Gu, Feiyang
    Liu, Xiaoqing
    Zhou, Jianyu
    Yang, Can
    INFORMATION SCIENCES, 2016, 346 : 146 - 162
  • [6] P-VALUES EXACT
    JAGDIS, F
    ANNALS OF INTERNAL MEDICINE, 1986, 105 (04) : 641 - 642
  • [7] Importance sampling for spatial scan analysis:: computing scan statistic p-values for marked point processes
    Priebe, CE
    Naiman, DQ
    Cope, LM
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2001, 35 (04) : 475 - 485
  • [8] Exact p-Values for Network Interference
    Athey, Susan
    Eckles, Dean
    Imbens, Guido W.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (521) : 230 - 240
  • [9] Studentization and deriving accurate p-values
    Fraser, D. A. S.
    Rousseau, Judith
    BIOMETRIKA, 2008, 95 (01) : 1 - 16
  • [10] Computing highly accurate confidence limits from discrete data using importance sampling
    Chris J. Lloyd
    Degui Li
    Statistics and Computing, 2014, 24 : 663 - 673