Computing highly accurate or exact P-values using importance sampling

Cited by: 6
Author
Lloyd, Chris J. [1]
Affiliation
[1] Univ Melbourne, Carlton, Vic 3053, Australia
Keywords
Bootstrap; Exact tests; Logistic regression; Models
DOI
10.1016/j.csda.2011.11.003
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Especially for discrete data, standard first-order P-values can suffer from poor accuracy, even for quite large sample sizes. Moreover, different test statistics can give practically different results. There are several approaches to computing P-values which do not suffer these defects, such as parametric bootstrap P-values or the partially maximised P-values of Berger and Boos (1994). Both these methods require computing the exact tail probability of the approximate P-value as a function of the nuisance parameter(s), known as the significance profile. For most practical problems, this is not computationally feasible. I develop an importance sampling approach to this problem. A major advantage is that significance can be simultaneously estimated at a grid of nuisance parameter values, without the need for smoothing away the simulation noise. The theory is fully developed for generalised linear models. The importance distribution is selected from the same generalised linear model family but with parameters biased towards an optimal point on the boundary of the tail-set. For logistic regression at least, standard guidelines for selecting the importance distribution can fail quite badly and a conceptually simple alternative algorithm for selecting these parameters is developed. This may have application to importance sampling more generally. © 2011 Elsevier B.V. All rights reserved.
Pages: 1784-1794
Number of pages: 11
Related papers
50 in total
  • [21] Numerical inversion methods for computing approximate p-Values
    Kawakatsu H.
    Computational Economics, 2005, 26 (3-4) : 103 - 116
  • [22] Numerical Inversion Methods for Computing Approximate p-Values
    Hiroyuki Kawakatsu
    Computational Economics, 2007, 29 (3-4) : 429 - 429
  • [23] Exact algorithms for computing p-values of statistics-linear combination of 3-nomial variables
    Beninela, Farid
    Grelaud, Gerard
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (02) : 737 - 749
  • [24] EXACT P-VALUES FOR DISCRETE MODELS OBTAINED BY ESTIMATION AND MAXIMIZATION
    Lloyd, Chris J.
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2008, 50 (04) : 329 - 345
  • [25] Calculation of exact p-values when SNPs are tested using multiple genetic models
    Talluri, Rajesh
    Wang, Jian
    Shete, Sanjay
    BMC GENETICS, 2014, 15
  • [26] Computing scan statistic p values using importance sampling, with applications to genetics and medical image analysis
    Naiman, DQ
    Priebe, CE
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2001, 10 (02) : 296 - 328
  • [27] Calculation of exact p-values when SNPs are tested using multiple genetic models
    Rajesh Talluri
    Jian Wang
    Sanjay Shete
    BMC Genetics, 15
  • [28] Estimating cross-validatory predictive p-values with integrated importance sampling for disease mapping models
    Li, Longhai
    Feng, Cindy X.
    Qiu, Shi
    STATISTICS IN MEDICINE, 2017, 36 (14) : 2220 - 2236
  • [29] Can p-values be meaningfully interpreted without random sampling?
    Hirschauer, Norbert
    Gruener, Sven
    Musshoff, Oliver
    Becker, Claudia
    Jantsch, Antje
    STATISTICS SURVEYS, 2020, 14 : 71 - 91
  • [30] Computing p-values in conditional independence models for a contingency table
    Masahiro Kuroda
    Hiroki Hashiguchi
    Shigakazu Nakagawa
    Computational Statistics, 2010, 25 : 57 - 70