Computing highly accurate or exact P-values using importance sampling

Cited by: 6

Authors:

Lloyd, Chris J. [1]

Affiliations:

[1] Univ Melbourne, Carlton, Vic 3053, Australia

Source:

COMPUTATIONAL STATISTICS & DATA ANALYSIS | 2012 / Vol. 56 / Issue 06

Keywords:

Bootstrap; Exact tests; Logistic regression; MODELS

DOI:

10.1016/j.csda.2011.11.003

Chinese Library Classification:

TP39 [Computer applications]

Discipline classification codes:

081203; 0835

Abstract:

Especially for discrete data, standard first order P-values can suffer from poor accuracy, even for quite large sample sizes. Moreover, different test statistics can give practically different results. There are several approaches to computing P-values which do not suffer these defects, such as parametric bootstrap P-values or the partially maximised P-values of Berger and Boos (1994). Both these methods require computing the exact tail probability of the approximate P-value as a function of the nuisance parameter/s, known as the significance profile. For most practical problems, this is not computationally feasible. I develop an importance sampling approach to this problem. A major advantage is that significance can be simultaneously estimated at a grid of nuisance parameter values, without the need for smoothing away the simulation noise. The theory is fully developed for generalised linear models. The importance distribution is selected from the same generalised linear model family but with parameters biased towards an optimal point on the boundary of the tail-set. For logistic regression at least, standard guidelines for selecting the importance distribution can fail quite badly and a conceptually simple alternative algorithm for selecting these parameters is developed. This may have application to importance sampling more generally. (c) 2011 Elsevier B.V. All rights reserved.
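
The abstract's central point, that one importance sample can estimate significance at a whole grid of nuisance parameter values, can be illustrated with a small sketch. This is not the paper's algorithm. It assumes a toy problem (two binomial samples, null hypothesis p1 = p2 = p, with p as the nuisance parameter), a pooled z statistic as the test statistic, and hand-picked importance parameters `q1`, `q2` biased towards the tail set. All function names, sample sizes, and the grid are hypothetical choices made for illustration.

```python
import math
import random

def z_stat(x1, n1, x2, n2):
    """Pooled two-sample z statistic for p1 - p2 (0 when the variance is 0)."""
    pooled = (x1 + x2) / (n1 + n2)
    var = pooled * (1 - pooled) * (1 / n1 + 1 / n2)
    return 0.0 if var == 0 else (x1 / n1 - x2 / n2) / math.sqrt(var)

def binom_logpmf(x, n, p):
    """Log of the Binomial(n, p) probability mass at x, for 0 < p < 1."""
    log_choose = math.lgamma(n + 1) - math.lgamma(x + 1) - math.lgamma(n - x + 1)
    return log_choose + x * math.log(p) + (n - x) * math.log(1 - p)

def exact_profile(t_obs, n1, n2, grid):
    """Exact P_p(T >= t_obs) for each p in grid, by full enumeration."""
    profile = []
    for p in grid:
        total = 0.0
        for x1 in range(n1 + 1):
            for x2 in range(n2 + 1):
                if z_stat(x1, n1, x2, n2) >= t_obs:
                    total += math.exp(binom_logpmf(x1, n1, p)
                                      + binom_logpmf(x2, n2, p))
        profile.append(total)
    return profile

def is_profile(t_obs, n1, n2, grid, q1, q2, draws, seed=0):
    """Importance sampling estimate of the same profile.

    Data are drawn once from Binomial(n1, q1) x Binomial(n2, q2); each draw
    that lands in the tail set is reweighted by the likelihood ratio f_p / g
    for every p in the grid, so all grid points share the same draws.
    """
    rng = random.Random(seed)
    sums = [0.0] * len(grid)
    for _ in range(draws):
        x1 = sum(rng.random() < q1 for _ in range(n1))
        x2 = sum(rng.random() < q2 for _ in range(n2))
        if z_stat(x1, n1, x2, n2) < t_obs:
            continue  # outside the tail set: contributes zero
        log_g = binom_logpmf(x1, n1, q1) + binom_logpmf(x2, n2, q2)
        for k, p in enumerate(grid):
            log_f = binom_logpmf(x1, n1, p) + binom_logpmf(x2, n2, p)
            sums[k] += math.exp(log_f - log_g)
    return [s / draws for s in sums]

# Hypothetical data: 14/20 successes versus 6/20 successes.
n1, n2 = 20, 20
t_obs = z_stat(14, n1, 6, n2)
grid = [0.4, 0.5, 0.6]
exact = exact_profile(t_obs, n1, n2, grid)
estimate = is_profile(t_obs, n1, n2, grid, q1=0.7, q2=0.3, draws=50000)
for p, e, a in zip(grid, exact, estimate):
    print(f"p={p:.1f}  exact={e:.5f}  importance-sampling={a:.5f}")
```

Because every grid point reuses the same draws, the estimated profile varies smoothly in p, which is the property the abstract highlights. The choice of `q1` and `q2` here is ad hoc; the abstract notes that standard guidelines for choosing the importance distribution can fail badly for logistic regression, so this choice should not be read as a recommendation.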


Pages: 1784-1794

Page count: 11
