THE STRENGTH OF STATISTICAL EVIDENCE FOR COMPOSITE HYPOTHESES: INFERENCE TO THE BEST EXPLANATION

被引:28
|
作者
Bickel, David R. [1 ]
机构
[1] Univ Ottawa, Dept Math & Stat, Ottawa Inst Syst Biol, Dept Biochem Microbiol & Immunol, Ottawa, ON K1H 8M5, Canada
关键词
Bayes factor; Bayesian model selection; coherence; direct likelihood; hypothesis testing; evidential support; foundations of statistics; likelihoodism; model selection; strength of statistical evidence; FALSE DISCOVERY RATE; DIFFERENTIAL GENE-EXPRESSION; PROBABILITY-MEASURES; MEMBERSHIP FUNCTIONS; LIKELIHOOD METHODS; MULTIPLE COMPARISONS; FUZZY-SETS; POSTERIOR; CONFIDENCE; MODELS;
D O I
10.5705/ss.2009.125
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A general function to quantify the weight of evidence in a sample of data for one hypothesis over another is derived from the law of likelihood and from a statistical formalization of inference to the best explanation. For a fixed parameter of interest, the resulting weight of evidence that favors one composite hypothesis over another is the likelihood ratio using the parameter value consistent with each hypothesis that maximizes the likelihood function over the parameter of interest. Since the weight of evidence is generally only known up to a nuisance parameter, it is approximated by replacing the likelihood function with a reduced likelihood function on the interest parameter space. The resulting weight of evidence has both the interpretability of the Bayes factor and the objectivity of the p-value. In addition, the weight of evidence is coherent in the sense that it cannot support a hypothesis over any hypothesis that it entails. Further, when comparing the hypothesis that the parameter lies outside a non-trivial interval to the hypothesis that it lies within the interval, the proposed method of weighing evidence almost always asymptotically favors the correct hypothesis under mild regularity conditions. Even at small sample sizes, replacing a simple hypothesis with an interval hypothesis substantially reduces the probability of observing misleading evidence. Sensitivity of the weight of evidence to hypotheses' specification is mitigated by making them imprecise. The methodology is illustrated in the multiple comparisons setting of gene expression microarray data, and issues with simultaneous inference and multiplicity are addressed.
引用
收藏
页码:1147 / 1198
页数:52
相关论文
共 50 条