Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

被引：1630

作者：

Greenland, Sander ^{[1
,2
]}

Senn, Stephen J. ^{[3
]}

Rothman, Kenneth J. ^{[4
]}

Carlin, John B. ^{[5
]}

Poole, Charles ^{[6
]}

Goodman, Steven N. ^{[7
,8
]}

Altman, Douglas G. ^{[9
]}

机构：

[1] Univ Calif Los Angeles, Dept Epidemiol, Los Angeles, CA USA

[2] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA USA

[3] Luxembourg Inst Hlth, Competence Ctr Methodol & Stat, Strassen, Luxembourg

[4] Res Triangle Inst, RTI Hlth Solut, POB 12194, Res Triangle Pk, NC 27709 USA

[5] Univ Melbourne, Sch Populat Hlth, Murdoch Childrens Res Inst, Clin Epidemiol & Biostat Unit, Melbourne, Vic, Australia

[6] Univ N Carolina, Gillings Sch Global Publ Hlth, Dept Epidemiol, Chapel Hill, NC USA

[7] Stanford Univ, Sch Med, Meta Res Innovat Ctr, Dept Med, Stanford, CA 94305 USA

[8] Stanford Univ, Sch Med, Dept Hlth Res & Policy, Stanford, CA 94305 USA

[9] Univ Oxford, Ctr Stat Med, Nuffield Dept Orthopaed Rheumatol & Musculoskelet, Oxford, England

来源：

EUROPEAN JOURNAL OF EPIDEMIOLOGY | 2016年 / 31卷 / 04期

关键词：

Confidence intervals; Hypothesis testing; Null testing; P value; Power; Significance tests; Statistical testing; NULL-HYPOTHESIS; CLINICAL-TRIALS; SAMPLE-SIZE; INFERENCE; SCIENCE; EPIDEMIOLOGY; REPLICATION; DISCLOSURE; KNOWLEDGE; CRITERIA;

D O I：

10.1007/s10654-016-0149-3

中图分类号：

R1 [预防医学、卫生学];

学科分类号：

1004 ; 120402 ;

摘要：

Misinterpretation and abuse of statistical tests, confidence intervals, and statistical power have been decried for decades, yet remain rampant. A key problem is that there are no interpretations of these concepts that are at once simple, intuitive, correct, and foolproof. Instead, correct use and interpretation of these statistics requires an attention to detail which seems to tax the patience of working scientists. This high cognitive demand has led to an epidemic of shortcut definitions and interpretations that are simply wrong, sometimes disastrously so-and yet these misinterpretations dominate much of the scientific literature. In light of this problem, we provide definitions and a discussion of basic statistics that are more general and critical than typically found in traditional introductory expositions. Our goal is to provide a resource for instructors, researchers, and consumers of statistics whose knowledge of statistical theory and technique may be limited but who wish to avoid and spot misinterpretations. We emphasize how violation of often unstated analysis protocols (such as selecting analyses for presentation based on the P values they produce) can lead to small P values even if the declared test hypothesis is correct, and can lead to large P values even if that hypothesis is incorrect. We then provide an explanatory list of 25 misinterpretations of P values, confidence intervals, and power. We conclude with guidelines for improving statistical interpretation and reporting.

引用

下载

页码：337 / 350

页数：14

共 50 条

[31] ON P-VALUES AND CONFIDENCE-INTERVALS (WHY CANT WE P WITH MORE CONFIDENCE)
HARRIS, EK
CLINICAL CHEMISTRY, 1993, 39 (06) : 927 - 928
[32] Recurring controversies about P values and confidence intervals revisited
Spanos, Aris
ECOLOGY, 2014, 95 (03) : 645 - 651
[33] The importance of estimates and confidence intervals rather than P values
Ely, M
SOCIOLOGY-THE JOURNAL OF THE BRITISH SOCIOLOGICAL ASSOCIATION, 1999, 33 (01): : 185 - 190
[34] Confidence intervals and p-values in clinical decision making
Akobeng, Anthony K.
ACTA PAEDIATRICA, 2008, 97 (08) : 1004 - 1007
[35] More Confidence Intervals and Fewer p Values A Positive Trend?
Tijssen, Jan G. P.
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2021, 77 (12) : 1562 - 1563
[36] Confidence intervals and p-values for delivery to the end user
Newson, Roger
STATA JOURNAL, 2003, 3 (03): : 245 - 269
[37] Reporting Results of Orthopaedic Research: Confidence Intervals and p Values
Porcher, Raphael
CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2009, 467 (10) : 2736 - 2737
[38] Replacing P values with confidence intervals may not achieve anything
Park, Seo Young
JOURNAL OF THORACIC AND CARDIOVASCULAR SURGERY, 2021, 161 (04): : 1379 - 1380
[39] Effect size, confidence intervals and statistical power in psychological research
Tellez, Arnoldo
Garcia, Cirilo H.
Corral-Verdugo, Victor
PSYCHOLOGY IN RUSSIA-STATE OF THE ART, 2015, 8 (03): : 27 - 46
[40] Confidence intervals and statistical significance
Sedgwick, Philip
BRITISH MEDICAL JOURNAL, 2012, 344

← 1 2 3 4 5 →