Examining publication bias-a simulation-based evaluation of statistical tests on publication bias

被引:15
|
作者
Schneck, Andreas [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Dept Sociol, Munich, Germany
来源
PEERJ | 2017年 / 5卷
关键词
Statistics; Publication bias; Test for excess significance; Caliper test; Monte carlo simulation; p-uniform; Egger's test; FAT; CONFIRMATIONAL RESPONSE BIAS; RISK-FACTORS; P-CURVE; METAANALYSIS; PREVALENCE; POWER; REPLICATION; PSYCHOLOGY; EXCESS;
D O I
10.7717/peerj.4115
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background. Publication bias is a form of scientific misconduct. It threatens the validity of research results and the credibility of science. Although several tests on publication bias exist, no in-depth evaluations are available that examine which test performs best for different research settings. Methods. Four tests on publication bias, Egger's test (FAT), p-uniform, the test of excess significance (TES), as well as the caliper test, were evaluated in a Monte Carlo simulation. Two different types of publication bias and its degree (0%, 50%, 100%) were simulated. The type of publication bias was defined either as file-drawer, meaning the repeated analysis of new datasets, or p-hacking, meaning the inclusion of covariates in order to obtain a significant result. In addition, the underlying effect (beta = 0, 0.5, 1, 1.5), effect heterogeneity, the number of observations in the simulated primary studies (N = 100, 500), and the number of observations for the publication bias tests (K = 100, 1,000) were varied. Results. All tests evaluated were able to identify publication bias both in the filed raw er and p-hacking condition. The false positive rates were, with the exception of the 15%- and 20%-caliper test, unbiased. The FAT had the largest statistical power in the file-drawer conditions, whereas under p-hacking the TES was, except under effect heterogeneity, slightly better. The CTs were, however, inferior to the other tests under effect homogeneity and had a decent statistical power only in conditions with 1,000 primary studies. Discussion. The FAT is recommended as a test for publication bias in standard meta-analyses with no or only small effect heterogeneity. If two-sided publication bias is suspected as well as under p-hacking the TES is the first alternative to the FAT. The 5%-caliper test is recommended under conditions of effect heterogeneity and a large number of primary studies, which may be found if publication bias is examined in a discipline-wide setting when primary studies cover different research problems.
引用
收藏
页数:21
相关论文
共 50 条