An observational analysis of the trope "A p-value of <0.05 was considered statistically significant" and other cut-and-paste statistical methods

被引:10
|
作者
White, Nicole M. [1 ]
Balasubramaniam, Thirunavukarasu [2 ]
Nayak, Richi [2 ]
Barnett, Adrian G. [1 ]
机构
[1] Queensland Univ Technol, Australian Ctr Hlth Serv Innovat & Ctr Healthcare, Sch Publ Hlth & Social Work, Brisbane, Qld, Australia
[2] Queensland Univ Technol, Ctr Data Sci, Sch Comp Sci, Brisbane, Qld, Australia
来源
PLOS ONE | 2022年 / 17卷 / 03期
基金
英国医学研究理事会;
关键词
MEDICAL-RESEARCH;
D O I
10.1371/journal.pone.0264360
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Appropriate descriptions of statistical methods are essential for evaluating research quality and reproducibility. Despite continued efforts to improve reporting in publications, inadequate descriptions of statistical methods persist. At times, reading statistical methods sections can conjure feelings of deja vu, with content resembling cut-and-pasted or "boilerplate text" from already published work. Instances of boilerplate text suggest a mechanistic approach to statistical analysis, where the same default methods are being used and described using standardized text. To investigate the extent of this practice, we analyzed text extracted from published statistical methods sections from PLOS ONE and the Australian and New Zealand Clinical Trials Registry (ANZCTR). Topic modeling was applied to analyze data from 111,731 papers published in PLOS ONE and 9,523 studies registered with the ANZCTR. PLOS ONE topics emphasized definitions of statistical significance, software and descriptive statistics. One in three PLOS ONE papers contained at least 1 sentence that was a direct copy from another paper. 12,675 papers (11%) closely matched to the sentence "a p-value < 0.05 was considered statistically significant". Common topics across ANZCTR studies differentiated between study designs and analysis methods, with matching text found in approximately 3% of sections. Our findings quantify a serious problem affecting the reporting of statistical methods and shed light on perceptions about the communication of statistics as part of the scientific process. Results further emphasize the importance of rigorous statistical review to ensure that adequate descriptions of methods are prioritized over relatively minor details such as p-values and software when reporting research outcomes.
引用
收藏
页数:15
相关论文
共 2 条
  • [1] Is p-value &lt; 0.05 enough? A study on the statistical evaluation of classifiers
    Neumann, Nadine M.
    Plastino, Alexandre
    Pinto Junior, Jony A.
    Freitas, Alex A.
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2020, 36 (36):
  • [2] Changing the traditional p-value significance threshold to .005 does not decrease the number of statistically significant p-values in observational studies more than in randomised controlled trials
    Oztas, M.
    Dincer, Z. T.
    Sut, N.
    Yazici, H.
    [J]. CLINICAL AND EXPERIMENTAL RHEUMATOLOGY, 2021, 39 (03) : 696 - 696