Evaluating the effectiveness of interventions: A comprehensive scoring system versus testing for statistical significance

被引:0
|
作者
Sellahewa, Rav [1 ,4 ]
Webster, Hannah [1 ]
Rolnik, Daniel L. [1 ,2 ]
Mol, Ben W. [1 ,2 ,3 ]
机构
[1] Monash Univ, Sch Clin Sci, 246 Clayton Rd, Clayton, Vic 3168, Australia
[2] Monash Univ, Dept Obstet & Gynaecol, 246 Clayton Rd, Clayton, Vic 3168, Australia
[3] Univ Aberdeen, Aberdeen Ctr Womens Hlth Res, Sch Med, Aberdeen, Scotland
[4] Royal Melbourne Hosp, 300 Grattan St, Parkville, Vic 3050, Australia
关键词
P-value; Statistical significance; Research data; Data interpretation; Obstetrics; Gynaecology; VAGINAL PROGESTERONE; PRETERM BIRTH; METAANALYSIS; OUTCOMES;
D O I
10.1016/j.ejogrb.2023.03.044
中图分类号
R71 [妇产科学];
学科分类号
100211 ;
摘要
Background: Medical practice relies on reliable research observations. Whether such observations are true is traditionally tested by hypotheses and expressed with P-values. A strict P-value driven interpretation could potentially deny benefits of treatment. Objective: A strict P-value driven interpretation was compared to a context driven causality interpretation using the Bradford Hill Criteria to determine the clinical benefit of an intervention. Methods: We searched all randomised controlled trials in Women's Health, published in five leading medical journals since January 2014. These were then scored using the 10 Bradford Hill Criteria for causation. Each component of the Bradford Hill Criteria was given a score from zero to three, resulting in a total score between zero and 30 for each article, converted into a decimal value. These scores were then compared to conclusions based on the p-value and conclusions drawn by the authors. For results discordant between Bradford Hill Criteria and P-values, we compared results with meta-analysis. Results: We found 68 articles for extraction of data. Of these, 49 (72%) showed concordance between Bradford Hill criteria and p-value driven interpretation, 25 (37%) of the articles reporting effectiveness (true positive), and 24 (35%) reporting no effectiveness (true negative). In eight (12%) articles, Bradford Hill criteria scores suggested effetiveness while p-values driven interpretation did not. Seven of those eight articles had p-values between 0.05 and 0.10. Out of these eight articles, six had a subsequent meta-analysis' published on the intervention being studied. All six meta-analysis demonstrated effetiveness of the intervention. Conclusions: In the interpretation of clinical trials, a context driven interpretation of causality may be more clinically informative than a strict P-value driven approach.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 31 条
  • [1] A COMPREHENSIVE SCORING SYSTEM FOR EVALUATING NOONAN SYNDROME
    DUNCAN, WJ
    FOWLER, RS
    FARKAS, LG
    ROSS, RB
    WRIGHT, AW
    BLOOM, KR
    HUOT, DJ
    SONDHEIMER, HM
    ROWE, RD
    [J]. AMERICAN JOURNAL OF MEDICAL GENETICS, 1981, 10 (01): : 37 - 50
  • [2] Statistical Significance Testing and Clinical Effectiveness Studies
    Wise, Edward A.
    [J]. PSYCHOTHERAPY, 2011, 48 (03) : 225 - 228
  • [3] Expert system effectiveness testing and evaluating techniques
    Zhou, T
    Zhou, XF
    Ma, YX
    [J]. NEW TECHNOLOGIES ON COMPUTER SOFTWARE, 1997, : 166 - 170
  • [4] NPS: scoring and evaluating the statistical significance of peptidic natural product-spectrum matches
    Tagirdzhanov, Azat M.
    Shlemov, Alexander
    Gurevich, Alexey
    [J]. BIOINFORMATICS, 2019, 35 (14) : I315 - I323
  • [5] Evaluating the effectiveness of a comprehensive staff training package for behavioral interventions for children with autism
    Weinkauf, Sara M.
    Zeug, Nicole M.
    Anderson, Claire T.
    Ala'i-Rosales, Shahla
    [J]. RESEARCH IN AUTISM SPECTRUM DISORDERS, 2011, 5 (02) : 864 - 871
  • [6] Statistical significance versus biological relevance: A case study in neurobehavioral testing
    Trecki, Jordan
    [J]. NEUROTOXICOLOGY AND TERATOLOGY, 2012, 34 (03) : 380 - 380
  • [7] EVALUATING SPERM MOTILITY - A COMPARISON OF THE ROCHESTER MOTILITY SCORING SYSTEM VERSUS VIDEOMICROGRAPHY
    JENKS, JP
    COSENTINO, MJ
    COCKETT, ATK
    [J]. FERTILITY AND STERILITY, 1982, 38 (06) : 756 - 759
  • [8] Evaluating the Harms of Cancer Testing-A Systematic Review of the Adverse Psychological Correlates of Testing for Cancer and the Effectiveness of Interventions to Mitigate These
    Kwong, Fong Lien
    Davenport, Clare
    Sundar, Sudha
    [J]. CANCERS, 2023, 15 (13)
  • [9] How Significant Is Statistical Significance? Observations on the Independence of P Values and Importance in Effectiveness Studies of Educational Interventions
    Olson, Curtis A.
    Richardson, Conor A.
    [J]. JOURNAL OF CONTINUING EDUCATION IN THE HEALTH PROFESSIONS, 2014, 34 (03) : 151 - 154
  • [10] Analytical basis for evaluating the effect of unplanned interventions on the effectiveness of a human-robot system
    Shah, Julie A.
    Saleh, Joseph H.
    Hoffman, Jeffrey A.
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2008, 93 (08) : 1280 - 1286