Effect sizes can be misleading: is it time to change the way we measure change?

Cited by: 41
Authors
Hobart, Jeremy C. [1, 2]
Cano, Stefan J. [2]
Thompson, Alan J. [2]
Affiliations
[1] Peninsula Coll Med & Dent, Dept Clin Neurosci, Neurol Outcome Measures Unit, Plymouth PL6 8BX, Devon, England
[2] UCL Inst Neurol, London, England
Source
JOURNAL OF NEUROLOGY NEUROSURGERY AND PSYCHIATRY | 2010 / Volume 81 / Issue 09
Funding
UK Medical Research Council;
Keywords
FUNCTIONAL INDEPENDENCE MEASURE; RATING-SCALE RESPONSIVENESS; HEALTH-STATUS INSTRUMENTS; BARTHEL INDEX; INPATIENT REHABILITATION; MULTIPLE-SCLEROSIS; DISABILITY; IMPACT; INJURY; STROKE;
DOI
10.1136/jnnp.2009.201392
Chinese Library Classification number
R74 [Neurology and Psychiatry];
Subject classification code
Abstract
Objectives: Previous comparisons of the ability of the Barthel Index (BI) and the Functional Independence Measure motor scale (FIMm) to detect change have implied that the two scales are equally responsive when examined using traditional effect size statistics. Clinically, this is counterintuitive, as the FIMm has greater potential to detect change than the BI, and it raises concerns about the validity of effect size statistics as indicators of rating scale responsiveness. To examine these concerns, this study applied a sophisticated psychometric analysis, Rasch measurement, to BI and FIMm data.
Methods: BI and FIMm data were examined from 976 people at a single neurorehabilitation unit. Rasch analysis was used to compare the responsiveness of the BI and FIMm at the group comparison level (effect sizes, relative efficiency, relative precision) and for each individual person in the sample by computing the significance of their change.
Results: Group level analyses from both interval measurements and ordinal scores implied that the BI and FIMm had equivalent responsiveness (BI and FIMm effect size ranges -0.82 to -1.12 and -0.77 to -1.05, respectively). However, individual person level analyses indicated that the FIMm detected significant improvement in almost twice as many people as the BI (50%, n=496 vs 31%, n=298) and recorded fewer people as unchanged on discharge (FIMm 4%, n=38; BI 12%, n=115). This difference was statistically significant (χ²=273.81; p<0.001).
Conclusions: These findings demonstrate that effect size calculations are limited and potentially misleading indicators of rating scale responsiveness at the group comparison level. Rasch analysis at the individual person level showed the superior responsiveness of the FIMm, supporting clinical expectation, and demonstrated its added value as a method for examining and comparing rating scale responsiveness.
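The contrast the abstract draws between group level and individual person level responsiveness can be illustrated with a minimal sketch. The abstract does not give the authors' formulas, so everything below is an assumption: the function names are hypothetical, the two group statistics are the commonly used effect size (mean change divided by the SD of admission scores) and standardized response mean (mean change divided by the SD of change), the sign convention is discharge minus admission, and a person's change is called significant when it exceeds 1.96 times the combined standard error of their two measurements.

```python
from math import sqrt
from statistics import mean, stdev


def effect_size(admission, discharge):
    """Group-level effect size: mean change / SD of admission scores (assumed form)."""
    changes = [d - a for a, d in zip(admission, discharge)]
    return mean(changes) / stdev(admission)


def standardized_response_mean(admission, discharge):
    """Group-level SRM: mean change / SD of the change scores (assumed form)."""
    changes = [d - a for a, d in zip(admission, discharge)]
    return mean(changes) / stdev(changes)


def classify_change(admission, discharge, se_admission, se_discharge, critical=1.96):
    """Person-level test: change divided by the combined standard error.

    The standard errors would come from a measurement model such as Rasch
    analysis; here they are plain inputs, and the 1.96 cut-off is an assumption.
    """
    se_diff = sqrt(se_admission ** 2 + se_discharge ** 2)
    z = (discharge - admission) / se_diff
    if z >= critical:
        return "significant improvement"
    if z <= -critical:
        return "significant worsening"
    return "no significant change"


# Illustrative data only, not taken from the study.
adm = [1.0, 2.0, 3.0, 4.0]
dis = [2.0, 3.0, 5.0, 6.0]
print(effect_size(adm, dis), standardized_response_mean(adm, dis))

# The same raw change can be significant or not, depending on precision.
print(classify_change(0.0, 1.0, 0.3, 0.3))
print(classify_change(0.0, 1.0, 0.5, 0.5))
```

In this sketch, two scales could yield similar group-level statistics while the one with smaller person-level standard errors flags more individuals as significantly changed, which is the kind of discrepancy the Results describe.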
Pages: 1044-1048
Page count: 5