Effect sizes can be misleading: is it time to change the way we measure change?

Cited by: 41

Authors:

Hobart, Jeremy C. [1,2]

Cano, Stefan J. [2]

Thompson, Alan J. [2]

Affiliations:

[1] Peninsula Coll Med & Dent, Dept Clin Neurosci, Neurol Outcome Measures Unit, Plymouth PL6 8BX, Devon, England

[2] UCL Inst Neurol, London, England

Source:

JOURNAL OF NEUROLOGY NEUROSURGERY AND PSYCHIATRY | 2010 / Vol. 81 / Issue 09

Funding:

UK Medical Research Council;

Keywords:

FUNCTIONAL INDEPENDENCE MEASURE; RATING-SCALE RESPONSIVENESS; HEALTH-STATUS INSTRUMENTS; BARTHEL INDEX; INPATIENT REHABILITATION; MULTIPLE-SCLEROSIS; DISABILITY; IMPACT; INJURY; STROKE;

DOI:

10.1136/jnnp.2009.201392

Chinese Library Classification:

R74 [Neurology and Psychiatry];

Discipline classification code:

Abstract:

Objectives: Previous comparisons of the ability to detect change in the Barthel Index (BI) and Functional Independence Measure motor scale (FIMm) have implied that these two scales are equally responsive when examined using traditional effect size statistics. Clinically, this is counterintuitive, as the FIMm has greater potential to detect change than the BI, and it raises concerns about the validity of effect size statistics as indicators of rating scale responsiveness. To examine these concerns, this study applied a sophisticated psychometric analysis, Rasch measurement, to BI and FIMm data.

Methods: BI and FIMm data were examined from 976 people at a single neurorehabilitation unit. Rasch analysis was used to compare the responsiveness of the BI and FIMm at the group comparison level (effect sizes, relative efficiency, relative precision) and for each individual person in the sample by computing the significance of their change.

Results: Group level analyses from both interval measurements and ordinal scores implied the BI and FIMm had equivalent responsiveness (BI and FIMm effect size ranges -0.82 to -1.12 and -0.77 to -1.05, respectively). However, individual person level analyses indicated that the FIMm detected significant improvement in almost twice as many people as the BI (50%, n=496 vs 31%, n=298), and recorded fewer people as unchanged on discharge (FIMm 4%, n=38; BI 12%, n=115). This difference was statistically significant (χ²=273.81; p<0.000).

Conclusions: These findings demonstrate that effect size calculations are limited and potentially misleading indicators of rating scale responsiveness at the group comparison level. Rasch analysis at the individual person level showed the superior responsiveness of the FIMm, supporting clinical expectation, and its added value as a method for examining and comparing rating scale responsiveness.
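The contrast the abstract draws between group level and individual person level responsiveness can be illustrated with a minimal sketch. It is not the authors' analysis. It assumes that the group effect size is the mean change divided by the standard deviation of admission scores, and that an individual's significance of change is the change in their measure divided by the pooled standard error of the two measurements, compared against 1.96. The function names, the sign convention (discharge minus admission) and the threshold are illustrative assumptions; the abstract does not state the formulas used.

```python
import math
from statistics import mean, stdev


def effect_size(admission, discharge):
    """Group-level effect size (assumed form): mean change / SD of admission scores."""
    changes = [d - a for a, d in zip(admission, discharge)]
    return mean(changes) / stdev(admission)


def significance_of_change(m1, se1, m2, se2):
    """Individual-level statistic (assumed form): change / pooled standard error.

    m1, m2 are a person's measures at two time points; se1, se2 are the
    standard errors of those measures (e.g. as reported by a Rasch analysis).
    """
    return (m2 - m1) / math.sqrt(se1 ** 2 + se2 ** 2)


def classify_change(z, critical=1.96):
    """Label a person's change; the 1.96 cut-off is an assumption (two-sided 5%)."""
    if z >= critical:
        return "significant improvement"
    if z <= -critical:
        return "significant worsening"
    if z == 0:
        return "unchanged"
    return "non-significant change"


if __name__ == "__main__":
    # Hypothetical data, not taken from the study.
    admission = [2.0, 4.0, 6.0]
    discharge = [3.0, 5.0, 7.0]
    print("group effect size:", effect_size(admission, discharge))

    z = significance_of_change(0.0, 0.3, 1.0, 0.4)
    print("individual change:", classify_change(z))
```

Under these assumptions, two scales can yield similar group effect sizes while differing in how many individuals cross the significance threshold, because the individual statistic depends on each person's measurement error as well as on the size of their change.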

Pages: 1044-1048

Page count: 5
