Multiple imputation of multiple multi-item scales when a full imputation model is infeasible Medical Research Methodology

被引:43
|
作者
Plumpton C.O. [1 ]
Morris T. [2 ,3 ]
Hughes D.A. [1 ]
White I.R. [4 ]
机构
[1] Centre for Health Economics and Medicines Evaluation, Bangor University, Ardudwy, Normal Site, Holyhead Road, Bangor, Gwynedd
[2] MRC Clinical Trials Unit, UCL, Institute of Clinical Trials and Methodology, 125 Kingsway, London
[3] London School of Hygiene and Tropical Medicine, Keppel Street, London
[4] MRC Biostatistics Unit, Cambridge Institute of Public Health, Robinson Way, Cambridge
基金
英国医学研究理事会;
关键词
Missing data; Multi-item scale; Multiple imputation; Survey data;
D O I
10.1186/s13104-016-1853-5
中图分类号
学科分类号
摘要
Background: Missing data in a large scale survey presents major challenges. We focus on performing multiple imputation by chained equations when data contain multiple incomplete multi-item scales. Recent authors have proposed imputing such data at the level of the individual item, but this can lead to infeasibly large imputation models. Methods: We use data gathered from a large multinational survey, where analysis uses separate logistic regression models in each of nine country-specific data sets. In these data, applying multiple imputation by chained equations to the individual scale items is computationally infeasible. We propose an adaptation of multiple imputation by chained equations which imputes the individual scale items but reduces the number of variables in the imputation models by replacing most scale items with scale summary scores. We evaluate the feasibility of the proposed approach and compare it with a complete case analysis. We perform a simulation study to compare the proposed method with alternative approaches: we do this in a simplified setting to allow comparison with the full imputation model. Results: For the case study, the proposed approach reduces the size of the prediction models from 134 predictors to a maximum of 72 and makes multiple imputation by chained equations computationally feasible. Distributions of imputed data are seen to be consistent with observed data. Results from the regression analysis with multiple imputation are similar to, but more precise than, results for complete case analysis; for the same regression models a 39 % reduction in the standard error is observed. The simulation shows that our proposed method can perform comparably against the alternatives. Conclusions: By substantially reducing imputation model sizes, our adaptation makes multiple imputation feasible for large scale survey data with multiple multi-item scales. For the data considered, analysis of the multiply imputed data shows greater power and efficiency than complete case analysis. The adaptation of multiple imputation makes better use of available data and can yield substantively different results from simpler techniques. © 2016 Plumpton et al.
引用
收藏
相关论文
共 24 条
  • [1] A comparison of multiple imputation strategies for handling missing data in multi-item scales: Guidance for longitudinal studies
    Mainzer, Rheanna
    Apajee, Jemishabye
    Nguyen, Cattram D.
    Carlin, John B.
    Lee, Katherine J.
    [J]. STATISTICS IN MEDICINE, 2021, 40 (21) : 4660 - 4674
  • [2] Missing data in a multi-item instrument were best handled by multiple imputation at the item score level
    Eekhout, Iris
    de Vet, Henrica C. W.
    Twisk, Jos W. R.
    Brand, Jaap P. L.
    de Boer, Michiel R.
    Heymans, Martijn W.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2014, 67 (03) : 335 - 342
  • [3] Multiple imputation for item scores when test data are factorially complex
    van Ginkel, Joost R.
    van der Ark, L. Andries
    Sijtsma, Klaas
    [J]. BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2007, 60 : 315 - 337
  • [4] The use and reporting of multiple imputation in medical research - a review
    Mackinnon, A.
    [J]. JOURNAL OF INTERNAL MEDICINE, 2010, 268 (06) : 586 - 593
  • [5] Applying multiple imputation to multi-item patient reported outcome measures: advantages and disadvantages of imputing at the item, sub-scale or score level
    Rombach, Ines
    Burke, Orlaith
    Jenkinson, Crispin
    Gray, Alastair
    Rivero-Arias, Oliver
    [J]. HEALTH AND QUALITY OF LIFE OUTCOMES, 2016, 14
  • [6] Passive imputation and parcel summaries are both valid to handle missing items in studies with many multi-item scales
    Eekhout, Iris
    de Vet, Henrica C. W.
    de Boer, Michiel R.
    Twisk, Jos W. R.
    Heymans, Martijn W.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2018, 27 (04) : 1128 - 1140
  • [7] The rise of multiple imputation: a review of the reporting and implementation of the method in medical research
    Rezvan, Panteha Hayati
    Lee, Katherine J.
    Simpson, Julie A.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2015, 15
  • [8] The rise of multiple imputation: a review of the reporting and implementation of the method in medical research
    Panteha Hayati Rezvan
    Katherine J Lee
    Julie A Simpson
    [J]. BMC Medical Research Methodology, 15
  • [9] Full Information Multiple Imputation for Linear Regression Model with Missing Response Variable
    Song, Limin
    Guo, Guangbao
    [J]. IAENG International Journal of Applied Mathematics, 2024, 54 (01) : 77 - 81
  • [10] Multi-item fuzzy economic production quantity model with multiple deliveries
    Moghdani, Reza
    Sana, Shib Sankar
    Shahbandarzadeh, Hamid
    [J]. SOFT COMPUTING, 2020, 24 (14) : 10363 - 10387