A Bayesian Singular Value Decomposition Procedure for Missing Data Imputation

被引:2
|
作者
Zhai, Ruoshui [1 ]
Gutman, Roee [1 ]
机构
[1] Brown Univ, Dept Biostat, Providence, RI 02912 USA
基金
美国国家科学基金会;
关键词
Double exponential prior distribution; Matrix decomposition; Matrix recovery algorithm; Multiple imputation; MULTIPLE-IMPUTATION; MULTIVARIATE; INFERENCE; DISTRIBUTIONS; MODELS; MICE;
D O I
10.1080/10618600.2022.2107534
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Missing data are common in empirical studies. Multiple imputation is a method to handle missing values by replacing them with plausible values. A common imputation method is multiple imputation with chain equations (MICE). MICE defines a series of conditional distributions to impute missing values. Although MICE is relatively easy to implement, it may not converge to a proper joint distribution. An alternative strategy is to model the variables jointly using the general location model, but this model can become complex when the number of variables increases. Both approaches require integration of prior information when there are more variables than cases. We propose a Bayesian model that is based on the singular value decomposition components of a continuous data matrix to impute missing values. The model assumes that the matrix is of low rank by applying double exponential prior distributions on the singular values. We describe an efficient sampling algorithm to estimate the model's parameters and impute the missing data. The performance of the model is compared to current imputation methods in simulated and real datasets. Of all the methods considered and in most of the simulated and real datasets, the proposed procedure appears to be the most accurate and precise. Supplementary materials for this article are available online.
引用
收藏
页码:470 / 482
页数:13
相关论文
共 50 条
  • [41] Missing Value Imputation of Wireless Sensor Data for Environmental Monitoring
    Decorte, Thomas
    Mortier, Steven
    Lembrechts, Jonas J.
    Meysman, Filip J. R.
    Latre, Steven
    Mannens, Erik
    Verdonck, Tim
    [J]. SENSORS, 2024, 24 (08)
  • [42] SPSS syntax for missing value imputation in test and questionnaire data
    van Ginkel, JR
    van der Ark, LA
    [J]. APPLIED PSYCHOLOGICAL MEASUREMENT, 2005, 29 (02) : 152 - 153
  • [43] A Comparative Study of Missing Value Imputation Methods for Education Data
    Keerin, Phimmarin
    [J]. 29TH INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION (ICCE 2021), VOL II, 2021, : 109 - 117
  • [44] Effective Missing Value Imputation Methods for Building Monitoring Data
    Cho, Brian
    Dayrit, Teresa
    Gao, Yuan
    Wang, Zhe
    Hong, Tianzhen
    Sim, Alex
    Wu, Kesheng
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2866 - 2875
  • [45] Handling Missing Data in Presence of Categorical Variables: a New Imputation Procedure
    Ferrari, Pier Alda
    Barbiero, Alessandro
    Manzi, Giancarlo
    [J]. NEW PERSPECTIVES IN STATISTICAL MODELING AND DATA ANALYSIS, 2011, : 473 - 480
  • [46] Missing Data Imputation for MIMIC-III using Matrix Decomposition
    Yang, Xi
    Kim, Yeo Jin
    Khoshnevisan, Farzaneh
    Zhang, Yuan
    Chi, Min
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2019, : 556 - 558
  • [47] IMPUTATION USING THE SINGULAR VALUE DECOMPOSITION: VARIANTS OF EXISTING METHODS, PROPOSED AND ASSESSED
    Arciniegas-Alarcon, Sergio
    Garcia-Pena, Marisol
    Krzanowski, Wojtek Janusz
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2020, 16 (05): : 1681 - 1696
  • [48] Deep learning for missing value imputation of continuous data and the effect of data discretization
    Lin, Wei-Chao
    Tsai, Chih-Fong
    Zhong, Jia Rong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 239
  • [49] Missing data imputation: focusing on single imputation
    Zhang, Zhongheng
    [J]. ANNALS OF TRANSLATIONAL MEDICINE, 2016, 4 (01)
  • [50] Improving the performance of Bayesian networks in non-ignorable missing data imputation
    Niloofar, P.
    Ganjali, M.
    Rohani, M. R. Farid
    [J]. KUWAIT JOURNAL OF SCIENCE, 2013, 40 (02) : 83 - 101