A Bayesian Singular Value Decomposition Procedure for Missing Data Imputation

被引:2
|
作者
Zhai, Ruoshui [1 ]
Gutman, Roee [1 ]
机构
[1] Brown Univ, Dept Biostat, Providence, RI 02912 USA
基金
美国国家科学基金会;
关键词
Double exponential prior distribution; Matrix decomposition; Matrix recovery algorithm; Multiple imputation; MULTIPLE-IMPUTATION; MULTIVARIATE; INFERENCE; DISTRIBUTIONS; MODELS; MICE;
D O I
10.1080/10618600.2022.2107534
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Missing data are common in empirical studies. Multiple imputation is a method to handle missing values by replacing them with plausible values. A common imputation method is multiple imputation with chain equations (MICE). MICE defines a series of conditional distributions to impute missing values. Although MICE is relatively easy to implement, it may not converge to a proper joint distribution. An alternative strategy is to model the variables jointly using the general location model, but this model can become complex when the number of variables increases. Both approaches require integration of prior information when there are more variables than cases. We propose a Bayesian model that is based on the singular value decomposition components of a continuous data matrix to impute missing values. The model assumes that the matrix is of low rank by applying double exponential prior distributions on the singular values. We describe an efficient sampling algorithm to estimate the model's parameters and impute the missing data. The performance of the model is compared to current imputation methods in simulated and real datasets. Of all the methods considered and in most of the simulated and real datasets, the proposed procedure appears to be the most accurate and precise. Supplementary materials for this article are available online.
引用
收藏
页码:470 / 482
页数:13
相关论文
共 50 条
  • [1] Missing value imputation in a data matrix using the regularised singular value decomposition
    Arciniegas-Alarcon, Sergio
    Garcia-Pena, Marisol
    Krzanowski, Wojtek J.
    Rengifo, Camilo
    [J]. METHODSX, 2023, 11
  • [2] Imputation of Mixed Data With Multilevel Singular Value Decomposition
    Husson, Francois
    Josse, Julie
    Narasimhan, Balasubramanian
    Robin, Genevieve
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (03) : 552 - 566
  • [3] BAYESIAN IMPUTATION FOR MISSING DATA
    Nads, Azman A.
    Polestico, Daisy Lou L.
    [J]. ADVANCES AND APPLICATIONS IN STATISTICS, 2022, 79 : 83 - 104
  • [4] A Bayesian robust CP decomposition approach for missing traffic data imputation
    Zhu, Yun
    Wang, Weiye
    Yu, Gaohang
    Wang, Jun
    Tang, Lei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33171 - 33184
  • [5] A Bayesian robust CP decomposition approach for missing traffic data imputation
    Yun Zhu
    Weiye Wang
    Gaohang Yu
    Jun Wang
    Lei Tang
    [J]. Multimedia Tools and Applications, 2022, 81 : 33171 - 33184
  • [6] Faster Imputation Using Singular Value Decomposition for Sparse Data
    Phuc Nguyen
    Tran, Linh G. H.
    Le, Bao H.
    Nguyen, Thuong H. T.
    Thu Nguyen
    Nguyen, Hien D.
    Nguyen, Binh T.
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT I, 2023, 13995 : 135 - 146
  • [7] Missing-value imputation using the robust singular-value decomposition: Proposals and numerical evaluation
    Garcia-Pena, Marisol
    Arciniegas-Alarcon, Sergio
    Krzanowski, Wojtek J.
    Duarte, Diego
    [J]. CROP SCIENCE, 2021, 61 (05) : 3288 - 3300
  • [8] Incremental singular value decomposition of uncertain data with missing values
    Brand, M
    [J]. COMPUTER VISON - ECCV 2002, PT 1, 2002, 2350 : 707 - 720
  • [9] A Bayesian Treatment for Singular Value Decomposition
    Luo, Cheng
    Xiang, Yang
    Zhang, Bo
    Fang, Qiang
    [J]. 2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 1761 - 1767
  • [10] Block Tensor Train Decomposition for Missing Value Imputation
    Lee, Namgil
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1338 - 1343