Missing value imputation method based on correlation analysis and Gaussian mixture model

被引:0
|
作者
Zhang, Jie [1 ]
Chang, Yuqing [1 ]
Wang, Ran [2 ]
Wang, Fuli [1 ]
机构
[1] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[2] North China Elect Power Univ, Dept Automat, Baoding, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Missing value imputation; correlation analysis; Gaussian mixture model; normality assessment; data grouping; TIME-SERIES;
D O I
10.1177/01423312241284660
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article proffers a novel procedure for missing value imputation, combining correlation analysis and Gaussian mixture model (GMM). Firstly, the normality of the data is assessed using normality assessment algorithm, and then the appropriate correlation coefficient calculation approach is selected owing to the normality of the data. Subsequently, the original correlation matrix is transformed into a binarized matrix based on a chosen threshold, which is used to group variables into different categories according to the correlation among them. Different missing value imputation methods are applied to these categories: mean imputation for single-variable groups and GMM-driven imputation for multi-variable groups. For multi-variable groups, a GMM model is trained using the Figueiredo-Jain algorithm, after which missing values are imputed using the mean derived from the model. Ultimately, the experimental evidence from Tennessee Eastman process and gold hydrometallurgy process further verify the superiority of the proposed algorithm.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Missing value imputation method for heterogeneous traffic flow data based on feature fusion
    Li L.
    Qu X.
    Zhang J.
    Wang Y.
    Li H.
    Ran B.
    Zhang, Jian (jianzhang@seu.edu.cn), 2018, Southeast University (48): : 972 - 978
  • [32] A Two-Stage Missing Value Imputation Method Based on Autoencoder Neural Network
    Yu, Jiayin
    He, Yulin
    Huang, Joshua Zhexue
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 6064 - 6066
  • [33] Microarray Missing Value Imputation: A Regularized Local Learning Method
    Wang, Aiguo
    Chen, Ye
    An, Ning
    Yang, Jing
    Li, Lian
    Jiang, Lili
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (03) : 980 - 993
  • [34] Non-Gaussian random vibration damage analysis of logistics based on a Gaussian mixture model method
    Guo, Tao
    Ge, Changfeng
    Xia, Sixuan
    Yin, Cheng
    Lin, Rang
    Qian, Jing
    Zhendong yu Chongji/Journal of Vibration and Shock, 2024, 43 (12): : 203 - 211
  • [35] Koopman Theory-Based Missing Value Imputation Model for Nonlinear Dynamical Systems
    Hwang, Yu Min
    Park, Sangjun
    Lee, Hyunyoung
    Ko, Seok-Kap
    Transactions of the Korean Institute of Electrical Engineers, 2024, 73 (11): : 2004 - 2010
  • [36] Hybrid prediction model with missing value imputation for medical data
    Purwar, Archana
    Singh, Sandeep Kumar
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (13) : 5621 - 5631
  • [37] Missing Value Imputation Model Based on Adversarial Autoencoder Using Spatiotemporal Feature Extraction
    Shin, Dong-Hoon
    Lee, Seo-El
    Jeon, Byeong-Uk
    Chung, Kyungyong
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (02): : 1925 - 1940
  • [38] MVIRA: A model based on Missing Value Imputation and Reliability Assessment for mortality risk prediction
    Li, Bo
    Jin, Yide
    Yu, Xiaojing
    Song, Li
    Zhang, Jianjun
    Sun, Hongfeng
    Liu, Hui
    Shi, Yuliang
    Kong, Fanyu
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 178
  • [39] Adaptive Missing Data Imputation with Incremental Neuro-Fuzzy Gaussian Mixture Network (INFGMN)
    Mazzutti, Tiago
    Roisenberg, Mauro
    de Freitas Filho, Paulo Jose
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 713 - 720
  • [40] A constrained Gaussian mixture model for correlation-based cluster analysis of gene expression data
    Yukinawa, Naoto
    Yoshioka, Taku
    Kobayashi, Kazuo
    Ogasawara, Naotake
    Ishii, Shin
    IPSJ Transactions on Bioinformatics, 2009, 2 : 47 - 62