Missing value imputation method based on correlation analysis and Gaussian mixture model

Cited by: 0
Authors
Zhang, Jie [1]
Chang, Yuqing [1]
Wang, Ran [2]
Wang, Fuli [1]
Affiliations
[1] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[2] North China Elect Power Univ, Dept Automat, Baoding, Hebei, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Missing value imputation; correlation analysis; Gaussian mixture model; normality assessment; data grouping; TIME-SERIES;
DOI
10.1177/01423312241284660
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
This article proposes a novel procedure for missing value imputation that combines correlation analysis with a Gaussian mixture model (GMM). First, the normality of the data is assessed using a normality assessment algorithm, and an appropriate method for calculating the correlation coefficient is selected according to the result. The original correlation matrix is then binarized using a chosen threshold, and the binarized matrix is used to group variables into categories according to the correlations among them. Different imputation methods are applied to these categories: mean imputation for single-variable groups and GMM-driven imputation for multi-variable groups. For multi-variable groups, a GMM is trained using the Figueiredo-Jain algorithm, after which missing values are imputed using the mean derived from the model. Finally, experimental results on the Tennessee Eastman process and a gold hydrometallurgy process verify the superiority of the proposed algorithm.
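The grouping and imputation steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it rests on several assumptions the abstract does not confirm: the threshold is applied to the absolute correlation, variables are grouped as connected components of the binarized matrix, and "the mean derived from the model" is read as the conditional mean of the missing entries given the observed ones. The normality assessment and the Figueiredo-Jain training step are not shown; the GMM parameters (weights, means, covs) are taken as already fitted. All function names are hypothetical.

```python
import numpy as np

def group_variables(corr, threshold):
    """Group variables as connected components of the binarized |corr| matrix.
    Assumption: the grouping rule is not specified in the abstract."""
    adj = np.abs(corr) >= threshold            # binarized correlation matrix
    unseen, groups = set(range(adj.shape[0])), []
    while unseen:
        stack, group = [unseen.pop()], []
        while stack:
            i = stack.pop()
            group.append(i)
            nbrs = [j for j in unseen if adj[i, j]]
            unseen.difference_update(nbrs)
            stack.extend(nbrs)
        groups.append(sorted(group))
    return sorted(groups)

def mean_impute(col):
    """Single-variable group: replace NaNs with the mean of the observed values."""
    out = col.copy()
    out[np.isnan(out)] = np.nanmean(col)
    return out

def gmm_conditional_mean(x, weights, means, covs):
    """Multi-variable group: fill NaNs in one sample x with
    E[x_missing | x_observed] under an already fitted GMM."""
    m = np.isnan(x)
    if not m.any():
        return x.copy()
    o = ~m
    log_r, cond = [], []
    for w, mu, S in zip(weights, means, covs):
        Soo = S[np.ix_(o, o)]                  # covariance of observed entries
        d = x[o] - mu[o]
        sol = np.linalg.solve(Soo, d)
        _, logdet = np.linalg.slogdet(Soo)
        # Log responsibility; the 2*pi term is identical for every
        # component and cancels on normalization, so it is dropped.
        log_r.append(np.log(w) - 0.5 * (logdet + d @ sol))
        # Per-component conditional mean of the missing entries.
        cond.append(mu[m] + S[np.ix_(m, o)] @ sol)
    log_r = np.array(log_r)
    r = np.exp(log_r - log_r.max())
    r /= r.sum()
    out = x.copy()
    out[m] = r @ np.array(cond)                # responsibility-weighted mean
    return out
```

In use, `group_variables` would be called once on the correlation matrix, `mean_impute` applied to each single-variable group, and `gmm_conditional_mean` applied row by row to each multi-variable group with that group's fitted parameters.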
Pages: 14