Missing value imputation method based on correlation analysis and Gaussian mixture model

被引:0
|
作者
Zhang, Jie [1 ]
Chang, Yuqing [1 ]
Wang, Ran [2 ]
Wang, Fuli [1 ]
机构
[1] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[2] North China Elect Power Univ, Dept Automat, Baoding, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
Missing value imputation; correlation analysis; Gaussian mixture model; normality assessment; data grouping; TIME-SERIES;
D O I
10.1177/01423312241284660
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article proffers a novel procedure for missing value imputation, combining correlation analysis and Gaussian mixture model (GMM). Firstly, the normality of the data is assessed using normality assessment algorithm, and then the appropriate correlation coefficient calculation approach is selected owing to the normality of the data. Subsequently, the original correlation matrix is transformed into a binarized matrix based on a chosen threshold, which is used to group variables into different categories according to the correlation among them. Different missing value imputation methods are applied to these categories: mean imputation for single-variable groups and GMM-driven imputation for multi-variable groups. For multi-variable groups, a GMM model is trained using the Figueiredo-Jain algorithm, after which missing values are imputed using the mean derived from the model. Ultimately, the experimental evidence from Tennessee Eastman process and gold hydrometallurgy process further verify the superiority of the proposed algorithm.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Missing value imputation: a review and analysis of the literature (2006–2017)
    Wei-Chao Lin
    Chih-Fong Tsai
    Artificial Intelligence Review, 2020, 53 : 1487 - 1509
  • [42] Missing value imputation for the analysis of incomplete traffic accident data
    Deb, Rupam
    Liew, Alan Wee -Chung
    INFORMATION SCIENCES, 2016, 339 : 274 - 289
  • [43] Performance Analysis of Machine Learning Algorithms for Missing Value Imputation
    Abidin, Nadzurah Zainal
    Ismail, Amelia Ritahani
    Emran, Nurul A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (06) : 442 - 447
  • [44] Imputation Algorithm Based on Copula for Missing Value in Timeseries Data
    Afrianti, Y. S.
    Indratno, S. W.
    Pasaribu, U. S.
    2014 2ND INTERNATIONAL CONFERENCE ON TECHNOLOGY, INFORMATICS, MANAGEMENT, ENGINEERING, AND ENVIRONMENT (TIME-E 2014), 2014, : 252 - 257
  • [45] An airborne image stabilization method based on the Gaussian mixture model
    Hongbin Deng
    Yunde Jia
    Yihua Xu
    Wei Liang
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 2488 - 2492
  • [46] A Novel Jitter Separation Method Based on Gaussian Mixture Model
    Mistry, Deepak
    Joshi, Sunil
    Agrawal, Navneet
    2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,
  • [47] EEG Data Augmentation Method Based on the Gaussian Mixture Model
    Liao, Chuncheng
    Zhao, Shiyu
    Wang, Xiangcun
    Zhang, Jiacai
    Liao, Yongzhong
    Wu, Xia
    MATHEMATICS, 2025, 13 (05)
  • [48] Missing Data Dynamic Forecasting of Fuzzy Time Series Based on Gaussian Mixture Model
    Huo, Xu
    Hao, Kuangrong
    Chen, Lei
    Cai, Xin
    Liu, Xiaoyan
    Ren, Lihong
    2022 IEEE INTERNATIONAL SYMPOSIUM ON ADVANCED CONTROL OF INDUSTRIAL PROCESSES (ADCONIP 2022), 2022, : 222 - 227
  • [49] A method for robotic grasping based on improved Gaussian mixture model
    Tao, Yong
    Ren, Fan
    Chen, Youdong
    Wang, Tianmiao
    Zou, Yu
    Chen, Chaoyong
    Jiang, Shan
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2020, 17 (02) : 1495 - 1510
  • [50] Gaussian Mixture Model Based Prediction Method of Movie Rating
    Zhu, Jiaxin
    Guo, Yijun
    Hao, Jianjun
    Li, Jianfeng
    Chen, Duo
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 2114 - 2118