Missing value imputation using unsupervised machine learning techniques

被引:46
|
作者
Raja, P. S. [1 ]
Thangavel, K. [1 ]
机构
[1] Periyar Univ, Dept Comp Sci, Salem, Tamil Nadu, India
关键词
K-means; Fuzzy C-means; Rough K-means; Machine learning; Missing values; Imputation; ALGORITHMS; SET;
D O I
10.1007/s00500-019-04199-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In data mining, preprocessing is one of the essential processes which involves data normalization, noise removal, handling missing values, etc. This paper focuses on handling missing values using unsupervised machine learning techniques. Soft computation approaches are combined with the clustering techniques to form a novel method to handle the missing values, which help us to overcome the problems of inconsistency. Rough K-means centroid-based imputation method is proposed and compared with K-means centroid-based imputation method, fuzzy C-means centroid-based imputation method, K-means parameter-based imputation method, fuzzy C-means parameter-based imputation method, and rough K-means parameter-based imputation methods. The experimental analysis is carried out on four benchmark datasets, viz. Dermatology, Pima, Wisconsin, and Yeast datasets, which have taken from UCI data repository. The proposed method proves the efficacy of different datasets, and the results are also promising one.
引用
收藏
页码:4361 / 4392
页数:32
相关论文
共 50 条
  • [1] Missing value imputation using unsupervised machine learning techniques
    P. S. Raja
    K. Thangavel
    [J]. Soft Computing, 2020, 24 : 4361 - 4392
  • [2] A systematic review of machine learning-based missing value imputation techniques
    Thomas, Tressy
    Rajabi, Enayat
    [J]. DATA TECHNOLOGIES AND APPLICATIONS, 2021, 55 (04) : 558 - 585
  • [3] MISSING VALUE IMPUTATION WITH UNSUPERVISED BACKPROPAGATION
    Gashler, Michael S.
    Smith, Michael R.
    Morris, Richard
    Martinez, Tony
    [J]. COMPUTATIONAL INTELLIGENCE, 2016, 32 (02) : 196 - 215
  • [4] Empirical comparison of supervised learning techniques for missing value imputation
    Tsai, Chih-Fong
    Hu, Ya-Han
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (04) : 1047 - 1075
  • [5] Performance Analysis of Machine Learning Algorithms for Missing Value Imputation
    Abidin, Nadzurah Zainal
    Ismail, Amelia Ritahani
    Emran, Nurul A.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (06) : 442 - 447
  • [6] Empirical comparison of supervised learning techniques for missing value imputation
    Chih-Fong Tsai
    Ya-Han Hu
    [J]. Knowledge and Information Systems, 2022, 64 : 1047 - 1075
  • [7] Evaluation of Machine Learning Classification Algorithms & Missing Data Imputation Techniques
    Nwulu, Nnamdi I.
    [J]. 2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,
  • [8] Missing Data Imputation using Machine Learning Algorithm for Supervised Learning
    Cenitta, D.
    Arjunan, R. Vijaya
    Prema, K., V
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [9] Missing value imputation using least squares techniques in contaminated matrices
    Garcia-Pena, Marisol
    Arciniegas-Alarcon, Sergio
    Krzanowski, Wojtek J.
    [J]. METHODSX, 2022, 9
  • [10] A Classifier Ensemble Machine Learning Approach to Improve Efficiency for Missing Value Imputation
    Chhabra, Geeta
    Vashisht, Vasudha
    Ranjan, Jayanthi
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTING, POWER AND COMMUNICATION TECHNOLOGIES (GUCON), 2018, : 23 - 27