A novel feature selection framework for incomplete data

被引:0
|
作者
Guo, Cong [1 ]
Yang, Wei [1 ]
Li, Zheng [1 ]
Liu, Chun [1 ]
机构
[1] Henan Univ, Sch Comp & Informat Engn, Henan Key Lab Big Data Anal & Proc, Henan Engn Lab Spatial Informat Proc, Kaifeng 475004, Peoples R China
关键词
Feature selection; Incomplete data; ReliefF; MATRIX COMPLETION; MISSING VALUES; CLASSIFICATION;
D O I
10.1016/j.chemolab.2024.105193
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection on incomplete datasets is a challenging task. To address this challenge, existing methods first employ imputation methods to complete the dataset and then perform feature selection based on the imputed dataset. Since missing value imputation and feature selection are entirely independent, the importance of features cannot be considered during imputation. However, in real-world scenarios or datasets, different features have varying degrees of importance. To this end, we proposed a novel incomplete data feature selection framework that considers feature importance. The framework mainly consists of two alternating iterative stages: M-stage and W-stage. In the M-stage, missing values are imputed based on a given feature importance vector and multiple initial imputation results. In the W-stage, an improved reliefF algorithm is employed to learn the feature importance vector based on the imputed data. In particular, the feature importance output by the W-stage in the current iteration will be used as the input of the M-stage in the next iteration. Experimental results on artificial and real missing datasets demonstrate that the proposed method outperforms other approaches significantly.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Robust Feature Selection on Incomplete Data
    Zheng, Wei
    Zhu, Xiaofeng
    Zhu, Yonghua
    Zhang, Shichao
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3191 - 3197
  • [2] Online feature selection and classification with incomplete data
    Kalkan, Habil
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2014, 22 (06) : 1625 - 1636
  • [3] Bagging and Feature Selection for Classification with Incomplete Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2017, PT I, 2017, 10199 : 471 - 486
  • [4] Optimal and Novel Hybrid Feature Selection Framework for Effective Data Classification
    Venkataraman, Sivakumar
    Selvaraj, Rajalakshmi
    ADVANCES IN SYSTEMS, CONTROL AND AUTOMATION, 2018, 442 : 499 - 514
  • [5] Feature selection for incomplete set-valued data
    Li, Lulu
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (01) : 1217 - 1235
  • [6] A Novel PSO-FLANN Framework of Feature Selection and Classification for Microarray Data
    Parhi, Pournamasi
    Mishra, Debahuti
    Mishra, Sashikala
    Shaw, Kailash
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 1644 - 1649
  • [7] Unsupervised Cross-View Feature Selection on incomplete data
    Xu, Yuanyuan
    Yin, Yu
    Wang, Jun
    Wei, Jinmao
    Liu, Jian
    Yao, Lina
    Zhang, Wenjie
    KNOWLEDGE-BASED SYSTEMS, 2021, 234
  • [8] Mutual information criterion for feature selection from incomplete data
    Qian, Wenbin
    Shu, Wenhao
    NEUROCOMPUTING, 2015, 168 : 210 - 220
  • [9] An Embedded Feature Selection Framework for Hybrid Data
    Boroujeni, Forough Rezaei
    Stantic, Bela
    Wang, Sen
    DATABASES THEORY AND APPLICATIONS, ADC 2017, 2017, 10538 : 138 - 150
  • [10] Accelerating Incomplete Feature Selection
    Qian, Yuhua
    Liang, Jiye
    Wei, Wei
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 350 - +