A novel feature selection framework for incomplete data

被引:0
|
作者
Guo, Cong [1 ]
Yang, Wei [1 ]
Li, Zheng [1 ]
Liu, Chun [1 ]
机构
[1] Henan Univ, Sch Comp & Informat Engn, Henan Key Lab Big Data Anal & Proc, Henan Engn Lab Spatial Informat Proc, Kaifeng 475004, Peoples R China
关键词
Feature selection; Incomplete data; ReliefF; MATRIX COMPLETION; MISSING VALUES; CLASSIFICATION;
D O I
10.1016/j.chemolab.2024.105193
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection on incomplete datasets is a challenging task. To address this challenge, existing methods first employ imputation methods to complete the dataset and then perform feature selection based on the imputed dataset. Since missing value imputation and feature selection are entirely independent, the importance of features cannot be considered during imputation. However, in real-world scenarios or datasets, different features have varying degrees of importance. To this end, we proposed a novel incomplete data feature selection framework that considers feature importance. The framework mainly consists of two alternating iterative stages: M-stage and W-stage. In the M-stage, missing values are imputed based on a given feature importance vector and multiple initial imputation results. In the W-stage, an improved reliefF algorithm is employed to learn the feature importance vector based on the imputed data. In particular, the feature importance output by the W-stage in the current iteration will be used as the input of the M-stage in the next iteration. Experimental results on artificial and real missing datasets demonstrate that the proposed method outperforms other approaches significantly.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Unified View Imputation and Feature Selection Learning for Incomplete Multi-view Data
    Huang, Yanyong
    Shen, Zongxin
    Li, Tianrui
    Lv, Fengmao
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 4192 - 4200
  • [42] A generalized fuzzy clustering framework for incomplete data by integrating feature weighted and kernel learning
    Yang, Ying
    Chen, Haoyu
    Wu, Haoshen
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [43] A generalized fuzzy clustering framework for incomplete data by integrating feature weighted and kernel learning
    Yang Y.
    Chen H.
    Wu H.
    PeerJ Computer Science, 2023, 9
  • [44] A Novel Framework for Anomaly Detection via Feature Selection and Dimensionality Reduction
    Chang, Haotian
    Feng, Jing
    Duan, Chaofan
    Yan, Chao
    Yin, Min
    Li, Yi
    FUZZY SYSTEMS AND DATA MINING V (FSDM 2019), 2019, 320 : 511 - 522
  • [45] HMOSHSSA: a novel framework for solving simultaneous clustering and feature selection problems
    Kumar, Vijay
    Kumari, Rajani
    Kumar, Sandeep
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (35) : 82149 - 82175
  • [46] Maximum weight and minimum redundancy: A novel framework for feature subset selection
    Wang, Jianzhong
    Wu, Lishan
    Kong, Jun
    Li, Yuxin
    Zhang, Baoxue
    PATTERN RECOGNITION, 2013, 46 (06) : 1616 - 1627
  • [47] A Novel Retrieval Framework Using Classification, Feature Selection and Indexing Structure
    Feng, Yue
    Urruty, Thierry
    Jose, Joemon M.
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 731 - +
  • [48] A Framework for Distributed Feature Selection
    Sharifnezhad, Mona
    Rahmani, Mohsen
    Ghaffarian, Hossein
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (07)
  • [49] A Framework for Feature Selection in Clustering
    Witten, Daniela M.
    Tibshirani, Robert
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (490) : 713 - 726
  • [50] Local Adaptive Projection Framework for Feature Selection of Labeled and Unlabeled Data
    Chen, Xiaojun
    Yuan, Guowen
    Wang, Wenting
    Nie, Feiping
    Chang, Xiaojun
    Huang, Joshua Zhexue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (12) : 6362 - 6373