Stable feature selection based on instance learning, redundancy elimination and efficient subsets fusion

被引:0
|
作者
Afef Ben Brahim
机构
[1] Université de Tunis,Tunis Business School, LARODEC
来源
Neural Computing and Applications | 2021年 / 33卷
关键词
Feature selection; High dimensionality; Instance-based learning; Stability;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection is frequently used as a preprocessing step to data mining and is attracting growing attention due to the increasing amounts of data emerging from different domains. The large data dimensionality increases the noise and thus the error of learning algorithms. Filter methods for feature selection are specially very fast and useful for high-dimensional datasets. Existing methods focus on producing feature subsets that improve predictive performance, but they often suffer from instability. Instance-based filters, for example, are considered as one of the most effective methods that rank features based on instances neighborhood. However, as the feature weight fluctuates with the instances, small changes in training data result in a different selected subset of features. By another hand, some other filters generate stable results but lead to a modest predictive performance. The absence of a trade-off between stability and classification accuracy decreases the reliability of the feature selection results. In order to deal with this issue, we propose filter methods that improve stability of feature selection while preserving an optimal predictive accuracy and without increasing the complexity of the feature selection algorithms. The proposed approaches first use the strength of instance learning to identify initial sets of relevant features, and the advantage of aggregation techniques to increase the stability of the final set in a second stage. Two classification algorithms are used to evaluate the predictive performance of our proposed instance-based filters compared to state-of-the-art algorithms. The obtained results show the efficiency of our methods in improving both classification accuracy and feature selection stability for high-dimensional datasets.
引用
收藏
页码:1221 / 1232
页数:11
相关论文
共 50 条
  • [21] Efficient and Stable Unsupervised Feature Selection Based on Novel Structured Graph and Data Discrepancy Learning
    Huang, Pei
    Kong, Zhaoming
    Wang, Limin
    Han, Xuming
    Yang, Xiaowei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [22] A hybrid feature selection method based on instance learning and cooperative subset search
    Ben Brahim, Afef
    Limam, Mohamed
    PATTERN RECOGNITION LETTERS, 2016, 69 : 28 - 34
  • [23] Hybrid-Recursive Feature Elimination for Efficient Feature Selection
    Jeon, Hyelynn
    Oh, Sejong
    APPLIED SCIENCES-BASEL, 2020, 10 (09):
  • [24] A feature selection method based on multiple feature subsets extraction and result fusion for improving classification performance
    Liu, Jia
    Li, Dong
    Shan, Wangweiyi
    Liu, Shulin
    APPLIED SOFT COMPUTING, 2024, 150
  • [25] A feature selection method based on minimum redundancy maximum relevance for learning to rank
    Shirzad, Mehrnoush Barani
    Keyvanpour, Mohammad Reza
    2015 AI & ROBOTICS (IRANOPEN), 2015,
  • [26] Deep Feature Fusion Multiple Instance Learning for WaDang Recognition
    Wen, Chao
    Li, Zhan
    Li, Aiping
    Qu, Jian
    IEEE ACCESS, 2019, 7 : 98555 - 98564
  • [27] A novel multiple instance learning approach for image retrieval based on AdaBoost feature selection
    Yuan, Xun
    Hua, Xian-Sheng
    Wang, Meng
    Qi, Guo-Jun
    Wu, Xiu-Qing
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1491 - +
  • [28] Enabling Efficient Deep Learning on MCU With Transient Redundancy Elimination
    Liu, Jiesong
    Zhang, Feng
    Guan, Jiawei
    Sung, Hsin-Hsuan
    Guo, Xiaoguang
    Long, Saiqin
    Du, Xiaoyong
    Shen, Xipeng
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (12) : 2649 - 2663
  • [29] A Multi-instance Multi-label Learning Algorithm Based on Feature Selection
    Chen Tong-tong
    Liu Chan-juan
    Zou Hai-lin
    Shen Qian
    Liu Ying
    Ding Xin-miao
    2015 10TH INTERNATIONAL CONFERENCE ON BROADBAND AND WIRELESS COMPUTING, COMMUNICATION AND APPLICATIONS (BWCCA 2015), 2015, : 587 - 590
  • [30] Neighborhood Component Feature Selection for Multiple Instance Learning Paradigm
    Turri, Giacomo
    Romeo, Luca
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT I, ECML PKDD 2024, 2024, 14941 : 230 - 247