Stable feature selection based on instance learning, redundancy elimination and efficient subsets fusion

被引:0
|
作者
Afef Ben Brahim
机构
[1] Université de Tunis,Tunis Business School, LARODEC
来源
关键词
Feature selection; High dimensionality; Instance-based learning; Stability;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection is frequently used as a preprocessing step to data mining and is attracting growing attention due to the increasing amounts of data emerging from different domains. The large data dimensionality increases the noise and thus the error of learning algorithms. Filter methods for feature selection are specially very fast and useful for high-dimensional datasets. Existing methods focus on producing feature subsets that improve predictive performance, but they often suffer from instability. Instance-based filters, for example, are considered as one of the most effective methods that rank features based on instances neighborhood. However, as the feature weight fluctuates with the instances, small changes in training data result in a different selected subset of features. By another hand, some other filters generate stable results but lead to a modest predictive performance. The absence of a trade-off between stability and classification accuracy decreases the reliability of the feature selection results. In order to deal with this issue, we propose filter methods that improve stability of feature selection while preserving an optimal predictive accuracy and without increasing the complexity of the feature selection algorithms. The proposed approaches first use the strength of instance learning to identify initial sets of relevant features, and the advantage of aggregation techniques to increase the stability of the final set in a second stage. Two classification algorithms are used to evaluate the predictive performance of our proposed instance-based filters compared to state-of-the-art algorithms. The obtained results show the efficiency of our methods in improving both classification accuracy and feature selection stability for high-dimensional datasets.
引用
收藏
页码:1221 / 1232
页数:11
相关论文
共 50 条
  • [1] Stable feature selection based on instance learning, redundancy elimination and efficient subsets fusion
    Ben Brahim, Afef
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (04): : 1221 - 1232
  • [2] Efficient Feature Selection and Multiclass Classification with Integrated Instance and Model Based Learning
    Liu, Zhenqiu
    Bensmail, Halima
    Tan, Ming
    EVOLUTIONARY BIOINFORMATICS, 2012, 8 : 197 - 205
  • [3] Multi-label learning based on instance correlation and feature redundancy
    Zhang, Yong
    Jiang, Yuqing
    Zhang, Qi
    Liu, Da
    PATTERN RECOGNITION LETTERS, 2023, 176 : 123 - 130
  • [4] Feature Selection with Ensembles, Artificial Variables, and Redundancy Elimination
    Tuv, Eugene
    Borisov, Alexander
    Runger, George
    Torkkola, Kari
    JOURNAL OF MACHINE LEARNING RESEARCH, 2009, 10 : 1341 - 1366
  • [5] New data reduction algorithms based on the fusion of instance and feature selection
    Kusy, Maciej
    Zajdel, Roman
    KNOWLEDGE-BASED SYSTEMS, 2024, 296
  • [6] Efficient Spectral Feature Selection with Minimum Redundancy
    Zhao, Zheng
    Wang, Lei
    Liu, Huan
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 673 - 678
  • [7] CORRELATION-BASED FEATURE SELECTION WITH BAG-BASED FUSION SCHEME FOR MULTI-INSTANCE LEARNING APPLICATION
    Berahim, Mazniha
    Samsudin, Noor Azah
    Mustapha, Aida
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2022, 17 (06): : 3940 - 3955
  • [8] Local feature selection for multiple instance learning
    Aliasghar Shahrjooihaghighi
    Hichem Frigui
    Journal of Intelligent Information Systems, 2022, 59 : 45 - 69
  • [9] Feature selection in multi-instance learning
    Rui Gan
    Jian Yin
    Neural Computing and Applications, 2013, 23 : 907 - 912
  • [10] Feature Selection in Multi-instance Learning
    Zhang, Chun-Hua
    Tan, Jun-Yan
    Deng, Nai-Yang
    OPERATIONS RESEARCH AND ITS APPLICATIONS, 2010, 12 : 462 - +