The Impact of Under-sampling on the Performance of Bootstrap-based Ensemble Feature Selection

Cited by: 0
Authors
Guney, Huseyin [1]
Oztoprak, Huseyin [1]
Affiliations
[1] Uluslararasi Kibris Univ, Bilgisayar Muhendisligi Bolumu, Lefkosa, Turkey
Keywords
Support Vector Machine (SVM); Ensemble Feature Selection; Bootstrapping; Bagging; Under-sampling; Support Vector Machine Recursive Feature Elimination (SVM-RFE)
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract
DNA microarrays are a promising tool for cancer diagnosis and prognosis. However, microarray data are high-dimensional, which makes gene selection a difficult task. Bootstrap-based ensemble feature selection (bagging) has recently become popular and has shown significant improvements in this field. The method uses bootstrap resampling to generate several slightly different sampled datasets from the training dataset. It then aggregates the ranked feature lists generated from these sampled datasets to obtain the final (ensemble) feature list. The performance of bagging increases with the diversity of the generated sampled datasets. Therefore, it is proposed to apply bootstrap resampling to an under-sampled subset of the training set, rather than to the entire training set, in order to improve classification performance and gene selection stability. The proposed method was evaluated on four microarray datasets, using a support vector machine (SVM) as the classifier and SVM recursive feature elimination (SVM-RFE) as the feature selection technique. The results show that the 50% under-sampling approach achieves classification performance similar to that of the conventional approach and outperforms it in terms of gene selection stability. In addition, 50% under-sampling uses only half of the samples of the training dataset at each run of the ensemble method, so it has a lower computational cost.
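The procedure described in the abstract (under-sample the training set, bootstrap the subset, rank features on each bag, aggregate the rankings) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the rank-sum aggregation, and the default parameters are hypothetical, and `rank_features` is a simple class-mean-difference stand-in rather than SVM-RFE, chosen so the sketch needs only the Python standard library.

```python
import random
from statistics import mean


def undersample(indices, fraction, rng):
    """Draw a random subset of the training indices without replacement."""
    k = max(2, round(fraction * len(indices)))
    return rng.sample(indices, k)


def bootstrap(indices, rng):
    """Resample the given indices with replacement, keeping the same size."""
    return [rng.choice(indices) for _ in indices]


def rank_features(X, y, rows):
    """Stand-in ranker (NOT SVM-RFE): order features by the absolute
    difference between the two class means, best feature first."""
    n_features = len(X[0])
    scores = []
    for j in range(n_features):
        pos = [X[i][j] for i in rows if y[i] == 1]
        neg = [X[i][j] for i in rows if y[i] == 0]
        # A resample may miss a class entirely; score such features as 0.
        scores.append(abs(mean(pos) - mean(neg)) if pos and neg else 0.0)
    return sorted(range(n_features), key=lambda j: -scores[j])


def ensemble_ranking(X, y, n_bags=20, fraction=0.5, seed=0):
    """Aggregate per-bag rankings by summed rank position (lower is better).

    fraction=1.0 corresponds to bootstrapping the entire training set;
    fraction=0.5 corresponds to 50% under-sampling before bootstrapping.
    """
    rng = random.Random(seed)
    all_rows = list(range(len(X)))
    n_features = len(X[0])
    rank_sums = [0] * n_features
    for _ in range(n_bags):
        subset = undersample(all_rows, fraction, rng)  # under-sampling step
        rows = bootstrap(subset, rng)                  # bootstrap step
        for position, feature in enumerate(rank_features(X, y, rows)):
            rank_sums[feature] += position
    return sorted(range(n_features), key=lambda j: rank_sums[j])


if __name__ == "__main__":
    # Toy data (hypothetical): 20 samples, 5 features, feature 0 informative.
    data_rng = random.Random(1)
    y = [i % 2 for i in range(20)]
    X = [[10.0 * label + data_rng.random()] +
         [data_rng.random() for _ in range(4)] for label in y]
    print(ensemble_ranking(X, y, n_bags=10, fraction=0.5))
```

With `fraction=0.5`, each bag ranks features on half as many samples as the conventional approach, which is the source of the reduced computational cost mentioned in the abstract. To approximate the evaluated setup more closely, the stand-in ranker would be replaced by an SVM-RFE ranking.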
Pages: 4
Related papers
50 in total
  • [21] The psychometric function: II. Bootstrap-based confidence intervals and sampling
    Wichmann, FA
    Hill, NJ
    [J]. PERCEPTION & PSYCHOPHYSICS, 2001, 63 (08): : 1314 - 1329
  • [22] A binary PSO-based ensemble under-sampling model for rebalancing imbalanced training data
    Jinyan Li
    Yaoyang Wu
    Simon Fong
    Antonio J. Tallón-Ballesteros
    Xin-she Yang
    Sabah Mohammed
    Feng Wu
    [J]. The Journal of Supercomputing, 2022, 78 : 7428 - 7463
  • [23] An Imbalanced Multi-Label Data Ensemble Learning Method Based on Safe Under-Sampling
    Sun, Zhong-Bin
    Diao, Yu-Xuan
    Ma, Su-Yang
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (10): : 3392 - 3408
  • [24] Class Imbalance Problem: A Wrapper-Based Approach using Under-Sampling with Ensemble Learning
    Sikora, Riyaz
    Lee, Yoon Sang
    [J]. INFORMATION SYSTEMS FRONTIERS, 2024,
  • [25] A binary PSO-based ensemble under-sampling model for rebalancing imbalanced training data
    Li, Jinyan
    Wu, Yaoyang
    Fong, Simon
    Tallon-Ballesteros, Antonio J.
    Yang, Xin-she
    Mohammed, Sabah
    Wu, Feng
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (05): : 7428 - 7463
  • [26] An exact bootstrap-based bandwidth selection rule for kernel quantile estimators
    Liu, Xiaoyu
    Song, Yan
    Zhang, Kun
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (08) : 3699 - 3720
  • [27] BOOTSTRAP-BASED PENALTY CHOICE FOR THE LASSO, ACHIEVING ORACLE PERFORMANCE
    Hall, Peter
    Lee, Eun Ryung
    Park, Byeong U.
    [J]. STATISTICA SINICA, 2009, 19 (02) : 449 - 471
  • [28] Several SVM Ensemble Methods Integrated with Under-Sampling for Imbalanced Data Learning
    Lin, ZhiYong
    Hao, ZhiFeng
    Yang, XiaoWei
    Liu, XiaoLan
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 536 - +
  • [29] Ensemble learning based on approximate reducts and bootstrap sampling
    Jiang, Feng
    Yu, Xu
    Du, Junwei
    Gong, Dunwei
    Zhang, Youqiang
    Peng, Yanjun
    [J]. INFORMATION SCIENCES, 2021, 547 : 797 - 813
  • [30] Classification of pulsar signals using ensemble gradient boosting algorithms based on asymmetric under-sampling method
    Tariq, I
    Qiao, M.
    Wei, L.
    Yao, S.
    Zhou, C.
    Ali, Z.
    Azeem, S. W.
    Spanakis-Misirlis, A.
    [J]. JOURNAL OF INSTRUMENTATION, 2022, 17 (03):