A new feature selection approach based on ensemble methods in semi-supervised classification

Authors
Nesma Settouti
Mohamed Amine Chikh
Vincent Barra
Affiliations
[1] LIMOS, Biomedical Engineering Laboratory GBM
[2] CNRS
[3] UMR 6158
[4] LIMOS
[5] Clermont-Université, Université Blaise Pascal
[6] Tlemcen University
Source
Pattern Analysis and Applications, 2017, 20 (03)
Keywords
Feature selection; Semi-supervised learning; Ensemble methods; Co-forest; Random Forest; Large datasets; Medical diagnosis
Abstract
In computer-aided medical systems, many practical classification applications are confronted with a massive growth in the collection and storage of data. This is especially the case in areas such as the prediction of medical test efficiency, the classification of tumors and the detection of cancers. Data with known class labels (labeled data) can be limited, but unlabeled data (with unknown class labels) are more readily available. Semi-supervised learning deals with methods that exploit the unlabeled data in addition to the labeled data to improve performance on the classification task. In this paper, we consider the problem of using a large amount of unlabeled data to improve the efficiency of feature selection in high-dimensional datasets when only a small set of labeled examples is available. We propose a new semi-supervised feature evaluation method called Optimized co-Forest for Feature Selection (OFFS), which combines ideas from co-forest with the embedded feature selection principle of Random Forest, based on permutation of the out-of-bag set. We provide empirical results on several medical and biological benchmark datasets, indicating an overall significant improvement of OFFS over four other feature selection approaches of the filter, wrapper and embedded types in semi-supervised learning. Our method demonstrates its ability to select features and measure their importance, improving the performance of the hypothesis learned from a small amount of labeled samples by exploiting unlabeled samples.
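As a rough illustration of the out-of-bag permutation idea mentioned in the abstract, the sketch below estimates feature importance as the drop in out-of-bag accuracy when one feature's values are shuffled. It is not the authors' OFFS method: it omits the co-forest step that exploits unlabeled data, it uses single-split decision stumps instead of full trees, and every function name and the toy dataset are hypothetical choices made for this example.

```python
import random

def fit_stump(X, y):
    """Fit a one-split classifier: (feature, threshold, left_label, right_label)."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            left = [yi for row, yi in zip(X, y) if row[j] <= t]
            right = [yi for row, yi in zip(X, y) if row[j] > t]
            # Majority label per side; an empty right side reuses the left label.
            l_lab = max(set(left), key=left.count)
            r_lab = max(set(right), key=right.count) if right else l_lab
            errors = sum(yi != l_lab for yi in left) + sum(yi != r_lab for yi in right)
            if best is None or errors < best[0]:
                best = (errors, j, t, l_lab, r_lab)
    return best[1:]

def accuracy(stump, X, y):
    j, t, l_lab, r_lab = stump
    hits = sum((l_lab if row[j] <= t else r_lab) == yi for row, yi in zip(X, y))
    return hits / len(y)

def oob_permutation_importance(X, y, n_trees=25, seed=0):
    """Average drop in out-of-bag accuracy after shuffling each feature."""
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    totals, used = [0.0] * d, 0
    for _ in range(n_trees):
        boot = [rng.randrange(n) for _ in range(n)]   # bootstrap sample (in-bag)
        in_bag = set(boot)
        oob = [i for i in range(n) if i not in in_bag]  # samples left out of the bag
        if not oob:
            continue
        stump = fit_stump([X[i] for i in boot], [y[i] for i in boot])
        X_oob = [list(X[i]) for i in oob]
        y_oob = [y[i] for i in oob]
        base = accuracy(stump, X_oob, y_oob)
        for j in range(d):
            col = [row[j] for row in X_oob]
            rng.shuffle(col)  # break the link between feature j and the label
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X_oob, col)]
            totals[j] += base - accuracy(stump, X_perm, y_oob)
        used += 1
    return [t / used for t in totals] if used else totals

# Toy data (hypothetical): feature 0 determines the label, feature 1 is noise.
X = [[i, (i * 7) % 5] for i in range(20)]
y = [0 if i < 10 else 1 for i in range(20)]
importances = oob_permutation_importance(X, y)
print(importances)
```

On this toy data the informative feature should receive a clearly larger score than the noise feature; a real study would use full trees and a much larger ensemble.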
Pages: 673 - 686
Number of pages: 13
Related papers
  • [1] A new feature selection approach based on ensemble methods in semi-supervised classification
    Settouti, Nesma
    Chikh, Mohamed Amine
    Barra, Vincent
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2017, 20 (03) : 673 - 686
  • [2] Ensemble-Based Feature Ranking for Semi-supervised Classification
    Petkovic, Matej
    Dzeroski, Saso
    Kocev, Dragi
    [J]. DISCOVERY SCIENCE (DS 2019), 2019, 11828 : 290 - 305
  • [3] Weighting Based Approach for Semi-supervised Feature Selection
    Benabdeslem, Khalid
    Hindawi, Mohammed
    Makkhongkaew, Raywat
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2015, PT IV, 2015, 9492 : 300 - 307
  • [4] Semi-supervised Feature Selection for Gender Classification
    Wu, Jing
    Smith, William A. P.
    Hancock, Edwin R.
    [J]. COMPUTER VISION - ACCV 2009, PT II, 2010, 5995 : 23 - 33
  • [5] A Survey on semi-supervised feature selection methods
    Sheikhpour, Razieh
    Sarram, Mehdi Agha
    Gharaghani, Sajjad
    Chahooki, Mohammad Ali Zare
    [J]. PATTERN RECOGNITION, 2017, 64 : 141 - 158
  • [6] Joint Semi-Supervised Feature Selection and Classification through Bayesian Approach
    Jiang, Bingbing
    Wu, Xingyu
    Yu, Kui
    Chen, Huanhuan
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019 : 3983 - 3990
  • [7] Adaptive Feature Selection and Feature Fusion for Semi-supervised Classification
    Wei Du
    Ronald Phlypo
    Tülay Adalı
    [J]. Journal of Signal Processing Systems, 2019, 91 : 521 - 537
  • [8] Adaptive Feature Selection and Feature Fusion for Semi-supervised Classification
    Du, Wei
    Phlypo, Ronald
    Adali, Tulay
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 91 (05) : 521 - 537
  • [9] A novel feature selection based semi-supervised method for image classification
    Tahir, M. A.
    Smith, J. E.
    Caleb-Solly, P.
    [J]. COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 484 - 493
  • [10] Mass Classification in Mammogram with Semi-Supervised Relief Based Feature Selection
    Liu, Xiaoming
    Liu, Jun
    Feng, Zhilin
    Xu, Xin
    Tang, J.
    [J]. FIFTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2013), 2014, 9069