Feature elimination approach based on random forest for cancer diagnosis

被引:0
|
作者
Nguyen, Ha-Nam [1 ]
Vu, Trung-Nghia [1 ]
Ohn, Syng-Yup [1 ]
Park, Young-Mee [2 ]
Han, Mi Young [3 ]
Kim, Chul Woo [4 ]
机构
[1] Hankuk Aviat Univ, Dept Comp & Informat Engn, Seoul, South Korea
[2] Roswell Pk Cancer Inst, Dept Cell Stress Biol, Buffalo, NY USA
[3] Bioinfra Inc, Seoul, South Korea
[4] Seoul Natl Univ Coll Med, Tumor Immun Med Res Ctr, Dept Pathol, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of learning tasks is very sensitive to the characteristics of training data. There are several ways to increase the effect of learning performance including standardization, normalization, signal enhancement, linear or non-linear space embedding methods, etc. Among those methods, determining the relevant and informative features is one of the key steps in the data analysis process that helps to improve the performance, reduce the generation of data, and understand the characteristics of data. Researchers have developed the various methods to extract the set of relevant features but no one method prevails. Random Forest, which is an ensemble classifier based on the set of tree classifiers, turns out good classification performance. Taking advantage of Random Forest and using wrapper approach first introduced by Kohavi et al, we propose a new algorithm to find the optimal subset of features. The Random Forest is used to obtain the feature ranking values. And these values are applied to decide which features are eliminated in the each iteration of the algorithm. We conducted experiments with two public datasets: colon cancer and leukemia cancer. The experimental results of the real world data showed that the proposed method results in a higher prediction rate than a baseline method for certain data sets and also shows comparable and sometimes better performance than the feature selection methods widely used.
引用
收藏
页码:532 / +
页数:3
相关论文
共 50 条
  • [1] Structure damage detection based on random forest recursive feature elimination
    Zhou, Qifeng
    Zhou, Hao
    Zhou, Qingqing
    Yang, Fan
    Luo, Linkai
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2014, 46 (01) : 82 - 90
  • [2] DRFE: Dynamic Recursive Feature Elimination for gene identification based on Random Forest
    Nguyen, Ha-Nam
    Ohn, Syng-Yup
    NEURAL INFORMATION PROCESSING, PT 3, PROCEEDINGS, 2006, 4234 : 1 - 10
  • [3] Feature Selection of Power System Transient Stability Assessment Based on Random Forest and Recursive Feature Elimination
    Zhang, Chun
    Li, Yansong
    Yu, Zhihong
    Tian, Fang
    2016 IEEE PES ASIA-PACIFIC POWER AND ENERGY ENGINEERING CONFERENCE (APPEEC), 2016, : 1264 - 1268
  • [4] Feature elimination based random subspace ensembles learning for ECG arrhythmia diagnosis
    Shivajirao Jadhav
    Sanjay Nalbalwar
    Ashok Ghatol
    Soft Computing, 2014, 18 : 579 - 587
  • [5] Feature elimination based random subspace ensembles learning for ECG arrhythmia diagnosis
    Jadhav, Shivajirao
    Nalbalwar, Sanjay
    Ghatol, Ashok
    SOFT COMPUTING, 2014, 18 (03) : 579 - 587
  • [6] An automatically recursive feature elimination method based on threshold decision in random forest classification
    Chen, Chao
    Liang, Jintao
    Sun, Weiwei
    Yang, Gang
    Meng, Xiangchao
    GEO-SPATIAL INFORMATION SCIENCE, 2024,
  • [7] A Model-Free Feature Selection Technique of Feature Screening and Random Forest-Based Recursive Feature Elimination
    Xia, Siwei
    Yang, Yuehan
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
  • [8] A Breast Cancer Diagnosis Method Based on VIM Feature Selection and Hierarchical Clustering Random Forest Algorithm
    Huang, Zexian
    Chen, Daqi
    IEEE ACCESS, 2022, 10 : 3284 - 3293
  • [9] A Guided Random Forest based Feature Selection Approach for Activity Recognition
    Uddin, Md. Taufeeq
    Uddin, Md. Azher
    2ND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION COMMUNICATION TECHNOLOGY (ICEEICT 2015), 2015,
  • [10] Enhanced Cancer Recognition System Based on Random Forests Feature Elimination Algorithm
    Akin Ozcift
    Journal of Medical Systems, 2012, 36 : 2577 - 2585