Enhanced Cancer Recognition System Based on Random Forests Feature Elimination Algorithm

被引:11
|
作者
Ozcift, Akin [1 ]
机构
[1] Gaziantep Univ, Gaziantep Vocat Sch Higher Educ, Comp Programming Div, Gaziantep, Turkey
关键词
Random forests; Feature selection High-dimensional dataset; Cancer diagnosis; Classifier performance; CLASSIFICATION; SELECTION;
D O I
10.1007/s10916-011-9730-1
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Accurate classifiers are vital to design precise computer aided diagnosis (CADx) systems. Classification performances of machine learning algorithms are sensitive to the characteristics of data. In this aspect, determining the relevant and discriminative features is a key step to improve performance of CADx. There are various feature extraction methods in the literature. However, there is no universal variable selection algorithm that performs well in every data analysis scheme. Random Forests (RF), an ensemble of trees, is used in classification studies successfully. The success of RF algorithm makes it eligible to be used as kernel of a wrapper feature subset evaluator. We used best first search RF wrapper algorithm to select optimal features of four medical datasets: colon cancer, leukemia cancer, breast cancer and lung cancer. We compared accuracies of 15 widely used classifiers trained with all features versus to extracted features of each dataset. The experimental results demonstrated the efficiency of proposed feature extraction strategy with the increase in most of the classification accuracies of the algorithms.
引用
收藏
页码:2577 / 2585
页数:9
相关论文
共 50 条
  • [1] Enhanced Cancer Recognition System Based on Random Forests Feature Elimination Algorithm
    Akin Ozcift
    Journal of Medical Systems, 2012, 36 : 2577 - 2585
  • [2] Facial Feature Selection for Gender Recognition based on Random Decision Forests
    Kayim, Guney
    Sari, Cihan
    Akgul, Ceyhun Burak
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [3] Feature elimination approach based on random forest for cancer diagnosis
    Nguyen, Ha-Nam
    Vu, Trung-Nghia
    Ohn, Syng-Yup
    Park, Young-Mee
    Han, Mi Young
    Kim, Chul Woo
    MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 532 - +
  • [4] A Mandarin Tone Recognition Algorithm Based on Random Forest and Feature Fusion
    Yan, Jiameng
    Meng, Qiang
    Tian, Lan
    Wang, Xiaoyu
    Liu, Junhui
    Li, Meng
    Zeng, Ming
    Xu, Huifang
    MATHEMATICS, 2023, 11 (08)
  • [5] Signal recognition algorithm based on random forests for spectrum sensing in cognitive network
    Wang, J. (wjk@mail.neuq.edu.cn), 1600, Binary Information Press (11):
  • [6] CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests
    Li Ma
    Suohai Fan
    BMC Bioinformatics, 18
  • [7] CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests
    Ma, Li
    Fan, Suohai
    BMC BIOINFORMATICS, 2017, 18
  • [8] EFFNet: A skin cancer classification model based on feature fusion and random forests
    Ma, Xiaopu
    Shan, Jiangdan
    Ning, Fei
    Li, Wentao
    Li, He
    PLOS ONE, 2023, 18 (10):
  • [9] Gene Selection Using Iterative Feature Elimination Random Forests for Survival Outcomes
    Pang, Herbert
    George, Stephen L.
    Hui, Ken
    Tong, Tiejun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (05) : 1422 - 1431
  • [10] Face Recognition Based on Random Feature
    Li, Shasha
    Deng, Weihong
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,