Enhanced Cancer Recognition System Based on Random Forests Feature Elimination Algorithm

被引:11
|
作者
Ozcift, Akin [1 ]
机构
[1] Gaziantep Univ, Gaziantep Vocat Sch Higher Educ, Comp Programming Div, Gaziantep, Turkey
关键词
Random forests; Feature selection High-dimensional dataset; Cancer diagnosis; Classifier performance; CLASSIFICATION; SELECTION;
D O I
10.1007/s10916-011-9730-1
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Accurate classifiers are vital to design precise computer aided diagnosis (CADx) systems. Classification performances of machine learning algorithms are sensitive to the characteristics of data. In this aspect, determining the relevant and discriminative features is a key step to improve performance of CADx. There are various feature extraction methods in the literature. However, there is no universal variable selection algorithm that performs well in every data analysis scheme. Random Forests (RF), an ensemble of trees, is used in classification studies successfully. The success of RF algorithm makes it eligible to be used as kernel of a wrapper feature subset evaluator. We used best first search RF wrapper algorithm to select optimal features of four medical datasets: colon cancer, leukemia cancer, breast cancer and lung cancer. We compared accuracies of 15 widely used classifiers trained with all features versus to extracted features of each dataset. The experimental results demonstrated the efficiency of proposed feature extraction strategy with the increase in most of the classification accuracies of the algorithms.
引用
收藏
页码:2577 / 2585
页数:9
相关论文
共 50 条
  • [21] Research on Recognition Technology of Human Lower Limbs Feature Based on the Random Forest Algorithm
    Liu, Yankai
    Yu, Meijuan
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2018, 423 : 709 - 714
  • [22] Wavelet-based Feature Extraction Algorithm for an Iris Recognition System
    Panganiban, Ayra
    Linsangan, Noel
    Caluyo, Felicito
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2011, 7 (03): : 425 - 434
  • [23] Exact Recognition of Compound Features by Feature Adjacency Matrix Elimination Algorithm
    Yu Yong
    Tang Rongxi (School of Mechanical Engineering and Automation
    Computer Aided Drafting,Design and Manufacturing, 1998, Design and Manufacturing.1998 (02) : 8 - 15
  • [24] Feature Integration with Random Forests for Real-time Human Activity Recognition
    Kataoka, Hirokatsu
    Hashimoto, Kiyoshi
    Aoki, Yoshimitsu
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2014), 2015, 9445
  • [25] Traffic sign detection and recognition based on random forests
    Ellahyani, Ayoub
    El Ansari, Mohamed
    El Jaafari, Ilyas
    APPLIED SOFT COMPUTING, 2016, 46 : 805 - 815
  • [26] Git Recognition with Incomplete GEI Based on Random Forests
    Zhu, Qing
    Zhang, Jie
    COMPUTER AND INFORMATION TECHNOLOGY, 2014, 519-520 : 661 - 666
  • [27] A Shadow Elimination Algorithm Based on HSV Spatial Feature and Texture Feature
    Song, Ranran
    Liu, Min
    Wu, Minghu
    Wang, Juan
    Liu, Cong
    ADVANCES IN INTERNETWORKING, DATA & WEB TECHNOLOGIES, EIDWT-2017, 2018, 6 : 585 - 591
  • [28] SSiCP: A new SVM based recursive feature elimination algorithm for multiclass cancer classification
    Peng, S. (shpeng@shou.edu.cn), 1600, Science and Engineering Research Support Society (09):
  • [29] MGRFE: Multilayer Recursive Feature Elimination Based on an Embedded Genetic Algorithm for Cancer Classification
    Peng, Cheng
    Wu, Xinyu
    Yuan, Wen
    Zhang, Xinran
    Zhang, Yu
    Li, Ying
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (02) : 621 - 632
  • [30] DIRICHLET-TREE DISTRIBUTION ENHANCED RANDOM FORESTS FOR FACIAL FEATURE DETECTION
    Liu, Yuanyuan
    Chen, Jingying
    Shan, Cunjie
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 234 - 238