Enhanced Cancer Recognition System Based on Random Forests Feature Elimination Algorithm

被引:11
|
作者
Ozcift, Akin [1 ]
机构
[1] Gaziantep Univ, Gaziantep Vocat Sch Higher Educ, Comp Programming Div, Gaziantep, Turkey
关键词
Random forests; Feature selection High-dimensional dataset; Cancer diagnosis; Classifier performance; CLASSIFICATION; SELECTION;
D O I
10.1007/s10916-011-9730-1
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Accurate classifiers are vital to design precise computer aided diagnosis (CADx) systems. Classification performances of machine learning algorithms are sensitive to the characteristics of data. In this aspect, determining the relevant and discriminative features is a key step to improve performance of CADx. There are various feature extraction methods in the literature. However, there is no universal variable selection algorithm that performs well in every data analysis scheme. Random Forests (RF), an ensemble of trees, is used in classification studies successfully. The success of RF algorithm makes it eligible to be used as kernel of a wrapper feature subset evaluator. We used best first search RF wrapper algorithm to select optimal features of four medical datasets: colon cancer, leukemia cancer, breast cancer and lung cancer. We compared accuracies of 15 widely used classifiers trained with all features versus to extracted features of each dataset. The experimental results demonstrated the efficiency of proposed feature extraction strategy with the increase in most of the classification accuracies of the algorithms.
引用
收藏
页码:2577 / 2585
页数:9
相关论文
共 50 条
  • [41] A Hybrid Approach for Feature Selection Based on Genetic Algorithm and Recursive Feature Elimination
    Rani, Pooja
    Kumar, Rajneesh
    Jain, Anurag
    Chawla, Sunil Kumar
    INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2021, 12 (02) : 17 - 38
  • [42] Object Recognition Based on Dynamic Random Forests and SURF Descriptor
    Jayech, Khaoula
    Mahjoub, Mohamed Ali
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2017, 2017, 10585 : 355 - 364
  • [43] A feature selection algorithm for intrusion detection system based on the enhanced heuristic optimizer
    Yu, Hongchen
    Zhang, Wei
    Kang, Chunying
    Xue, Yankun
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 265
  • [44] Rectal Cancer Outcome Prediction Based On Institutional Data with Random Forests and Random Survival Forests
    Huang, M.
    Zhong, H.
    Liu, D.
    Gabriel, P.
    Ben-Josef, E.
    Yin, L.
    Geng, H.
    Cheng, C.
    Bilker, W.
    Xiao, Y.
    MEDICAL PHYSICS, 2017, 44 (06)
  • [45] Fault Diagnosis for Power Converters based on Random Forests and Feature Transformation
    Kou, Lei
    Liu, Chuang
    Cai, Guo-wei
    Zhang, Zhe
    Li, Xue-jiao
    Yuan, Quan-de
    2020 IEEE 9TH INTERNATIONAL POWER ELECTRONICS AND MOTION CONTROL CONFERENCE (IPEMC2020-ECCE ASIA), 2020, : 1821 - 1826
  • [46] Feature Selection with Random Forests Predicting Metagenome-Based Disease
    Huong Hoang Luong
    Thanh Huyen Nguyen Thi
    An Duc Le
    Hai Thanh Nguyen
    ARTIFICIAL INTELLIGENCE AND SUSTAINABLE COMPUTING FOR SMART CITY, AIS2C2 2021, 2021, 1434 : 254 - 266
  • [47] Particle Swarm Optimization with Random Forests for Handwritten Arabic Recognition System
    Sahlol, Ahmed
    Abd Elfattah, Mohamed
    Suen, Ching Y.
    Hassanien, Aboul Ella
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 437 - 446
  • [48] Structure damage detection based on random forest recursive feature elimination
    Zhou, Qifeng
    Zhou, Hao
    Zhou, Qingqing
    Yang, Fan
    Luo, Linkai
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2014, 46 (01) : 82 - 90
  • [49] Predicting SDC Vulnerability of Instructions Based on Random Forests Algorithm
    Liu, LiPing
    Ci, LinLin
    Liu, Wei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT III, 2018, 11336 : 593 - 607
  • [50] Face Tracking Algorithm based on Online random forests Detection
    Bao, Fang
    Zhang, Yankai
    14TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS, ENGINEERING AND SCIENCE (DCABES 2015), 2015, : 320 - 323