Feature selection for outcome prediction in oesophageal cancer using genetic algorithm and random forest classifier

被引:84
|
作者
Paul, Desbordes [1 ,2 ]
Su, Ruan [1 ]
Romain, Modzelewski [1 ,3 ]
Sebastien, Vauclin [2 ]
Pierre, Vera [1 ,3 ]
Isabelle, Gardin [1 ,3 ]
机构
[1] Univ Rouen, LITIS QUANTIF, 22 Blvd Gambetta, F-76000 Rouen, France
[2] DOSISOFT, 45-47 Ave Carnot, F-94230 Cachan, France
[3] Henri Becquerel Ctr, 1 Rue Amiens, F-76038 Rouen, France
关键词
Feature selection; Oesophageal cancer; Random forest; Genetic algorithm; Radiomics; TEXTURE ANALYSIS; TUMOR VOLUME; IMAGES;
D O I
10.1016/j.compmedimag.2016.12.002
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
The outcome prediction of patients can greatly help to personalize cancer treatment. A large amount of quantitative features (clinical exams, imaging,...) are potentially useful to assess the patient outcome. The challenge is to choose the most predictive subset of features. In this paper, we propose a new feature selection strategy called GARF (genetic algorithm based on random forest) extracted from positron emission tomography (PET) images and clinical data. The most relevant features, predictive of the therapeutic response or which are prognoses of the patient survival 3 years after the end of treatment, were selected using 'GARF on a cohort of 65 patients with a local advanced oesophageal cancer eligible for chemoradiation therapy. The most relevant predictive results were obtained with a subset of 9 features leading to a random forest misclassification rate of 18 +/- 4% and an areas under the of receiver operating characteristic (ROC) curves (AUC) of 0.823 +/- 0.032. The most relevant prognostic results were obtained with 8 features leading to an error rate of 20 +/- 7% and an AUC of 0.750 +/- 0.108. Both predictive and prognostic results show better performances using GARF than using 4 other studied methods. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:42 / 49
页数:8
相关论文
共 50 条
  • [41] Breast Cancer Classification with Random Forest Classifier with Feature Decomposition Using Principal Component Analysis
    Chudhey, Arshdeep Singh
    Goel, Mohak
    Singh, Mrityunjay
    [J]. ADVANCES IN DATA AND INFORMATION SCIENCES, 2022, 318 : 111 - 120
  • [42] Breast Cancer Classification with Random Forest Classifier with Feature Decomposition Using Principal Component Analysis
    Abd Manan, Nur Anis Syarafinaz
    Ahmad, Wan Amiza Amneera Wan
    Sulaiman, Nik Meriam Nik
    Mahmood, Noor Zalina
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON GREEN ENVIRONMENTAL ENGINEERING AND TECHNOLOGY (ICONGEET 2021), 2022, 214 : 385 - 389
  • [43] Default Risk Prediction Using Random Forest and XGBoosting Classifier
    Sharma, Alok Kumar
    Li, Li-Hua
    Ahmad, Ramli
    [J]. 2021 INTERNATIONAL CONFERENCE ON SECURITY AND INFORMATION TECHNOLOGIES WITH AI, INTERNET COMPUTING AND BIG-DATA APPLICATIONS, 2023, 314 : 91 - 101
  • [44] Relevant feature selection and ensemble classifier design using bi-objective genetic algorithm
    Das, Asit Kumar
    Pati, Soumen Kumar
    Ghosh, Arka
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (02) : 423 - 455
  • [45] Relevant feature selection and ensemble classifier design using bi-objective genetic algorithm
    Asit Kumar Das
    Soumen Kumar Pati
    Arka Ghosh
    [J]. Knowledge and Information Systems, 2020, 62 : 423 - 455
  • [46] Football Match Result Prediction Using the Random Forest Classifier
    Pugsee, Pakawan
    Pattawong, Pattarachai
    [J]. PROCEEDINGS OF 2019 2ND INTERNATIONAL CONFERENCE ON BIG DATA TECHNOLOGIES (ICBDT 2019), 2019, : 154 - 158
  • [47] Feature selection and classification of leukocytes using random forest
    Saraswat, Mukesh
    Arya, K. V.
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2014, 52 (12) : 1041 - 1052
  • [48] A Risk Prediction Model for Type 2 Diabetes Based on Weighted Feature Selection of Random Forest and XGBoost Ensemble Classifier
    Xu, Zhongxian
    Wang, Zhiliang
    [J]. 2019 ELEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI 2019), 2019, : 278 - 283
  • [49] Feature selection with genetic algorithm for protein function prediction
    Santos, Bruno C.
    Rodrigues, Marcos W.
    Pinto, Cristiano L. N.
    Nobre, Cristiane N.
    Zarate, Luis E.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 2434 - 2439
  • [50] Feature selection and classification of leukocytes using random forest
    Mukesh Saraswat
    K. V. Arya
    [J]. Medical & Biological Engineering & Computing, 2014, 52 : 1041 - 1052