A novel class dependent feature selection method for cancer biomarker discovery

Cited by: 27
Authors
Zhou, Wengang [1 ]
Dickerson, Julie A. [2 ]
Affiliations
[1] DuPont Pioneer, 7200 NW 62nd Ave, Johnston, IA 50131 USA
[2] Iowa State Univ, Elect & Comp Engn Dept, Ames, IA 50010 USA
Keywords
Feature selection; Class dependent multi-category classification; Support vector machine; Binary particle swarm optimization; Cancer biomarker discovery; PROGASTRIN-RELEASING PEPTIDE; MOLECULAR CLASSIFICATION; MUTUAL INFORMATION; GENE SELECTION; EXPRESSION; PREDICTION; CARCINOMAS; DIAGNOSIS;
DOI
10.1016/j.compbiomed.2014.01.014
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Subject classification code
07 ; 0710 ; 09 ;
Abstract
Identifying key biomarkers for different cancer types can improve diagnostic accuracy and treatment. Gene expression data can help differentiate between cancer subtypes. However, because such datasets contain few samples relative to the number of genes, classification models tend to overfit. Feature selection methods can help select the most distinguishing feature sets for classifying different cancers. A new class-dependent feature selection approach integrates the F-statistic, Maximum Relevance Binary Particle Swarm Optimization (MRBPSO) and a Class Dependent Multi-category Classification (CDMC) system. This feature selection method combines filter- and wrapper-based methods. A set of highly differentially expressed genes (features) is pre-selected for each dataset using the F-statistic as a filter for the most meaningful features. MRBPSO and CDMC function as a wrapper to select desirable feature subsets for each class and to classify the samples using those chosen class-dependent feature subsets. The performance of the proposed methods is evaluated on eight real cancer datasets. The results indicate that the class-dependent approaches can effectively identify biomarkers related to each cancer type and improve classification accuracy compared to class-independent feature selection methods. (C) 2014 Elsevier Ltd. All rights reserved.
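The filter stage described in the abstract can be illustrated with a minimal sketch. It assumes the F-statistic is the standard one-way ANOVA F-statistic computed per gene across cancer classes; the abstract does not give the exact formulation, and the function names `f_statistic` and `preselect_genes`, the `top_n` cutoff, and the toy data are all hypothetical. The MRBPSO and CDMC wrapper stages are not reproduced here.

```python
from collections import defaultdict


def f_statistic(values, labels):
    """One-way ANOVA F-statistic for one gene across sample classes.

    Assumption: this is the textbook between-class / within-class
    variance ratio, not necessarily the paper's exact formula.
    """
    groups = defaultdict(list)
    for value, label in zip(values, labels):
        groups[label].append(value)
    n, k = len(values), len(groups)
    if k < 2 or n <= k:
        raise ValueError("need at least 2 classes and more samples than classes")
    grand_mean = sum(values) / n
    between = 0.0  # between-class sum of squares
    within = 0.0   # within-class sum of squares
    for members in groups.values():
        mean = sum(members) / len(members)
        between += len(members) * (mean - grand_mean) ** 2
        within += sum((v - mean) ** 2 for v in members)
    if within == 0.0:
        # No within-class spread: perfectly separated or entirely constant.
        return float("inf") if between > 0.0 else 0.0
    return (between / (k - 1)) / (within / (n - k))


def preselect_genes(expression, labels, top_n):
    """Return indices of the top_n genes ranked by F-statistic, highest first.

    expression: list of samples, each a list of per-gene expression values.
    """
    n_genes = len(expression[0])
    scores = [
        f_statistic([sample[g] for sample in expression], labels)
        for g in range(n_genes)
    ]
    ranked = sorted(range(n_genes), key=lambda g: scores[g], reverse=True)
    return ranked[:top_n]


# Hypothetical toy data: 6 samples, 3 genes, 3 classes.
# Gene 0 differs between classes; genes 1 and 2 do not.
expression = [
    [1.0, 5.0, 2.0],
    [1.2, 5.1, 9.0],
    [3.0, 5.0, 2.5],
    [3.1, 4.9, 8.5],
    [5.0, 5.2, 2.2],
    [5.2, 5.0, 9.1],
]
labels = ["A", "A", "B", "B", "C", "C"]
selected = preselect_genes(expression, labels, top_n=1)
```

In a pipeline of the kind the abstract outlines, the indices in `selected` would form the reduced candidate pool that a wrapper search then explores separately for each class.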
Pages: 66-75
Number of pages: 10
Related papers
50 in total
  • [1] An Ensemble Feature Selection Method for Biomarker Discovery
    Shahrjooihaghighi, Aliasghar
    Frigui, Hichem
    Zhang, Xiang
    Wei, Xiaoli
    Shi, Biyun
    Trabelsi, Ameni
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2017, : 416 - 421
  • [2] A Novel Approach for Feature Selection Based on MapReduce for Biomarker Discovery
    Kourid, Ahlem
    Batouche, Mohamed
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE ANALYSIS APPLICATIONS, 2015,
  • [3] Stable feature selection for biomarker discovery
    He, Zengyou
    Yu, Weichuan
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2010, 34 (04) : 215 - 225
  • [4] A novel feature selection method to predict protein structural class
    Yuan, Mingshun
    Yang, Zijiang
    Huang, Guangzao
    Ji, Guoli
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2018, 76 : 118 - 129
  • [5] Bayesian Error Analysis for Feature Selection in Biomarker Discovery
    Pour, Ali Foroughi
    Dalton, Lori A.
    [J]. IEEE ACCESS, 2019, 7 : 127544 - 127563
  • [6] A Comparative Study of Feature Selection Methods for Biomarker Discovery
    Mungloo-Dilmohamud, Zahra
    Marigliano, Gary
    Jaufeerally-Fakim, Yasmina
    Pena-Reyes, Carlos
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 2789 - 2791
  • [7] Novel Regularization Method for Biomarker Selection and Cancer Classification
    Liu, Xiao-Ying
    Wang, Sai
    Zhang, Hai
    Zhang, Hui
    Yang, Zi-Yi
    Liang, Yong
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (04) : 1329 - 1340
  • [8] Robust Biomarker Discovery for Cancer Diagnosis Based on Meta-Ensemble Feature Selection
    Boucheham, Anouar
    Batouche, Mohamed
    [J]. 2014 SCIENCE AND INFORMATION CONFERENCE (SAI), 2014, : 452 - 460
  • [9] Research Techniques Made Simple: Feature Selection for Biomarker Discovery
    Torres, Rodrigo
    Judson-Torres, Robert L.
    [J]. JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2019, 139 (10) : 2068 - +
  • [10] Multiple Sclerosis Biomarker Discovery via Bayesian Feature Selection
    Pour, Ali Foroughi
    Dalton, Lori A.
    [J]. PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2016, : 540 - 541