A novel class dependent feature selection method for cancer biomarker discovery

被引:27
|
作者
Zhou, Wengang [1 ]
Dickerson, Julie A. [2 ]
机构
[1] DuPont Pioneer, 7200 NW 62nd Ave, Johnston, IA 50131 USA
[2] Iowa State Univ, Elect & Comp Engn Dept, Ames, IA 50010 USA
关键词
Feature selection; Class dependent multi-category classification; Support vector machine; Binary particle swarm optimization; Cancer biomarker discovery; PROGASTRIN-RELEASING PEPTIDE; MOLECULAR CLASSIFICATION; MUTUAL INFORMATION; GENE SELECTION; EXPRESSION; PREDICTION; CARCINOMAS; DIAGNOSIS;
D O I
10.1016/j.compbiomed.2014.01.014
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying key biomarkers for different cancer types can improve diagnosis accuracy and treatment. Gene expression data can help differentiate between cancer subtypes. However the limitation of having a small number of samples versus a larger number of genes represented in a dataset leads to the overfitting of classification models. Feature selection methods can help select the most distinguishing feature sets for classifying different cancers. A new class dependent feature selection approach integrates the F-statistic, Maximum Relevance Binary Particle Swarm Optimization (MRBPSO) and Class Dependent Multi-category Classification (CDMC) system. This feature selection method combines filter and wrapper based methods. A set of highly differentially expressed genes (features) are pre-selected using the F statistic for each dataset as a filter for selecting the most meaningful features. MRBPSO and CDMC function as a wrapper to select desirable feature subsets for each class and classify the samples using those chosen class-dependent feature subsets. The performance of the proposed methods is evaluated on eight real cancer datasets. The results indicate that the class-dependent approaches can effectively identify biomarkers related to each cancer type and improve classification accuracy compared to class independent feature selection methods. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:66 / 75
页数:10
相关论文
共 50 条
  • [41] A Novel Feature Selection Method for Fault Diagnosis
    Voulgaris, Zacharias
    Sconyers, Chris
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2010, 339 : 262 - 269
  • [42] Biomarker discovery in inflammatory bowel diseases using network-based feature selection
    Abbas, Mostafa
    Matta, John
    Thanh Le
    Bensmail, Halima
    Obafemi-Ajayi, Tayo
    Honavar, Vasant
    EL-Manzalawy, Yasser
    [J]. PLOS ONE, 2019, 14 (11):
  • [43] Feature Selection Methods for Early Predictive Biomarker Discovery Using Untargeted Metabolomic Data
    Grissa, Dhouha
    Petera, Melanie
    Brandolini, Marion
    Napoli, Amedeo
    Comte, Blandine
    Pujos-Guillot, Estelle
    [J]. FRONTIERS IN MOLECULAR BIOSCIENCES, 2016, 3
  • [44] A critical assessment of the feature selection methods used for biomarker discovery in current metaproteomics studies
    Tang, Jing
    Wang, Yunxia
    Fu, Jianbo
    Zhou, Ying
    Luo, Yongchao
    Zhang, Ying
    Li, Bo
    Yang, Qingxia
    Xue, Weiwei
    Lou, Yan
    Qiu, Yunqing
    Zhu, Feng
    [J]. BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1378 - 1390
  • [45] Optimizing hybrid ensemble feature selection strategies for transcriptomic biomarker discovery in complex diseases
    Claude, Elsa
    Leclercq, Mickael
    Thebault, Patricia
    Droit, Arnaud
    Uricaru, Raluca
    [J]. NAR GENOMICS AND BIOINFORMATICS, 2024, 6 (03)
  • [46] sEnhanced Feature Selection for Biomarker Discovery in LC-MS Data using GP
    Ahmed, Soha
    Zhang, Mengjie
    Peng, Lifeng
    [J]. 2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 584 - 591
  • [47] Novel Semi-feature selection method based on hybrid feature selection mechanism
    Zheng, Shangzhi
    Bu, Hualong
    [J]. 2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 4, 2010, : 590 - 593
  • [48] A two-stage gene selection method for biomarker discovery from microarray data for cancer classification
    Shukla, Alok Kumar
    Singh, Pradeep
    Vardhan, Manu
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 183 : 47 - 58
  • [49] A novel approach to feature selection based on analysis of class regions
    Thawonmas, R
    Abe, S
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1997, 27 (02): : 196 - 207
  • [50] A Novel Neighborhood Rough Set-Based Feature Selection Method and Its Application to Biomarker Identification of Schizophrenia
    Xing, Ying
    Kochunov, Peter
    van Erp, Theo G. M.
    Ma, Tianzhou
    Calhoun, Vince D.
    Du, Yuhui
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (01) : 215 - 226