A New GP-based Wrapper Feature Construction Approach to Classification and Biomarker Identification

Cited by: 0
Authors
Ahmed, Soha [1 ]
Zhang, Mengjie [1 ]
Peng, Lifeng [2 ]
Affiliations
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6014, New Zealand
[2] Victoria Univ Wellington, Sch Biol Sci, Wellington 6014, New Zealand
Keywords
ALGORITHM; SELECTION; CANCER;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Mass spectrometry (MS) is a technology used to identify and quantify proteins and metabolites. It supports the discovery of proteomic and metabolomic biomarkers, which aid disease detection and drug discovery. Biomarkers are detected by classifying samples from patients against samples from healthy subjects. A mass spectrometer produces high-dimensional data in which most features are irrelevant for classification, so feature reduction is needed before MS data can be classified effectively. Feature construction provides a means of dimensionality reduction and aims to improve classification performance. In this paper, genetic programming (GP) is used to construct multiple features, and two methods are proposed for this purpose. Both methods wrap a Random Forest (RF) classifier inside GP to ensure the quality of the constructed features. Five other classifiers, in addition to RF, are then used to test the effect of the constructed features on classifier performance. The results show that, on five MS data sets, the proposed GP methods improve classification performance over using the original feature set.
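The wrapper idea in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the function set, tree depth, mutation-only search, and all names (`random_tree`, `wrapper_fitness`, `evolve`) are assumptions made for illustration, and a nearest-centroid classifier stands in for the Random Forest so that the sketch needs only the Python standard library. Each individual is a small set of expression trees over the original features, and its fitness is the leave-one-out accuracy of the classifier on the constructed features.

```python
import random

# Hypothetical function set for the GP trees (an assumption, not from the paper).
OPS = {
    "+": lambda a, b: a + b,
    "-": lambda a, b: a - b,
    "*": lambda a, b: a * b,
}

def random_tree(n_features, depth, rng):
    """Grow a random expression tree; leaves are original feature indices."""
    if depth == 0 or rng.random() < 0.3:
        return rng.randrange(n_features)
    op = rng.choice(sorted(OPS))
    return (op, random_tree(n_features, depth - 1, rng),
            random_tree(n_features, depth - 1, rng))

def evaluate(tree, row):
    """Compute one constructed feature value for one sample."""
    if isinstance(tree, int):
        return row[tree]
    op, left, right = tree
    return OPS[op](evaluate(left, row), evaluate(right, row))

def transform(trees, X):
    """Map each sample to its vector of constructed features."""
    return [[evaluate(t, row) for t in trees] for row in X]

def centroid_predict(train_X, train_y, row):
    """Nearest-centroid classifier, a stand-in for the wrapped RF."""
    best_label, best_dist = None, float("inf")
    for label in sorted(set(train_y)):
        members = [x for x, y in zip(train_X, train_y) if y == label]
        centroid = [sum(col) / len(members) for col in zip(*members)]
        dist = sum((a - b) ** 2 for a, b in zip(row, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label

def wrapper_fitness(trees, X, y):
    """Leave-one-out accuracy of the classifier on the constructed features."""
    Z = transform(trees, X)
    hits = 0
    for i in range(len(Z)):
        train_Z = Z[:i] + Z[i + 1:]
        train_y = y[:i] + y[i + 1:]
        hits += centroid_predict(train_Z, train_y, Z[i]) == y[i]
    return hits / len(Z)

def mutate(trees, n_features, rng):
    """Replace one tree of the individual with a fresh random tree."""
    child = list(trees)
    child[rng.randrange(len(child))] = random_tree(n_features, 2, rng)
    return child

def evolve(X, y, n_trees=2, pop_size=20, generations=10, seed=0):
    """Tiny elitist search: keep the best half, refill with mutants."""
    rng = random.Random(seed)
    n_features = len(X[0])
    pop = [[random_tree(n_features, 2, rng) for _ in range(n_trees)]
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda ind: wrapper_fitness(ind, X, y), reverse=True)
        survivors = pop[: pop_size // 2]
        pop = survivors + [mutate(rng.choice(survivors), n_features, rng)
                           for _ in range(pop_size - len(survivors))]
    return max(pop, key=lambda ind: wrapper_fitness(ind, X, y))
```

Calling `evolve(X, y)` with `X` as a list of numeric feature rows and `y` as a list of integer class labels returns the best set of trees found, and `transform(best, X)` yields the reduced representation. A fuller implementation would add crossover and would score fitness on held-out folds with the actual wrapped classifier.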
Pages: 2756-2763
Number of pages: 8
Related papers
50 in total
  • [1] Using Feature Clustering for GP-Based Feature Construction on High-Dimensional Data
    Binh Tran
    Xue, Bing
    Zhang, Mengjie
    [J]. GENETIC PROGRAMMING, EUROGP 2017, 2017, 10196 : 210 - 226
  • [2] A GP-based Kernel Construction and Optimization Method for RVM
    Wu Bing
    Zhang Wen-qiong
    Chen Ling
    Liang Jia-hong
    [J]. 2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 4, 2010, : 419 - 423
  • [3] An autonomous GP-based system for regression and classification problems
    Oltean, Mihai
    Diosan, Laura
    [J]. APPLIED SOFT COMPUTING, 2009, 9 (01) : 49 - 60
  • [4] A Wrapper Feature Selection Approach to Classification with Missing Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2016, PT I, 2016, 9597 : 685 - 700
  • [5] Visual feature selection for GP-based localization using an omnidirectional camera
    Do, Huan N.
    Choi, Jongeun
    Lim, Chae Young
    [J]. 2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 4210 - 4215
  • [6] Multiple Feature Construction for Effective Biomarker Identification and Classification using Genetic Programming
    Ahmed, Soha
    Zhang, Mengjie
    Peng, Lifeng
    Xue, Bing
    [J]. GECCO'14: PROCEEDINGS OF THE 2014 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2014, : 249 - 256
  • [7] A Novel Filter-Wrapper Based Feature Selection Approach for Cancer Data Classification
    Mufassirin, M. M. Mohamed
    Ragel, Roshan G.
    [J]. 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS' 2018), 2018,
  • [8] Ranked MSD: A New Feature Ranking and Feature Selection Approach for Biomarker Identification
    Verma, Ghanshyam
    Jha, Alokkumar
    Rebholz-Schuhmann, Dietrich
    Madden, Michael G.
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, CD-MAKE 2019, 2019, 11713 : 147 - 167
  • [9] Key Process Variable Identification for Quality Classification Based on PLSR Model and Wrapper Feature Selection
    Tian, Wen-meng
    He, Zhen
    Yan, Wei
    [J]. PROCEEDINGS OF 2012 3RD INTERNATIONAL ASIA CONFERENCE ON INDUSTRIAL ENGINEERING AND MANAGEMENT INNOVATION (IEMI2012), 2013, : 263 - 270
  • [10] A new hybrid filter/wrapper algorithm for feature selection in classification
    Zhang, Jixiong
    Xiong, Yanmei
    Min, Shungeng
    [J]. ANALYTICA CHIMICA ACTA, 2019, 1080 : 43 - 54