Multiclass cancer classification and biomarker discovery using GA-based algorithms

被引:117
|
作者
Liu, JJ
Cutler, G
Li, WX
Pan, Z
Peng, SH
Hoey, T
Chen, LB
Ling, XFB
机构
[1] Tularik Inc, San Francisco, CA 94080 USA
[2] Zhejiang Univ, Hangzhou 310027, Peoples R China
[3] Chinese Acad Sci, Inst Genet & Dev Biol, Beijing 100101, Peoples R China
关键词
D O I
10.1093/bioinformatics/bti419
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The development of microarray-based high-throughput gene profiling has led to the hope that this technology could provide an efficient and accurate means of diagnosing and classifying tumors, as well as predicting prognoses and effective treatments. However, the large amount of data generated by microarrays requires effective reduction of discriminant gene features into reliable sets of tumor biomarkers for such multiclass tumor discrimination. The availability of reliable sets of biomarkers, especially serum biomarkers, should have a major impact on our understanding and treatment of cancer. Results: We have combined genetic algorithm (GA) and all paired (AP) support vector machine (SVM) methods for multiclass cancer categorization. Predictive features can be automatically determined through iterative GA/SVM, leading to very compact sets of non-redundant cancer-relevant genes with the best classification performance reported to date. Interestingly, these different classifier sets harbor only modest overlapping gene features but have similar levels of accuracy in leave-one-out cross-validations (LOOCV). Further characterization of these optimal tumor discriminant features, including the use of nearest shrunken centroids (NSC), analysis of annotations and literature text mining, reveals previously unappreciated tumor subclasses and a series of genes that could be used as cancer biomarkers. With this approach, we believe that microarray-based multiclass molecular analysis can be an effective tool for cancer biomarker discovery and subsequent molecular cancer diagnosis.
引用
收藏
页码:2691 / 2697
页数:7
相关论文
共 50 条
  • [1] Critical Assessment of the Biomarker Discovery and Classification Methods for Multiclass Metabolomics
    Yang, Qingxia
    Gong, Yaguo
    Zhu, Feng
    [J]. ANALYTICAL CHEMISTRY, 2023, 95 (13) : 5542 - 5552
  • [2] Improving GA-based NoC mapping algorithms using a formal model
    Palaniveloo, Vinitha Arakkonam
    Ambrose, Jude Angelo
    Sowmya, Arcot
    [J]. 2014 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI), 2014, : 345 - 350
  • [3] Feature selection for modular GA-based classification
    Zhu, FM
    Guan, S
    [J]. APPLIED SOFT COMPUTING, 2004, 4 (04) : 381 - 393
  • [4] FEATURE SELECTION FOR CLASSIFICATION BY USING A GA-BASED NEURAL NETWORK APPROACH
    Li, Te-Sheng
    [J]. JOURNAL OF INDUSTRIAL AND PRODUCTION ENGINEERING, 2006, 23 (01) : 55 - 64
  • [5] Audio Classification Using GA-Based Fuzzy C-Means
    Kang, Myeongsu
    Kim, Jong-Myon
    [J]. FRONTIER AND INNOVATION IN FUTURE COMPUTING AND COMMUNICATIONS, 2014, 301 : 393 - 400
  • [6] Comparison of GA-Based Algorithms: A Viewpoint of Learning Scheme
    Hao, Guo-sheng
    Shi, Qiu-yi
    Wang, Gai-ge
    Zhang, Zhao-jun
    Zou, De-xuan
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INFORMATION MANAGEMENT AND NETWORK SECURITY (CIMNS 2017), 2017, : 288 - 293
  • [7] GA-Based heuristic algorithms for QoS based multicast routing
    Haghighat, AT
    Faez, K
    Dehghan, M
    Mowlaei, A
    Ghahremani, Y
    [J]. KNOWLEDGE-BASED SYSTEMS, 2003, 16 (5-6) : 305 - 312
  • [8] Multiclass Classification of Brain Cancer with Machine Learning Algorithms
    Erkal, Begum
    Basak, Selen
    Ciloglu, Alper
    Sener, Duygu Dede
    [J]. 2020 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2020,
  • [9] GA-based feature subset selection for myoelectric classification
    Oskoei, Mohammadreza Asghari
    Hu, Huosheng
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS, VOLS 1-3, 2006, : 1465 - +
  • [10] Online Algorithms for Multiclass Classification Using Partial Labels
    Bhattacharjee, Rajarshi
    Manwani, Naresh
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 249 - 260