Multiclass classification and gene selection with a stochastic algorithm

被引:19
|
作者
Le Cao, Kim-Anh [1 ,2 ,3 ]
Bonnet, Agnes [4 ]
Gadat, Sebastien [1 ,2 ]
机构
[1] Univ Toulouse, Inst Math, F-31062 Toulouse, France
[2] CNRS, UMR 5219, F-31062 Toulouse, France
[3] INRA, Stn Ameliorat Genet Animaux UR631, F-31326 Castanet Tolosan, France
[4] INRA, Lab Genet Cellulaire UMR 444, F-31326 Castanet Tolosan, France
关键词
SUPPORT VECTOR MACHINES; MULTIPLE CANCER TYPES; EXPRESSION; PREDICTION; DIAGNOSIS;
D O I
10.1016/j.csda.2009.02.028
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarray technology allows for the monitoring of thousands of gene expressions in various biological conditions, but most of these genes are irrelevant for classifying these conditions. Feature selection is consequently needed to help reduce the dimension of the variable space. Starting from the application of the stochastic meta-algorithm "Optimal Feature Weighting" (OFW) for selecting features in various classification problems, focus is made on the multiclass problem that wrapper methods rarely handle. From a computational point of view, one of the main difficulties comes from the unbalanced classes situation that is commonly encountered in microarray data. From a theoretical point of view, very few methods have been developed so far to minimize the classification error made on the minority classes. The OFW approach is developed to handle multiclass problems using CART and one-vs-one SVM classifiers. Comparisons are made with other multiclass selection algorithms such as Random Forests and the filter method F-test on five public microarray data sets with various complexities. Statistical relevancy of the gene selections is assessed by computing the performances and the stability of these different approaches and the results obtained show that the two proposed approaches are competitive and relevant to selecting genes classifying the minority classes. Application to a pig folliculogenesis study follows and a detailed interpretation of the genes that were selected shows that the OFW approach answers the biological question. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:3601 / 3615
页数:15
相关论文
共 50 条
  • [41] A TRIZ-inspired bat algorithm for gene selection in cancer classification
    Al-Betar, Mohammed Azmi
    Alomari, Osama Ahmad
    Abu-Romman, Saeid M.
    GENOMICS, 2020, 112 (01) : 114 - 126
  • [42] A hybrid LDA and genetic algorithm for gene selection and classification of microarray data
    Bonilla Huerta, Edmundo
    Duval, Beatrice
    Hao, Jin-Kao
    NEUROCOMPUTING, 2010, 73 (13-15) : 2375 - 2383
  • [43] HYBRIDIZATION OF GENETIC AND QUANTUM ALGORITHM FOR GENE SELECTION AND CLASSIFICATION OF MICROARRAY DATA
    Abderrahim, Allani
    Talbi, El-Ghazali
    Khaled, Mellouli
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2012, 23 (02) : 431 - 444
  • [44] Hybrid approach for gene selection and classification using Filter and Genetic Algorithm
    Singh, Pradeep
    Shukla, Alok
    Vardhan, Manu
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), 2017, : 832 - 837
  • [45] A novel gene selection algorithm for cancer classification using microarray datasets
    Alanni, Russul
    Hou, Jingyu
    Azzawi, Hasseeb
    Xiang, Yong
    BMC MEDICAL GENOMICS, 2019, 12 (1)
  • [46] Information Gain with Chaotic Genetic Algorithm for Gene Selection and Classification Problem
    Yang, Cheng-San
    Chuang, Li-Yeh
    Li, Jung-Chike
    Yang, Cheng-Hong
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 1127 - +
  • [47] Cauchy greedy algorithm for robust sparse recovery and multiclass classification
    Wang, Yulong
    Zou, Cuiming
    Tang, Yuan Yan
    Li, Luoqing
    Shang, Zhaowei
    SIGNAL PROCESSING, 2019, 164 : 284 - 294
  • [48] Multiclass IFROWNN classification Algorithm using OVA and OVO strategy
    Andhalkar, Shital
    Momin, B. F.
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [49] Modified Whale Optimization Algorithm for Multiclass Skin Cancer Classification
    Majid, Abdul
    Alrasheedi, Masad A.
    Alharbi, Abdulmajeed Atiah
    Allohibi, Jeza
    Lee, Seung-Won
    MATHEMATICS, 2025, 13 (06)
  • [50] Automatic EEG channel selection for multiclass brain-computer interface classification using multiobjective improved firefly algorithm
    Anurag Tiwari
    Amrita Chaturvedi
    Multimedia Tools and Applications, 2023, 82 : 5405 - 5433