Multiclass classification and gene selection with a stochastic algorithm

被引:19
|
作者
Le Cao, Kim-Anh [1 ,2 ,3 ]
Bonnet, Agnes [4 ]
Gadat, Sebastien [1 ,2 ]
机构
[1] Univ Toulouse, Inst Math, F-31062 Toulouse, France
[2] CNRS, UMR 5219, F-31062 Toulouse, France
[3] INRA, Stn Ameliorat Genet Animaux UR631, F-31326 Castanet Tolosan, France
[4] INRA, Lab Genet Cellulaire UMR 444, F-31326 Castanet Tolosan, France
关键词
SUPPORT VECTOR MACHINES; MULTIPLE CANCER TYPES; EXPRESSION; PREDICTION; DIAGNOSIS;
D O I
10.1016/j.csda.2009.02.028
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarray technology allows for the monitoring of thousands of gene expressions in various biological conditions, but most of these genes are irrelevant for classifying these conditions. Feature selection is consequently needed to help reduce the dimension of the variable space. Starting from the application of the stochastic meta-algorithm "Optimal Feature Weighting" (OFW) for selecting features in various classification problems, focus is made on the multiclass problem that wrapper methods rarely handle. From a computational point of view, one of the main difficulties comes from the unbalanced classes situation that is commonly encountered in microarray data. From a theoretical point of view, very few methods have been developed so far to minimize the classification error made on the minority classes. The OFW approach is developed to handle multiclass problems using CART and one-vs-one SVM classifiers. Comparisons are made with other multiclass selection algorithms such as Random Forests and the filter method F-test on five public microarray data sets with various complexities. Statistical relevancy of the gene selections is assessed by computing the performances and the stability of these different approaches and the results obtained show that the two proposed approaches are competitive and relevant to selecting genes classifying the minority classes. Application to a pig folliculogenesis study follows and a detailed interpretation of the genes that were selected shows that the OFW approach answers the biological question. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:3601 / 3615
页数:15
相关论文
共 50 条
  • [31] Gene Selection for Multiclass Prediction by Weighted Fisher Criterion
    Xuan, Jianhua
    Wang, Yue
    Dong, Yibin
    Feng, Yuanjian
    Wang, Bin
    Khan, Javed
    Bakay, Maria
    Wang, Zuyi
    Pachman, Lauren
    Winokur, Sara
    Chen, Yi-Wen
    Clarke, Robert
    Hoffman, Eric
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2007, (01):
  • [32] Improving MSVM-RFE for Multiclass Gene Selection
    Zhao, Yan-Mei
    Yang, Zhi-Xia
    COMPUTATIONAL SYSTEMS BIOLOGY, 2010, 13 : 43 - +
  • [33] Discriminative Least Squares Regression for Multiclass Classification and Feature Selection
    Xiang, Shiming
    Nie, Feiping
    Meng, Gaofeng
    Pan, Chunhong
    Zhang, Changshui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (11) : 1738 - 1754
  • [34] Stable feature selection and classification algorithms for multiclass microarray data
    Sebastian Student
    Krzysztof Fujarewicz
    Biology Direct, 7
  • [35] Multiclass cancer classification based on gene expression comparison
    Yang, Sitan
    Naiman, Daniel Q.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2014, 13 (04) : 477 - 496
  • [36] A new fast algorithm for multiclass hyperspectral image classification with SVM
    Hosseini, S. A.
    Ghassemian, H.
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2011, 32 (23) : 8657 - 8683
  • [37] Constructing large margin polytope classifiers with a multiclass classification algorithm
    Pilaszy, Istvan
    Dobrowiecki, Tadeusz
    IDAACS 2007: PROCEEDINGS OF THE 4TH IEEE WORKSHOP ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS, 2007, : 261 - 264
  • [38] A novel gene selection algorithm for cancer classification using microarray datasets
    Russul Alanni
    Jingyu Hou
    Hasseeb Azzawi
    Yong Xiang
    BMC Medical Genomics, 12
  • [39] Hybrid Algorithm Applied on Gene Selection and Classification from Different Diseases
    Montiel, L. A. H.
    IEEE LATIN AMERICA TRANSACTIONS, 2016, 14 (02) : 930 - 935
  • [40] Hybridization of Genetic and Quantum Algorithm for Gene Selection and Classification of Microarray Data
    Abderrahim, Allani
    Talbi, El-Ghazali
    Khaled, Mellouli
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 2226 - +