A centroid-based gene selection method for microarray data classification

被引:22
|
作者
Guo, Shun [1 ,2 ]
Guo, Donghui [1 ]
Chen, Lifei [3 ]
Jiang, Qingshan [2 ]
机构
[1] Xiamen Univ, Dept Elect Engn, Xiamen 361005, Fujian, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518000, Peoples R China
[3] Fujian Normal Univ, Sch Math & Comp Sci, Fuzhou 350117, Fujian, Peoples R China
基金
中国国家自然科学基金; 高等学校博士学科点专项科研基金;
关键词
Class centroid; Microarray data; Classification; L1; regularization; Gene selection; DISCRIMINANT-ANALYSIS; ALGORITHMS; EFFICIENT;
D O I
10.1016/j.jtbi.2016.03.034
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
For classification problems based on microarray data, the data typically contains a large number of irrelevant and redundant features. In this paper, a new gene selection method is proposed to choose the best subset of features for microarray data with the irrelevant and redundant features removed. We formulate the selection problem as a L1-regularized optimization problem, based on a newly defined linear discriminant analysis criterion. Instead of calculating the mean of the samples, a kernel-based approach is used to estimate the class centroid to define both the between-class separability and the within-class compactness for the criterion. Theoretical analysis indicates that the global optimal solution of the L1-regularized criterion can be reached with a general condition, on which an efficient algorithm is derived to the feature selection problem in a linear time complexity with respect to the number of features and the number of samples. The experimental results on ten publicly available microarray datasets demonstrate that the proposed method performs effectively and competitively compared with state-of-the-art methods. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:32 / 41
页数:10
相关论文
共 50 条
  • [41] GT-kernelPLS: Game theory based hybrid gene selection method for microarray data classification
    Shakoor, Adnan
    Peng, Qinke
    Sun, Shiquan
    Wang, Xiao
    Lv, Jia
    [J]. 2015 IEEE/ACIS 16th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, SNPD 2015 - Proceedings, 2015,
  • [42] Anti-spam filtering: A centroid-based classification approach
    Soonthornphisaj, N
    Chaikulseriwat, K
    Tang-On, P
    [J]. 2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 1096 - 1099
  • [43] Ensemble gene selection by grouping for microarray data classification
    Liu, Huawen
    Liu, Lei
    Zhang, Huijie
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (01) : 81 - 87
  • [44] Advances in metaheuristics for gene selection and classification of microarray data
    Duval, Beatrice
    Hao, Jin-Kao
    [J]. BRIEFINGS IN BIOINFORMATICS, 2010, 11 (01) : 127 - 141
  • [45] A STUDY ON GENE SELECTION AND CLASSIFICATION ALGORITHMS FOR CLASSIFICATION OF MICROARRAY GENE EXPRESSION DATA
    Chin, Yeo Lee
    Deris, Safaai
    [J]. JURNAL TEKNOLOGI, 2005, 43
  • [46] Random forest for gene selection and microarray data classification
    Moorthy, Kohbalan
    Mohamad, Mohd Saberi
    [J]. BIOINFORMATION, 2011, 7 (03) : 142 - 146
  • [47] Random Forest for Gene Selection and Microarray Data Classification
    Moorthy, Kohbalan
    Mohamad, Mohd Saberi
    [J]. KNOWLEDGE TECHNOLOGY, 2012, 295 : 174 - 183
  • [48] CENTROID-BASED TEXTURE CLASSIFICATION USING THE GENERALIZED GAMMA DISTRIBUTION
    Schutz, Aurelien
    Bombrun, Lionel
    Berthoumieu, Yannick
    Najim, Mohamed
    [J]. 2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [49] Meta-heuristic Search based Gene Selection and Classification of Microarray Data
    Kumar, Mukesh
    Rath, Santanu Kumar
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [50] A Novel Kernel-based Gene Selection and Classification Scheme for Microarray Data
    Huang, Hsiao-Yun
    Chang, Hui-Yi
    Liu, Jeng-Fu
    [J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 1679 - 1683