A centroid-based gene selection method for microarray data classification

Cited by: 22
Authors
Guo, Shun [1 ,2 ]
Guo, Donghui [1 ]
Chen, Lifei [3 ]
Jiang, Qingshan [2 ]
Affiliations
[1] Xiamen Univ, Dept Elect Engn, Xiamen 361005, Fujian, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518000, Peoples R China
[3] Fujian Normal Univ, Sch Math & Comp Sci, Fuzhou 350117, Fujian, Peoples R China
Funding
National Natural Science Foundation of China; Specialized Research Fund for the Doctoral Program of Higher Education
Keywords
Class centroid; Microarray data; Classification; L1-regularization; Gene selection; DISCRIMINANT-ANALYSIS; ALGORITHMS; EFFICIENT
DOI
10.1016/j.jtbi.2016.03.034
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Microarray data used for classification typically contain a large number of irrelevant and redundant features. In this paper, a new gene selection method is proposed that chooses the best subset of features for microarray data by removing the irrelevant and redundant ones. We formulate the selection problem as an L1-regularized optimization problem based on a newly defined linear discriminant analysis criterion. Instead of the sample mean, a kernel-based estimate of the class centroid is used to define both the between-class separability and the within-class compactness in the criterion. Theoretical analysis indicates that the global optimal solution of the L1-regularized criterion can be reached under a general condition, from which an efficient algorithm is derived that solves the feature selection problem in time linear in the number of features and the number of samples. Experimental results on ten publicly available microarray datasets demonstrate that the proposed method performs effectively and competitively compared with state-of-the-art methods. (C) 2016 Elsevier Ltd. All rights reserved.
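The abstract does not give the exact criterion, kernel, or optimality condition, so the following Python sketch is only one illustrative reading of the idea and should not be taken as the authors' algorithm. It assumes a Gaussian kernel with a median-distance bandwidth for the centroid estimate, a per-gene score equal to between-class separability minus within-class compactness, and a non-negative, unit-norm weight vector whose L1-penalized optimum is a soft-thresholded score. The names `kernel_centroid`, `select_genes`, and `lam` are hypothetical.

```python
import numpy as np


def kernel_centroid(X, n_iter=10):
    """Kernel-weighted class centroid (assumed Gaussian kernel, median bandwidth)."""
    c = X.mean(axis=0)  # start from the ordinary sample mean
    for _ in range(n_iter):
        d2 = ((X - c) ** 2).sum(axis=1)          # squared distances to current centroid
        h2 = max(float(np.median(d2)), 1e-12)    # bandwidth heuristic (assumption)
        # Subtracting d2.min() keeps the largest weight at 1 and avoids underflow.
        w = np.exp(-(d2 - d2.min()) / (2.0 * h2))
        c = (w[:, None] * X).sum(axis=0) / w.sum()
    return c


def select_genes(X, y, lam=0.5):
    """Return sparse gene weights; nonzero entries mark the selected genes.

    Assumed objective: maximize w.score - lam * ||w||_1 subject to w >= 0 and
    ||w||_2 <= 1, whose solution is the normalized soft-thresholded score.
    Every step is a single pass over the data, so the cost is linear in the
    number of samples and the number of genes.
    """
    classes = np.unique(y)
    n, d = X.shape
    centroids = np.stack([kernel_centroid(X[y == k]) for k in classes])
    sizes = np.array([(y == k).sum() for k in classes])
    overall = (sizes[:, None] * centroids).sum(axis=0) / n

    # Per-gene between-class separability and within-class compactness.
    between = (sizes[:, None] * (centroids - overall) ** 2).sum(axis=0) / n
    within = np.zeros(d)
    for k, c in zip(classes, centroids):
        within += ((X[y == k] - c) ** 2).sum(axis=0)
    within /= n

    score = between - within
    w = np.maximum(score - lam, 0.0)  # soft threshold induced by the L1 penalty
    norm = np.linalg.norm(w)
    return w / norm if norm > 0 else w
```

With a samples-by-genes matrix `X` and a label vector `y`, `np.flatnonzero(select_genes(X, y))` would list the indices of the retained genes; a larger `lam` retains fewer genes.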
Pages: 32-41
Page count: 10