A centroid-based gene selection method for microarray data classification

被引:22
|
作者
Guo, Shun [1 ,2 ]
Guo, Donghui [1 ]
Chen, Lifei [3 ]
Jiang, Qingshan [2 ]
机构
[1] Xiamen Univ, Dept Elect Engn, Xiamen 361005, Fujian, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518000, Peoples R China
[3] Fujian Normal Univ, Sch Math & Comp Sci, Fuzhou 350117, Fujian, Peoples R China
基金
中国国家自然科学基金; 高等学校博士学科点专项科研基金;
关键词
Class centroid; Microarray data; Classification; L1; regularization; Gene selection; DISCRIMINANT-ANALYSIS; ALGORITHMS; EFFICIENT;
D O I
10.1016/j.jtbi.2016.03.034
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
For classification problems based on microarray data, the data typically contains a large number of irrelevant and redundant features. In this paper, a new gene selection method is proposed to choose the best subset of features for microarray data with the irrelevant and redundant features removed. We formulate the selection problem as a L1-regularized optimization problem, based on a newly defined linear discriminant analysis criterion. Instead of calculating the mean of the samples, a kernel-based approach is used to estimate the class centroid to define both the between-class separability and the within-class compactness for the criterion. Theoretical analysis indicates that the global optimal solution of the L1-regularized criterion can be reached with a general condition, on which an efficient algorithm is derived to the feature selection problem in a linear time complexity with respect to the number of features and the number of samples. The experimental results on ten publicly available microarray datasets demonstrate that the proposed method performs effectively and competitively compared with state-of-the-art methods. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:32 / 41
页数:10
相关论文
共 50 条
  • [31] A gene selection method for microarray data based on risk genes
    Wong, Tzu-Tsung
    Chen, Ding-Qun
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (11) : 14065 - 14071
  • [32] A Robust Gene Selection Method for Microarray-based Cancer Classification
    Wang, Xiaosheng
    Gotoh, Osamu
    [J]. CANCER INFORMATICS, 2010, 9 : 15 - 30
  • [33] A-New-Nonparametric-Gene-Selection-Method-for-Classification-of- Microarray-Data-
    Ye, Lihua
    Li, Yonggang
    Yang, Kun
    [J]. ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 1928 - +
  • [34] Gene Correlation Guided Gene Selection for Microarray Data Classification
    Yang, Dong
    Zhu, Xuchang
    [J]. BIOMED RESEARCH INTERNATIONAL, 2021, 2021
  • [35] A high performance centroid-based classification approach for language identification
    Takci, Hidayet
    Gungor, Tunga
    [J]. PATTERN RECOGNITION LETTERS, 2012, 33 (16) : 2077 - 2084
  • [36] An Anomaly-Based IDS Framework Using Centroid-Based Classification
    Lin, Iuon-Chang
    Chang, Ching-Chun
    Peng, Chih-Hsiang
    [J]. SYMMETRY-BASEL, 2022, 14 (01):
  • [37] A NON-CONVEX PROXIMAL APPROACH FOR CENTROID-BASED CLASSIFICATION
    Kahanam, Mewe-Hezoudah
    Le-Brusquet, Laurent
    Martin, Segolene
    Pesquet, Jean-Christophe
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 5702 - 5706
  • [38] Analysis of inverse class frequency in centroid-based text classification
    Lertnattee, V
    Theeramunkong, T
    [J]. IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2: SMART INFO-MEDIA SYSTEMS, 2004, : 1171 - 1176
  • [39] Spatial clustering based gene selection for gene expression analysis in microarray data classification
    Dhas, P. Edwin
    Lalitha, S.
    Govindaraj, Annalakshmi
    Jyoshna, B.
    [J]. AUTOMATIKA, 2024, 65 (01) : 152 - 158
  • [40] GT-kernelPLS: Game Theory Based Hybrid Gene Selection Method for Microarray Data Classification
    Shakoor, Adnan
    Peng, Qinke
    Sun, Shiquan
    Wang, Xiao
    Lv, Jia
    [J]. 2015 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2015, : 205 - 210