Gene Correlation Guided Gene Selection for Microarray Data Classification

被引:2
|
作者
Yang, Dong [1 ]
Zhu, Xuchang [2 ]
机构
[1] Tianjin Union Med Ctr, Dept Colorectal Surg, Tianjin 300121, Peoples R China
[2] Nanjing Med Univ, Dept Gastrointestinal Surg, Lianshui Peoples Hosp, Kangda Coll, Huaian 223300, Peoples R China
关键词
DIFFERENTIALLY EXPRESSED GENES; UNSUPERVISED FEATURE-SELECTION; CANCER CLASSIFICATION; SVM-RFE; ALGORITHMS; GRAPH; IDENTIFICATION; REPRESENTATION; EFFICIENT; TISSUES;
D O I
10.1155/2021/6490118
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The microarray cancer data obtained by DNA microarray technology play an important role for cancer prevention, diagnosis, and treatment. However, predicting the different types of tumors is a challenging task since the sample size in microarray data is often small but the dimensionality is very high. Gene selection, which is an effective means, is aimed at mitigating the curse of dimensionality problem and can boost the classification accuracy of microarray data. However, many of previous gene selection methods focus on model design, but neglect the correlation between different genes. In this paper, we introduce a novel unsupervised gene selection method by taking the gene correlation into consideration, named gene correlation guided gene selection (G(3)CS). Specifically, we calculate the covariance of different gene dimension pairs and embed it into our unsupervised gene selection model to regularize the gene selection coefficient matrix. In such a manner, redundant genes can be effectively excluded. In addition, we utilize a matrix factorization term to exploit the cluster structure of original microarray data to assist the learning process. We design an iterative updating algorithm with convergence guarantee to solve the resultant optimization problem. Experimental results on six publicly available microarray datasets are conducted to validate the efficacy of our proposed method.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Efficient gene selection for classification of microarray data
    Ho, SY
    Lee, CC
    Chen, HM
    Huang, HL
    [J]. 2005 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-3, PROCEEDINGS, 2005, : 1753 - 1760
  • [2] A STUDY ON GENE SELECTION AND CLASSIFICATION ALGORITHMS FOR CLASSIFICATION OF MICROARRAY GENE EXPRESSION DATA
    Chin, Yeo Lee
    Deris, Safaai
    [J]. JURNAL TEKNOLOGI, 2005, 43
  • [3] Ensemble gene selection by grouping for microarray data classification
    Liu, Huawen
    Liu, Lei
    Zhang, Huijie
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (01) : 81 - 87
  • [4] Advances in metaheuristics for gene selection and classification of microarray data
    Duval, Beatrice
    Hao, Jin-Kao
    [J]. BRIEFINGS IN BIOINFORMATICS, 2010, 11 (01) : 127 - 141
  • [5] Random forest for gene selection and microarray data classification
    Moorthy, Kohbalan
    Mohamad, Mohd Saberi
    [J]. BIOINFORMATION, 2011, 7 (03) : 142 - 146
  • [6] Random Forest for Gene Selection and Microarray Data Classification
    Moorthy, Kohbalan
    Mohamad, Mohd Saberi
    [J]. KNOWLEDGE TECHNOLOGY, 2012, 295 : 174 - 183
  • [7] Gene selection for classification of microarray data based on the Bayes error
    Ji-Gang Zhang
    Hong-Wen Deng
    [J]. BMC Bioinformatics, 8
  • [8] An integrative gene selection with association analysis for microarray data classification
    Fang, Ong Huey
    Mustapha, Norwati
    Sulaiman, Md. Nasir
    [J]. INTELLIGENT DATA ANALYSIS, 2014, 18 (04) : 739 - 758
  • [9] A Novel BPSO Approach for Gene Selection and Classification of Microarray Data
    Yang, Cheng-San
    Chuang, Li-Yeh
    li, Jung-Chike
    Yang, Cheng-Hong
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 2147 - +
  • [10] Gene selection and classification of human lymphoma from microarray data
    Kamruzzaman, J
    Lim, S
    Gondal, I
    Begg, R
    [J]. BIOLOGICAL AND MEDICAL DATA ANALYSIS, PROCEEDINGS, 2005, 3745 : 379 - +