Gene and sample selection using T-score with sample selection

被引:12
|
作者
Mundra, Piyushkumar A. [1 ]
Rajapakse, Jagath C. [1 ,2 ,3 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Bioinformat Res Ctr, Singapore, Singapore
[2] Singapore MIT Alliance, Singapore, Singapore
[3] MIT, Dept Biol Engn, Cambridge, MA 02139 USA
关键词
Feature selection; Gene expression; Logistic regression; SVM-RFE; Approximate support vectors; CANCER CLASSIFICATION; MICROARRAY DATA; VARIABLE SELECTION; RANDOM FOREST; SVM-RFE; PREDICTION; FILTER; TUMOR;
D O I
10.1016/j.jbi.2015.11.003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Gene selection from high-dimensional microarray gene-expression data is statistically a challenging problem. Filter approaches to gene selection have been popular because of their simplicity, efficiency, and accuracy. Due to small sample size, all samples are generally used to compute relevant ranking statistics and selection of samples in filter-based gene selection methods has not been addressed. In this paper, we extend previously-proposed simultaneous sample and gene selection approach. In a backward elimination method, a modified logistic regression loss function is used to select relevant samples at each iteration, and these samples are used to compute the T-score to rank genes. This method provides a compromise solution between T-score and other support vector machine (SVM) based algorithms. The performance is demonstrated on both simulated and real datasets with criteria such as classification performance, stability and redundancy. Results indicate that computational complexity and stability of the method are improved compared to SVM based methods without compromising the classification performance. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:31 / 41
页数:11
相关论文
共 50 条
  • [1] Bayesian robust inference of sample selection using selection-t models
    Ding, Peng
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 124 : 451 - 464
  • [2] SAMPLE SELECTION
    ORAV, J
    [J]. INTERNATIONAL JOURNAL FOR QUALITY IN HEALTH CARE, 1995, 7 (03) : 285 - 287
  • [3] Sample selection for microarray gene expression studies
    Repsilber, D
    Fink, L
    Jacobsen, M
    Bläsing, O
    Ziegler, A
    [J]. METHODS OF INFORMATION IN MEDICINE, 2005, 44 (03) : 461 - 467
  • [4] Sample Classification Based on Gene Subset Selection
    Das, Sunanda
    Das, Asit Kumar
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 1, CIDM 2015, 2016, 410 : 227 - 236
  • [5] Gene selection for sample classifications in microarray experiments
    Tsai, CA
    Chen, CH
    Lee, TC
    Ho, IC
    Yang, UC
    Chen, JJ
    [J]. DNA AND CELL BIOLOGY, 2004, 23 (10) : 607 - 614
  • [6] Unsupervised gene selection using biological knowledge : application in sample clustering
    Acharya, Sudipta
    Saha, Sriparna
    Nikhil, N.
    [J]. BMC BIOINFORMATICS, 2017, 18
  • [7] Unsupervised gene selection using biological knowledge : application in sample clustering
    Sudipta Acharya
    Sriparna Saha
    N. Nikhil
    [J]. BMC Bioinformatics, 18
  • [8] OCtS: an alternative of the t-Score method sensitive to outliers and correlation in feature selection
    Demirarslan, Mert
    Suner, Asli
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (03) : 1409 - 1422
  • [9] SAMPLE SELECTION IN SIMULATION
    YE, MH
    WILLIAMS, GB
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 1991, 19 (05): : 490 - 493
  • [10] Selection of a Representative Sample
    Herbert K. H. Lee
    Matthew Taddy
    Genetha A. Gray
    [J]. Journal of Classification, 2010, 27 : 41 - 53