New filter-based feature selection criteria for identifying differentially expressed genes

被引:0
|
作者
Loo, LH [1 ]
Roberts, S [1 ]
Hrebien, L [1 ]
Kam, M [1 ]
机构
[1] Harvard Univ, Bauer Ctr Genom Res, Cambridge, MA 02138 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose two new filter-based feature selection criteria for identifying differentially expressed genes, namely the average difference score (ADS) and the mean difference score (MDS). These criteria replace the serial noise estimator used in existing criteria by a parallel noise estimator. The result is better detection of changes in the variance of expression levels, which t-statistic type criteria tend to under-emphasize. We compare the performance of the new criteria to that of several commonly used feature selection criteria, including the Welch t-statistic, the Fisher correlation score, the Wilcoxon rank sum, and the Independently Consistent Expression discriminator, on synthetic data and real biological data obtained from acute lymphoblastic leukemia and acute myeloid leukemia patients. We find that ADS and MDS outperform the other criteria by exhibiting higher sensitivity and comparable specificity. ADS is also able to flag several biologically important genes that are missed by the Welch t-statistic.
引用
收藏
页码:135 / 144
页数:10
相关论文
共 50 条
  • [41] Filter-Based Feature Selection Using Two Criterion Functions and Evolutionary Fuzzification
    Sornil, Ohm
    [J]. MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, (MIWAI 2016), 2016, 10053 : 173 - 183
  • [42] The impact of sample imbalance on identifying differentially expressed genes
    Kun Yang
    Jianzhong Li
    Hong Gao
    [J]. BMC Bioinformatics, 7
  • [43] The impact of sample imbalance on identifying differentially expressed genes
    Yang, Kun
    Li, Jianzhong
    Gao, Hong
    [J]. BMC BIOINFORMATICS, 2006, 7 (Suppl 4)
  • [44] Protocol Protocol for identifying differentially expressed genes the RumBall
    Nagai, Luis Augusto Eijy
    Lee, Seohyun
    Nakato, Ryuichiro
    [J]. STAR PROTOCOLS, 2024, 5 (01):
  • [45] Identifying differentially expressed genes in cDNA microarray experiments
    Baggerly, KA
    Coombes, KR
    Hess, KR
    Stivers, DN
    Abruzzo, LV
    Zhang, W
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (06) : 639 - 659
  • [46] Detection of differentially expressed genes using feature selection approach from RNA-seq
    Piao, Yongjun
    Ryu, Keun Ho
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2017, : 304 - 308
  • [47] A Novel Meta-heuristic Search Based on Mutual Information for Filter-Based Feature Selection
    Bui Quoc Trung
    Duong Viet Anh
    Bui Thi Mai Anh
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT I, 2023, 13995 : 395 - 407
  • [48] Particle swarm optimization based on filter-based population initialization method for feature selection in classification
    Xue Y.
    Cai X.
    Jia W.
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (06) : 7355 - 7366
  • [49] Impact of Threshold Values for Filter-based Univariate Feature Selection in Heart Disease Classification
    Benhar, Houda
    Idri, Ali
    Hosni, Mohamed
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 5: HEALTHINF, 2020, : 391 - 398
  • [50] Surrogate-Assisted and Filter-Based Multiobjective Evolutionary Feature Selection for Deep Learning
    Espinosa, Raquel
    Jimenez, Fernando
    Palma, Jose
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9591 - 9605