Bayesian Error Analysis for Feature Selection in Biomarker Discovery

被引:1
|
作者
Pour, Ali Foroughi [1 ]
Dalton, Lori A. [1 ]
机构
[1] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA
来源
IEEE ACCESS | 2019年 / 7卷
基金
美国国家科学基金会;
关键词
Biomarker discovery; feature selection; error analysis; validation; Bayesian methods; bioinformatics; VARIABLE-SELECTION; MODEL ASSESSMENT; BREAST-CANCER; VALIDATION; EXPRESSION; PROPORTION;
D O I
10.1109/ACCESS.2019.2932622
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a novel Bayesian validation paradigm with several validation metrics tailored to biomarker discovery, including moments (the mean and variance) of the number of false discoveries, the number of missed discoveries, and the false discovery rate. All of these validation metrics can be used with a variety of Bayesian variable selection methods already available in the literature. When used in conjunction with Bayesian models with independent Gaussian features, we call these validation metrics optimal Bayesian feature filtering moments (OBFMs). We find closed-form expressions for OBFMs and show that they are asymptotically Gaussian and consistent even when the modeling assumptions are violated. In both synthetic simulations and real data analysis, OBFMs perform very well in biomarker discovery relative to other methods from the literature.
引用
收藏
页码:127544 / 127563
页数:20
相关论文
共 50 条
  • [41] BAYESIAN FEATURE SELECTION WITH DATA INTEGRATION
    Pour, Ali Foroughi
    Dalton, Lori A.
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 504 - 508
  • [42] The impact of Bayesian optimization on feature selection
    Kaixin Yang
    Long Liu
    Yalu Wen
    Scientific Reports, 14
  • [43] Sparse Bayesian Approach for Feature Selection
    Li, Chang
    Chen, Huanhuan
    2014 IEEE Symposium on Computational Intelligence in Big Data (CIBD), 2014, : 7 - 13
  • [44] A Comparative Analysis of Feature Selection Methods for Biomarker Discovery in Study of Toxicant-Treated Atlantic Cod (Gadus Morhua) Liver
    Zhang, Xiaokang
    Jonassen, Inge
    NORDIC ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2019, 1056 : 114 - 123
  • [45] Unsupervised Robust Bayesian Feature Selection
    Sun, Jianyong
    Zhou, Aimin
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 558 - 564
  • [46] Bayesian Feature Selection for Clustering Problems
    Hruschka, Eduardo
    Hruschka, Estevam, Jr.
    Covoes, Thiago
    Ebecken, Nelson
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2006, 5 (04) : 315 - 327
  • [47] BOLSTERED ERROR ESTIMATOR WITH FEATURE SELECTION
    Sima, Chao
    Vu, Thang
    Braga-Neto, Ulisses M.
    Dougherty, Edward R.
    2009 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2009), 2009, : 17 - +
  • [48] Impact of error estimation on feature selection
    Sima, C
    Attoor, S
    Brag-Neto, U
    Lowey, J
    Suh, E
    Dougherty, ER
    PATTERN RECOGNITION, 2005, 38 (12) : 2472 - 2482
  • [49] On feature selection, curse-of-dimensionality and error probability in discriminant analysis
    Pavlenko, T
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2003, 115 (02) : 565 - 584
  • [50] Using Regression Error Analysis and Feature Selection to Automatic Cluster Labeling
    Soares Silva, Lucia Emilia
    Machado, Vinicius Ponte
    Araujo, Sidiney Souza
    Alves de Lima, Bruno Vicente
    Souza Veras, Rodrigo de Melo
    PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2021), 2021, 12981 : 376 - 388