Empirical Bayes methods in variable selection

被引:0
|
作者
Bar, Haim [1 ]
Liu, Kangyan [1 ]
机构
[1] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
基金
美国国家科学基金会;
关键词
empirical Bayes; hierarchical models; mixture models; variable selection; G-PRIORS; MIXTURES;
D O I
10.1002/wics.1455
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The emergence of technologies that produce massive amounts of data such as gene sequencing and distributed sensor systems, created an urgent need to develop statistical methodologies and computational tools to deal with Big Data. Linear regression, which is arguably among the most commonly used inferential tools in statistics, was not designed with Big Data applications in mind. Modern applications in which the number of predictors is often in the thousands, and may even be in the millions, renders the traditional solution to regression analysis, useless. Thus, before any regression analysis can be done, it is necessary to begin with "variable selection". The goal of any good variable selection algorithm is to select only the variables which are useful in predicting the outcome. At the same time, such algorithms have to be computationally efficient. The empirical Bayes approach provides a sound statistical framework for developing variable selection methods. Empirical Bayes allows to "borrow strength" across predictors, thus increasing the power to detect significant ones, while controlling the false discovery rate. While the modeling approach is Bayesian, the crucial step in the empirical Bayes approach replaces the potentially cumbersome and slow integration via Monte Carlo simulations with a simple approximation to the posterior distribution of the regression coefficients. This article is categorized under: Statistical Models > Bayesian Models
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Calibration and empirical Bayes variable selection
    George, EI
    Foster, DP
    [J]. BIOMETRIKA, 2000, 87 (04) : 731 - 747
  • [2] Empirical Bayes vs. fully Bayes variable selection
    Cui, Wen
    George, Edward I.
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2008, 138 (04) : 888 - 900
  • [3] BAYES AND EMPIRICAL-BAYES MULTIPLICITY ADJUSTMENT IN THE VARIABLE-SELECTION PROBLEM
    Scott, James G.
    Berger, James O.
    [J]. ANNALS OF STATISTICS, 2010, 38 (05): : 2587 - 2619
  • [4] Efficient empirical Bayes variable selection and estimation in linear models
    Yuan, M
    Lin, Y
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (472) : 1215 - 1225
  • [5] A Scalable Empirical Bayes Approach to Variable Selection in Generalized Linear Models
    Bar, Haim Y.
    Booth, James G.
    Wells, Martin T.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (03) : 535 - 546
  • [6] BAYES AND EMPIRICAL BAYES METHODS FOR DATA ANALYSIS
    Bradley P. Carlin
    Thomas A. Louis
    [J]. Statistics and Computing, 1997, 7 (2) : 153 - 154
  • [7] ILLUSTRATING EMPIRICAL BAYES METHODS
    CASELLA, G
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1992, 16 (02) : 107 - 125
  • [8] EMPIRICAL BAYES RANKING METHODS
    LAIRD, NM
    LOUIS, TA
    [J]. JOURNAL OF EDUCATIONAL STATISTICS, 1989, 14 (01): : 29 - 46
  • [9] Integrating biological knowledge into variable selection: an empirical Bayes approach with an application in cancer biology
    Hill, Steven M.
    Neve, Richard M.
    Bayani, Nora
    Kuo, Wen-Lin
    Ziyad, Safiyyah
    Spellman, Paul T.
    Gray, Joe W.
    Mukherjee, Sach
    [J]. BMC BIOINFORMATICS, 2012, 13
  • [10] Facilitating high-dimensional transparent classification via empirical Bayes variable selection
    Bar, Haim
    Booth, James
    Wells, Martin T.
    Liu, Kangyan
    [J]. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2018, 34 (06) : 949 - 961