Feature-specific inference for penalized regression using local false discovery rates

被引:1
|
作者
Miller, Ryan [1 ]
Breheny, Patrick [2 ]
机构
[1] Grinnell Coll, Dept Math, Grinnell, IA 50112 USA
[2] Univ Iowa, Dept Biostat, Iowa City, IA USA
关键词
false discovery rates; high-dimensional data; high-dimensional models; lasso; penalized regression; CONFIDENCE-INTERVALS; P-VALUES; SELECTION;
D O I
10.1002/sim.9678
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Penalized regression methods such as the lasso are a popular approach to analyzing high-dimensional data. One attractive property of the lasso is that it naturally performs variable selection. An important area of concern, however, is the reliability of these selections. Motivated by local false discovery rate methodology from the large-scale hypothesis testing literature, we propose a method for calculating a local false discovery rate for each variable under consideration by the lasso model. These rates can be used to assess the reliability of an individual feature, or to estimate the model's overall false discovery rate. The method can be used for any level of regularization. This is particularly useful for models with a few highly significant features but a high overall false discovery rate, a relatively common occurrence when using cross validation to select a model. It is also flexible enough to be applied to many varieties of penalized likelihoods including generalized linear models and Cox regression, and a variety of penalties, including the minimax concave penalty (MCP) and smoothly clipped absolute deviation (SCAD) penalty. We demonstrate the validity of this approach and contrast it with other inferential methods for penalized regression as well as with local false discovery rates for univariate hypothesis tests. Finally, we show the practical utility of our method by applying it to a case study involving gene expression in breast cancer patients.
引用
收藏
页码:1412 / 1429
页数:18
相关论文
共 50 条
  • [21] Feature-specific biometric sensing using ceiling view based pyroelectric infrared sensors
    Liu, Tong
    Liu, Jun
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
  • [22] Feature-specific biometric sensing using ceiling view based pyroelectric infrared sensors
    Tong Liu
    Jun Liu
    EURASIP Journal on Advances in Signal Processing, 2012
  • [23] A Feature-Specific Local Cooling System to Control Tensile Strength and Dimensional Accuracy in Fused Filament Fabrication
    Mueller, Kilian Maria Arthur
    Pammer, Sebastian Tobias
    Leonhardt, Stefan
    Mela, Petra
    3D PRINTING AND ADDITIVE MANUFACTURING, 2023, 10 (01) : 50 - 59
  • [24] Using false discovery rates for multiple comparisons in ecology and evolution
    Pike, Nathan
    METHODS IN ECOLOGY AND EVOLUTION, 2011, 2 (03): : 278 - 282
  • [25] Estimating local false discovery rates to identify the differentially expressed genes in microarrays
    Qi, Y. (qys@ujs.edu.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (08):
  • [26] Examining the merits of feature-specific similarity functions in the news domain using human judgments
    Starke, Alain D.
    Solberg, Vegard R.
    Overhaug, Sebastian
    Trattner, Christoph
    USER MODELING AND USER-ADAPTED INTERACTION, 2024, 34 (04) : 995 - 1042
  • [27] Forecasting exchange rates using local regression
    Alvarez-Diaz, Marcos
    Alvarez, Alberto
    APPLIED ECONOMICS LETTERS, 2010, 17 (05) : 509 - 514
  • [28] Feature-specific terrain park-injury rates and risk factors in snowboarders: a case-control study
    Russell, Kelly
    Meeuwisse, Willem H.
    Nettel-Aguirre, Alberto
    Emery, Carolyn A.
    Wishart, Jillian
    Romanow, Nicole T. R.
    Rowe, Brian H.
    Goulet, Claude
    Hagel, Brent E.
    BRITISH JOURNAL OF SPORTS MEDICINE, 2014, 48 (01) : 23 - +
  • [29] A Sparse Regression Method for Group-Wise Feature Selection with False Discovery Rate Control
    Gossmann, Alexej
    Cao, Shaolong
    Brzyski, Damian
    Zhao, Lan-Juan
    Deng, Hong-Wen
    Wang, Yu-Ping
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (04) : 1066 - 1078
  • [30] Garlic (Allium sativum) feature-specific nutrient dosage based on using machine learning models
    Hahn, Leandro
    Parent, Leon-Etienne
    Paviani, Angela Cristina
    Feltrim, Anderson Luiz
    Wamser, Anderson Fernando
    Rozane, Danilo Eduardo
    Ender, Marcos Matos
    Grando, Douglas Luiz
    Moura-Bueno, Jean Michel
    Brunetto, Gustavo
    PLOS ONE, 2022, 17 (05):