In the context of supervised parametric models, we introduce the concept of e-values. An e-value is a scalar quantity that represents the proximity of the sampling distribution of parameter estimates in a model trained on a subset of features to that of the model trained on all features (i.e. the full model). Under general conditions, a rank ordering of e-values separates models that contain all essential features from those that do not. The e-values are applicable to a wide range of parametric models. We use data depths and a fast resampling-based algorithm to implement a feature selection procedure using e-values, providing consistency results. For a p-dimensional feature space, this procedure requires fitting only the full model and evaluating p + 1 models, as opposed to the traditional requirement of fitting and evaluating 2(p) models. Through experiments across several model settings and synthetic and real datasets, we establish that the e-values method as a promising general alternative to existing model-specific methods of feature selection.
机构:
Amer Coll Physicians, 190 N Independence Mall West, Philadelphia, PA 19106 USAUniv Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, 635 Blockley Hall,423 Guardian Dr, Philadelphia, PA 19104 USA
Stack, Catherine B.
Griswold, Michael E.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Mississippi, Med Ctr, Dept Data Sci, New Guyton Suite G651,2500 North State St, Jackson, MS 39216 USAUniv Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, 635 Blockley Hall,423 Guardian Dr, Philadelphia, PA 19104 USA
机构:
Stanford Univ, Stanford Prevent Res Ctr, Sch Med, Stanford, CA 94305 USA
Stanford Univ, Meta Res Innovat Ctr Stanford METRICS, Stanford, CA 94305 USAStanford Univ, Stanford Prevent Res Ctr, Sch Med, Stanford, CA 94305 USA
Ioannidis, John P. A.
Tan, Yuan Jin
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Meta Res Innovat Ctr Stanford METRICS, Stanford, CA 94305 USAStanford Univ, Stanford Prevent Res Ctr, Sch Med, Stanford, CA 94305 USA
Tan, Yuan Jin
Blum, Manuel R.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Meta Res Innovat Ctr Stanford METRICS, Stanford, CA 94305 USA
Univ Bern, Bern Univ Hosp, Bern, SwitzerlandStanford Univ, Stanford Prevent Res Ctr, Sch Med, Stanford, CA 94305 USA
机构:
Stanford Univ, Dept Stat, 390 Jane Stanford Way, Stanford, CA 94305 USAStanford Univ, Dept Stat, 390 Jane Stanford Way, Stanford, CA 94305 USA
Gablenz, Paula
Sabatti, Chiara
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Stat, 390 Jane Stanford Way, Stanford, CA 94305 USA
Stanford Univ, Dept Biomed Data Sci, Med Sch Off Bldg 1265 Welch Rd MC5464, Stanford, CA 94305 USAStanford Univ, Dept Stat, 390 Jane Stanford Way, Stanford, CA 94305 USA