A simple approach for local and global variable importance in nonlinear regression models

Cited by: 0
Authors
Winn-Nunez, Emily T. [1]
Griffin, Maryclare [2]
Crawford, Lorin [3, 4, 5]
Affiliations
[1] Brown Univ, Div Appl Math, Providence, RI 02912 USA
[2] Univ Massachusetts Amherst, Dept Math & Stat, Amherst, MA USA
[3] Microsoft Res New England, Cambridge, MA 02142 USA
[4] Brown Univ, Dept Biostat, Providence, RI 02912 USA
[5] Brown Univ, Ctr Computat Mol Biol, Providence, RI 02912 USA
Funding
US National Science Foundation; Wellcome Trust (UK)
Keywords
Interpretability; Gaussian processes; Machine learning; Variable selection; GENERALIZED LINEAR-MODELS; QUANTITATIVE TRAIT LOCI; GENETIC ASSOCIATION; MIXED MODELS; SELECTION; STRAINS; CROSS;
DOI
10.1016/j.csda.2023.107914
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
The ability to interpret machine learning models has become increasingly important as their usage in data science continues to rise. Most current interpretability methods are optimized to work on either (i) a global scale, where the goal is to rank features based on their contributions to overall variation in an observed population, or (ii) the local level, where the goal is to detail how important a feature is to a particular individual in the data set. In this work, a new operator is proposed called the "GlObal And Local Score" (GOALS): a simple post hoc approach to simultaneously assess local and global variable importance in nonlinear models. Motivated by problems in biomedicine, the approach is demonstrated using Gaussian process regression, where the task of understanding how genetic markers are associated with disease progression both within individuals and across populations is of high interest. Detailed simulations and real data analyses illustrate the flexible and efficient utility of GOALS over state-of-the-art variable importance strategies.
Pages: 18