Gaussian Process Neural Additive Models

被引:0
|
作者
Zhang, Wei [1 ]
Barr, Brian [2 ]
Paisley, John [1 ]
机构
[1] Columbia Univ, New York, NY 10025 USA
[2] Capital One, New York, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks have revolutionized many fields, but their black-box nature also occasionally prevents their wider adoption in fields such as healthcare and finance, where interpretable and explainable models are required. The recent development of Neural Additive Models (NAMs) is a significant step in the direction of interpretable deep learning for tabular datasets. In this paper, we propose a new subclass of NAMs that use a single-layer neural network construction of the Gaussian process via random Fourier features, which we call Gaussian Process Neural Additive Models (GP-NAM). GP-NAMs have the advantage of a convex objective function and number of trainable parameters that grows linearly with feature dimensionality. It suffers no loss in performance compared to deeper NAM approaches because GPs are well-suited for learning complex non-parametric univariate functions. We demonstrate the performance of GP-NAM on several tabular datasets, showing that it achieves comparable or better performance in both classification and regression tasks with a large reduction in the number of parameters.
引用
下载
收藏
页码:16865 / 16872
页数:8
相关论文
共 50 条
  • [1] Gaussian Process Surrogate Models for Neural Networks
    Li, Michael Y.
    Grant, Erin
    Griffiths, Thomas L.
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 1241 - 1252
  • [2] Additive Gaussian Process for Computer Models With Qualitative and Quantitative Factors
    Deng, X.
    Lin, C. Devon
    Liu, K. -W.
    Rowe, R. K.
    TECHNOMETRICS, 2017, 59 (03) : 283 - 292
  • [3] Sparse Additive Gaussian Process Regression
    Luo, Hengrui
    Nattino, Giovanni
    Pratola, Matthew T.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [4] Sparse Additive Gaussian Process Regression
    Luo, Hengrui
    Nattino, Giovanni
    Pratola, Matthew T.
    Journal of Machine Learning Research, 2022, 23
  • [5] Neural Network with Optimal Neuron Activation Functions Based on Additive Gaussian Process Regression
    Manzhos, Sergei
    Ihara, Manabu
    JOURNAL OF PHYSICAL CHEMISTRY A, 2023, 127 (37): : 7823 - 7835
  • [6] A case based comparison of identification with neural network and Gaussian process models
    Kocijan, J
    Banko, B
    Likar, B
    Girard, A
    Murray-Smith, R
    Rasmussen, CE
    INTELLIGENT CONTROL SYSTEMS AND SIGNAL PROCESSING 2003, 2003, : 129 - 134
  • [7] Partially Linear Additive Gaussian Graphical Models
    Geng, Sinong
    Yan, Minhao
    Kolar, Mladen
    Koyejo, Oluwasanmi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [8] GAUSSIAN PROCESS LSTM RECURRENT NEURAL NETWORK LANGUAGE MODELS FOR SPEECH RECOGNITION
    Lam, Max W. Y.
    Chen, Xie
    Hu, Shoukang
    Yu, Jianwei
    Liu, Xunying
    Meng, Helen
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7235 - 7239
  • [9] Extrapolative Bayesian Optimization with Gaussian Process and Neural Network Ensemble Surrogate Models
    Lim, Yee-Fun
    Ng, Chee Koon
    Vaitesswar, U. S.
    Hippalgaonkar, Kedar
    ADVANCED INTELLIGENT SYSTEMS, 2021, 3 (11)
  • [10] Gaussian Process Morphable Models
    Luthi, Marcel
    Gerig, Thomas
    Jud, Christoph
    Vetter, Thomas
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (08) : 1860 - 1873