Stationary Activations for Uncertainty Calibration in Deep Learning

被引:0
|
作者
Meronen, Lassi [1 ,2 ]
Irwanto, Christabella [1 ]
Solin, Arno [1 ]
机构
[1] Aalto Univ, Espoo, Finland
[2] Saab Finland Oy, Espoo, Finland
基金
芬兰科学院;
关键词
NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a new family of non-linear neural network activation functions that mimic the properties induced by the widely-used Matern family of kernels in Gaussian process (GP) models. This class spans a range of locally stationary models of various degrees of mean-square differentiability. We show an explicit link to the corresponding GP models in the case that the network consists of one infinitely wide hidden layer. In the limit of infinite smoothness the Matern family results in the RBF kernel, and in this case we recover RBF activations. Matern activation functions result in similar appealing properties to their counterparts in GP models, and we demonstrate that the local stationarity property together with limited mean-square differentiability shows both good performance and uncertainty calibration in Bayesian deep learning tasks. In particular, local stationarity helps calibrate out-of-distribution (OOD) uncertainty. We demonstrate these properties on classification and regression benchmarks and a radar emitter classification task.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Deep Learning Uncertainty in Machine Teaching
    Sanchez, Teo
    Caramiaux, Baptiste
    Thiel, Pierre
    Mackay, Wendy E.
    [J]. IUI'22: 27TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2022, : 173 - 190
  • [22] Uncertainty Quantification for Sparse Deep Learning
    Wang, Yuexi
    Rockova, Veronika
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
  • [23] Data Uncertainty Learning for Single Image Camera Calibration
    Hu, Zhiqiang
    Mikuni, Yoshitaka
    Arata, Koji
    [J]. 2022 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2022, : 2140 - 2147
  • [24] Evaluation of maxout activations in deep learning across several big data domains
    Castaneda, Gabriel
    Morris, Paul
    Khoshgoftaar, Taghi M.
    [J]. JOURNAL OF BIG DATA, 2019, 6 (01)
  • [25] Scalable deep learning for watershed model calibration
    Mudunuru, Maruti K. K.
    Son, Kyongho
    Jiang, Peishi
    Hammond, Glenn
    Chen, Xingyuan
    [J]. FRONTIERS IN EARTH SCIENCE, 2022, 10
  • [26] Deep learning based timing calibration for PET
    Chen, Huai
    Liu, Huafeng
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3185 - 3188
  • [27] Visualizing abnormalities in chest radiographs through salient network activations in Deep Learning
    Sivaramakrishnan, R.
    Antani, S.
    Xue, Z.
    Candemir, S.
    Jaeger, S.
    Thoma, G. R.
    [J]. 2017 IEEE LIFE SCIENCES CONFERENCE (LSC), 2017, : 71 - 74
  • [28] Evaluation of maxout activations in deep learning across several big data domains
    Gabriel Castaneda
    Paul Morris
    Taghi M. Khoshgoftaar
    [J]. Journal of Big Data, 6
  • [29] Evidential Deep Learning to Quantify Classification Uncertainty
    Sensoy, Murat
    Kaplan, Lance
    Kandemir, Melih
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [30] A Simple Baseline for Bayesian Uncertainty in Deep Learning
    Maddox, Wesley J.
    Garipov, Timur
    Izmailov, Pavel
    Vetrov, Dmitry
    Wilson, Andrew Gordon
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32