Stationary Activations for Uncertainty Calibration in Deep Learning

Cited by: 0
Authors
Meronen, Lassi [1, 2]
Irwanto, Christabella [1 ]
Solin, Arno [1 ]
Affiliations
[1] Aalto Univ, Espoo, Finland
[2] Saab Finland Oy, Espoo, Finland
Funding
Academy of Finland;
Keywords
NETWORKS;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
We introduce a new family of non-linear neural network activation functions that mimic the properties induced by the widely used Matérn family of kernels in Gaussian process (GP) models. This class spans a range of locally stationary models with various degrees of mean-square differentiability. We show an explicit link to the corresponding GP models in the case where the network consists of one infinitely wide hidden layer. In the limit of infinite smoothness, the Matérn family reduces to the RBF kernel, and in this case we recover RBF activations. Matérn activation functions share the appealing properties of their counterparts in GP models, and we demonstrate that local stationarity combined with limited mean-square differentiability yields both good performance and good uncertainty calibration in Bayesian deep learning tasks. In particular, local stationarity helps calibrate out-of-distribution (OOD) uncertainty. We demonstrate these properties on classification and regression benchmarks and on a radar emitter classification task.
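The statement that the Matérn family approaches the RBF kernel as smoothness grows can be checked numerically on the kernel side. The following is a minimal sketch, not the authors' code: it uses the standard closed-form Matérn correlations for smoothness ν = 1/2, 3/2, 5/2 with unit lengthscale, and the helper names `matern_kernel` and `rbf_kernel` are hypothetical. It illustrates the GP kernels only, not the activation functions proposed in the paper.

```python
import numpy as np


def matern_kernel(r, nu, lengthscale=1.0):
    """Closed-form Matern correlation for nu in {0.5, 1.5, 2.5} (illustrative helper)."""
    s = np.abs(np.asarray(r, dtype=float)) / lengthscale
    if nu == 0.5:
        # Exponential kernel: sample paths are not mean-square differentiable.
        return np.exp(-s)
    if nu == 1.5:
        # Once mean-square differentiable.
        a = np.sqrt(3.0) * s
        return (1.0 + a) * np.exp(-a)
    if nu == 2.5:
        # Twice mean-square differentiable.
        a = np.sqrt(5.0) * s
        return (1.0 + a + a**2 / 3.0) * np.exp(-a)
    raise ValueError("this sketch only covers nu in {0.5, 1.5, 2.5}")


def rbf_kernel(r, lengthscale=1.0):
    """RBF (squared-exponential) correlation, the nu -> infinity limit of the Matern family."""
    s = np.asarray(r, dtype=float) / lengthscale
    return np.exp(-0.5 * s**2)


# Largest pointwise gap between each Matern kernel and the RBF kernel on a grid of distances.
r = np.linspace(0.0, 5.0, 501)
gaps = {
    nu: float(np.max(np.abs(matern_kernel(r, nu) - rbf_kernel(r))))
    for nu in (0.5, 1.5, 2.5)
}

for nu, gap in gaps.items():
    print(f"nu = {nu}: max |Matern - RBF| = {gap:.4f}")
```

On this grid the gap to the RBF kernel shrinks as ν increases, which is consistent with the limiting behaviour described in the abstract. All of these kernels depend only on the distance r between inputs, which is the stationarity property the abstract refers to.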
Pages: 13