Robust Bayesian Kernel Machine via Stein Variational Gradient Descent for Big Data

Cited by: 5
Authors
Khanh Nguyen [1]
Trung Le [2]
Tu Dinh Nguyen [2]
Dinh Phung [2]
Geoffrey I. Webb [2]
Affiliations
[1] Deakin Univ, Geelong, Vic, Australia
[2] Monash Univ, Clayton, Vic, Australia
Funding
Australian Research Council;
Keywords
Kernel methods; Stein divergence; random feature; multiclass supervised learning; Bayesian inference; variational method; big data;
DOI
10.1145/3219819.3220015
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Kernel methods are valued for their strong generalization ability, especially when trained on limited data. However, most kernel methods, including the state-of-the-art LIBSVM, are vulnerable to the curse of kernelization, which makes them infeasible to apply to large-scale datasets. This issue is exacerbated when kernel methods are combined with a grid search to tune their kernel parameters and hyperparameters, which raises the question of model robustness on real datasets. In this paper, we propose a robust Bayesian Kernel Machine (BKM), a model that exploits the strengths of both Bayesian modelling and kernel methods. A key challenge for such a formulation is the need for an efficient learning algorithm. To this end, we extend the recent Stein variational theory for Bayesian inference to our proposed model, resulting in fast and efficient learning and prediction algorithms. Importantly, our proposed BKM is resilient to the curse of kernelization, which makes it applicable to large-scale datasets, and it is robust to parameter tuning, avoiding the expense and potential pitfalls of current parameter-tuning practice. Our extensive experimental results on 12 benchmark datasets show that BKM, without tuning any parameter, achieves predictive performance comparable to the state-of-the-art LIBSVM and significantly outperforms other baselines, while obtaining a significant speedup in total training time compared with its rivals.
Pages: 2003-2011
Number of pages: 9
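
The abstract builds on Stein variational gradient descent (SVGD) for Bayesian inference. As a rough illustration of that general technique only, the sketch below applies the generic SVGD particle update with a fixed-bandwidth RBF kernel to a toy one-dimensional standard normal target. It is not the paper's BKM algorithm: the function names (`rbf_kernel`, `svgd_step`, `run_demo`), the target distribution, the bandwidth, and the step size are all assumptions chosen for the example.

```python
import numpy as np


def rbf_kernel(particles, bandwidth):
    """Return the RBF kernel matrix and the pairwise differences x_i - x_j."""
    diffs = particles[:, None, :] - particles[None, :, :]  # shape (n, n, d)
    sq_dists = np.sum(diffs ** 2, axis=-1)                 # shape (n, n)
    kernel = np.exp(-sq_dists / (2.0 * bandwidth ** 2))
    return kernel, diffs


def svgd_step(particles, grad_log_p, step_size=0.1, bandwidth=1.0):
    """Apply one generic SVGD update to an (n, d) array of particles."""
    n = particles.shape[0]
    kernel, diffs = rbf_kernel(particles, bandwidth)
    scores = grad_log_p(particles)                         # shape (n, d)
    # Driving term: sum_j k(x_j, x_i) * grad log p(x_j).
    drive = kernel @ scores
    # Repulsive term: sum_j grad_{x_j} k(x_j, x_i)
    #               = sum_j k(x_j, x_i) * (x_i - x_j) / bandwidth^2.
    repulse = np.sum(kernel[:, :, None] * diffs, axis=1) / bandwidth ** 2
    return particles + step_size * (drive + repulse) / n


def run_demo(n_particles=50, n_steps=1000, seed=0):
    """Move particles from N(5, 1) toward a toy N(0, 1) target (an assumption)."""
    rng = np.random.default_rng(seed)
    particles = rng.normal(loc=5.0, scale=1.0, size=(n_particles, 1))

    def grad_log_p(x):
        # Score function of the standard normal target: d/dx log p(x) = -x.
        return -x

    for _ in range(n_steps):
        particles = svgd_step(particles, grad_log_p)
    return particles


if __name__ == "__main__":
    final = run_demo()
    print("particle mean:", final.mean(), "particle std:", final.std())
```

The driving term pulls particles toward high-density regions of the target, while the repulsive term keeps them spread out so they approximate a distribution instead of collapsing to a single mode. A fixed bandwidth keeps the sketch short; in practice the bandwidth is often set adaptively from the particles.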