Non-smooth Bayesian learning for artificial neural networks

Cited by: 2
Authors
Fakhfakh M. [1, 2]
Chaari L. [2]
Bouaziz B. [1]
Gargouri F. [1]
Affiliations
[1] MIRACL laboratory, University of Sfax, Sfax
[2] University of Toulouse, INP, IRIT, Toulouse
Keywords
Artificial neural networks; Hamiltonian dynamics; Machine learning; Optimization
DOI
10.1007/s12652-022-04073-8
Abstract
Artificial neural networks (ANNs) are widely used in supervised machine learning to analyze signals or images in many applications. Given an annotated learning database, one of the main challenges is to optimize the network weights. Many approaches to solving or improving this optimization in machine learning have been proposed, such as gradient-based, Newton-type, and meta-heuristic methods. For the sake of efficiency, regularization is generally used. When non-smooth regularizers such as the ℓ1 norm are used, especially to promote sparse networks, the optimization becomes challenging because the target criterion is non-differentiable. In this paper, we propose an MCMC-based optimization scheme formulated in a Bayesian framework. The proposed scheme solves the above-mentioned sparse optimization problem using an efficient sampling scheme and Hamiltonian dynamics. The designed optimizer is evaluated on four datasets, and the results are verified by a comparative study with two CNNs. Promising results show that the proposed method allows ANNs, even with low complexity levels, to reach accuracy rates of up to 94%. The proposed method is also faster and more robust to overfitting. More importantly, its training step is much faster than that of all competing algorithms. © 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
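To illustrate why an ℓ1 regularizer makes the criterion non-smooth, the following minimal Python sketch computes the ℓ1 penalty of a weight vector and applies elementwise soft-thresholding, a standard tool for handling the kink of the absolute value at zero. It is only an illustration under stated assumptions: the abstract does not say that the proposed sampler uses this operator, and the names l1_norm, soft_threshold, and lam are hypothetical.

```python
def l1_norm(weights):
    """Return the l1 norm: the sum of absolute values of the weights."""
    return sum(abs(w) for w in weights)


def soft_threshold(weights, lam):
    """Shrink each weight toward zero by lam (assumed non-negative).

    Weights whose magnitude is at most lam become exactly zero, which is
    how an l1 penalty promotes sparsity. This is the proximity operator of
    lam * ||w||_1; it is an illustrative choice, not the paper's method.
    """
    shrunk = []
    for w in weights:
        magnitude = abs(w) - lam
        if magnitude <= 0:
            shrunk.append(0.0)
        else:
            shrunk.append(magnitude if w > 0 else -magnitude)
    return shrunk


if __name__ == "__main__":
    w = [1.5, -0.2, 0.0, -2.0]
    print(l1_norm(w))
    print(soft_threshold(w, 0.5))
```

Small weights are set exactly to zero rather than merely reduced, which is the sparsity-promoting behavior the abstract attributes to the ℓ1 norm.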
Pages: 13813-13831
Number of pages: 18