Non-smooth Bayesian learning for artificial neural networks

Cited by: 2
Authors
Fakhfakh M. [1, 2]
Chaari L. [2]
Bouaziz B. [1]
Gargouri F. [1]
Affiliations
[1] MIRACL laboratory, University of Sfax, Sfax
[2] University of Toulouse, INP, IRIT, Toulouse
Keywords
Artificial neural networks; Hamiltonian dynamics; Machine learning; Optimization
DOI
10.1007/s12652-022-04073-8
Abstract
Artificial neural networks (ANNs) are widely used in supervised machine learning to analyze signals or images in many applications. Given an annotated learning database, one of the main challenges is to optimize the network weights. Many optimization methods have been proposed for machine learning, including gradient-based, Newton-type, and meta-heuristic methods. For the sake of efficiency, regularization is generally used. When non-smooth regularizers such as the ℓ1 norm are used, especially to promote sparse networks, this optimization becomes challenging because the target criterion is non-differentiable. In this paper, we propose an MCMC-based optimization scheme formulated in a Bayesian framework. The proposed scheme solves the above-mentioned sparse optimization problem using an efficient sampling scheme and Hamiltonian dynamics. The designed optimizer is evaluated on four datasets, and the results are verified by a comparative study with two CNNs. Promising results show that the proposed method allows ANNs, even with low complexity levels, to reach high accuracy rates of up to 94%. The proposed method is also faster and more robust to overfitting. More importantly, the training step of the proposed method is much faster than that of all competing algorithms. © 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
Pages: 13813 - 13831
Page count: 18
Related papers
50 in total
  • [31] On a class of non-smooth dynamical systems: a sufficient condition for smooth versus non-smooth solutions
    M.-F. Danca
    Regular and Chaotic Dynamics, 2007, 12 : 1 - 11
  • [32] Recurrent Neural Network for Non-Smooth Convex Optimization Problems With Application to the Identification of Genetic Regulatory Networks
    Cheng, Long
    Hou, Zeng-Guang
    Lin, Yingzi
    Tan, Min
    Zhang, Wenjun Chris
    Wu, Fang-Xiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (05) : 714 - 726
  • [33] Smooth minimization of non-smooth functions
    Yu. Nesterov
    Mathematical Programming, 2005, 103 : 127 - 152
  • [34] Bayesian learning for recurrent neural networks
    Crucianu, M
    Boné, R
    de Beauville, JPA
    NEUROCOMPUTING, 2001, 36 (01) : 235 - 242
  • [35] Hybrid optimization and Bayesian inference techniques for a non-smooth radiation detection problem
    Stefanescu, Razvan
    Schmidt, Kathleen
    Hite, Jason
    Smith, Ralph C.
    Mattingly, John
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2017, 111 (10) : 955 - 982
  • [36] Robust Sparse Rank Learning for Non-Smooth Ranking Measures
    Sun, Zhengya
    Qin, Tao
    Tao, Qing
    Wang, Jue
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009 : 259 - 266
  • [37] Non-Smooth Regularization: Improvement to Learning Framework Through Extrapolation
    Amini, Sajjad
    Soltanian, Mohammad
    Sadeghi, Mostafa
    Ghaemmaghami, Shahrokh
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 1213 - 1223
  • [38] Non-smooth Characteristic on Biological Surface and Development of Bionics Non-smooth Diamond Bit
    Zhong, Chongmei
    ADVANCES IN MANUFACTURING SCIENCE AND ENGINEERING, PTS 1-4, 2013, 712-715 : 360 - 365
  • [39] Deep Learning Neural Networks and Bayesian Neural Networks in Data Analysis
    Chernoded, Andrey
    Dudko, Lev
    Myagkov, Igor
    Volkov, Petr
    XXIII INTERNATIONAL WORKSHOP HIGH ENERGY PHYSICS AND QUANTUM FIELD THEORY (QFTHEP 2017), 2017, 158
  • [40] Effective Proximal Methods for Non-convex Non-smooth Regularized Learning
    Liang, Guannan
    Tong, Qianqian
    Ding, Jiahao
    Pan, Miao
    Bi, Jinbo
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020 : 342 - 351