Non-smooth Bayesian learning for artificial neural networks

Cited by: 2

Authors:

Fakhfakh M. [1,2]

Chaari L. [2]

Bouaziz B. [1]

Gargouri F. [1]

Affiliations:

[1] MIRACL laboratory, University of Sfax, Sfax

[2] University of Toulouse, INP, IRIT, Toulouse

Source:

Journal of Ambient Intelligence and Humanized Computing | 2023, Vol. 14, No. 10

Keywords:

Artificial neural networks; Hamiltonian dynamics; Machine learning; Optimization

DOI:

10.1007/s12652-022-04073-8


Abstract:

Artificial neural networks (ANNs) are widely used in supervised machine learning to analyze signals or images in many applications. Given an annotated learning database, one of the main challenges is to optimize the network weights. Many approaches to solving optimization problems or improving optimization methods in machine learning have been proposed, such as gradient-based, Newton-type, and meta-heuristic methods. For the sake of efficiency, regularization is generally used. When non-smooth regularizers such as the ℓ1 norm are used, especially to promote sparse networks, this optimization becomes challenging because the target criterion is non-differentiable. In this paper, we propose an MCMC-based optimization scheme formulated in a Bayesian framework. The proposed scheme solves the above-mentioned sparse optimization problem using an efficient sampling scheme and Hamiltonian dynamics. The designed optimizer is evaluated on four datasets, and the results are verified by a comparative study with two CNNs. Promising results show that the proposed method allows ANNs, even with low complexity levels, to reach accuracy rates of up to 94%. The proposed method is also faster and more robust to overfitting. More importantly, its training step is much faster than that of all competing algorithms. © 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
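
The non-differentiability mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' optimizer: the function names, the toy quadratic data term, and the value of `lam` are all assumptions made for illustration. The sketch only shows that an ℓ1-penalized criterion has different left and right slopes at a zero weight, which is why plain gradient-based updates are not well defined there.

```python
import numpy as np

def l1_penalized_loss(weights, data_loss, lam):
    """Data-fit term plus lam * ||weights||_1 (hypothetical helper)."""
    return data_loss(weights) + lam * np.sum(np.abs(weights))

def one_sided_slopes(f, w, i, eps=1e-6):
    """Left and right finite-difference slopes of f along coordinate i."""
    step = np.zeros_like(w)
    step[i] = eps
    right = (f(w + step) - f(w)) / eps
    left = (f(w) - f(w - step)) / eps
    return left, right

# Toy smooth data term (an assumption, standing in for a network loss).
target = np.array([1.0, -2.0])
data_loss = lambda w: 0.5 * np.sum((w - target) ** 2)
lam = 0.5  # illustrative regularization weight
f = lambda w: l1_penalized_loss(w, data_loss, lam)

# Evaluate at a point whose first coordinate is exactly zero.
w = np.array([0.0, 1.0])
left, right = one_sided_slopes(f, w, 0)
gap = right - left  # close to 2 * lam: the kink introduced by |w_0| at 0
print(left, right, gap)
```

With a smooth penalty the two slopes would agree up to finite-difference error; here they differ by about `2 * lam`, so the criterion has no gradient at that point.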


Pages: 13813-13831

Page count: 18

