Gradient based invasive weed optimization algorithm for the training of deep neural network

Cited by: 0
Authors
Bai Liu
Liming Nie
Affiliations
[1] Hubei University of Technology, School of Computer Science
[2] Zhejiang Sci-Tech University
Source
Multimedia Tools and Applications, 2021, 80 (15)
Keywords
Stacked sparse auto-encoder; Limited memory BFGS; Meta-heuristic algorithm; Global exploration;
DOI
Not available
Abstract
The Stacked Sparse Auto-Encoder (SSAE) is a well-known hierarchical deep neural network that simulates the deep architecture of the mammalian brain. An SSAE can be trained in a greedy layer-wise manner using gradient-based methods such as limited-memory BFGS (LBFGS). However, gradient-based methods have many disadvantages, chief among them their sensitivity to the initial value. In this paper, a gradient-based meta-heuristic algorithm, referred to as GCIWOSS, is used to optimize the weights and biases of the SSAE. A chaos strategy is first used to initialize the population of the invasive weed optimization (IWO) algorithm, and a new selection strategy is then adopted to improve population diversity and strengthen global exploration. The improved IWO prepares the starting point for the subsequent gradient-based exploitation, which helps avoid falling into local optima. In the experiments, the proposed algorithm is shown to be effective at extracting features from different image datasets, compared with LBFGS and several other feature learning models.
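The pipeline outlined in the abstract (chaotic initialization, IWO-style global exploration, then gradient-based exploitation) can be illustrated with a minimal sketch. This is not the authors' GCIWOSS implementation. The abstract does not specify the chaotic map, the new selection strategy, or any parameter values, so everything below is an assumption made for illustration: the logistic map, the standard IWO seed-count and dispersal rules, plain elitist truncation in place of the paper's selection strategy, fixed-step gradient descent standing in for LBFGS, and a toy sphere function standing in for the SSAE loss. All function and parameter names are hypothetical.

```python
import random


def sphere(x):
    """Toy objective; an assumed stand-in for the SSAE training loss."""
    return sum(v * v for v in x)


def sphere_grad(x):
    """Analytic gradient of the toy objective."""
    return [2.0 * v for v in x]


def chaotic_population(size, dim, low, high, z=0.7):
    """Build an initial population from a logistic-map sequence (assumed map)."""
    population = []
    for _ in range(size):
        weed = []
        for _ in range(dim):
            z = 4.0 * z * (1.0 - z)  # logistic map with r = 4 (chaotic regime)
            weed.append(low + (high - low) * z)
        population.append(weed)
    return population


def iwo(objective, population, low, high, iterations=50, max_size=20,
        seeds_min=1, seeds_max=5, sigma_start=1.0, sigma_end=0.01, rng=None):
    """Standard IWO loop: reproduction, spatial dispersal, competitive exclusion."""
    rng = rng or random.Random(0)
    for it in range(iterations):
        # The dispersal radius shrinks from sigma_start toward sigma_end.
        frac = (iterations - it) / iterations
        sigma = sigma_end + (sigma_start - sigma_end) * frac ** 2
        costs = [objective(w) for w in population]
        best, worst = min(costs), max(costs)
        offspring = []
        for weed, cost in zip(population, costs):
            # Lower-cost weeds produce more seeds.
            ratio = 1.0 if worst == best else (worst - cost) / (worst - best)
            n_seeds = seeds_min + int(ratio * (seeds_max - seeds_min))
            for _ in range(n_seeds):
                seed = [min(high, max(low, v + rng.gauss(0.0, sigma)))
                        for v in weed]
                offspring.append(seed)
        # Elitist truncation (assumed; the paper's selection strategy differs).
        population = sorted(population + offspring, key=objective)[:max_size]
    return population


def gradient_refine(x, grad, step=0.1, iterations=100):
    """Fixed-step gradient descent, used here in place of LBFGS."""
    x = list(x)
    for _ in range(iterations):
        x = [v - step * g for v, g in zip(x, grad(x))]
    return x


def hybrid_search(dim=5, low=-5.0, high=5.0):
    """Explore globally with IWO, then exploit locally with gradient steps."""
    population = chaotic_population(10, dim, low, high)
    population = iwo(sphere, population, low, high)
    best = min(population, key=sphere)
    return gradient_refine(best, sphere_grad)


if __name__ == "__main__":
    solution = hybrid_search()
    print(sphere(solution))
```

The point of the two-stage design is that the population-based stage only has to deliver a good starting point, which reduces the local optimizer's dependence on the initial value. On a real SSAE the gradient stage would call an LBFGS routine on the layer's loss instead of the fixed-step update used here.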
Pages: 22795 - 22819
Page count: 24
Related papers
50 in total
  • [1] Gradient based invasive weed optimization algorithm for the training of deep neural network
    Liu, Bai
    Nie, Liming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22795 - 22819
  • [2] Optimization strategy of neural network based on hybrid invasive weed arithmetic
    Peng, Bin
    Hu, Changan
    Zhao, Rongzhen
    Zhendong Ceshi Yu Zhenduan/Journal of Vibration, Measurement and Diagnosis, 2013, 33 (04) : 634 - 639
  • [3] A Modified Invasive Weed Optimization Algorithm for Training of Feed-Forward Neural Networks
    Giri, Ritwik
    Chowdhury, Aritra
    Ghosh, Arnob
    Das, Swagatam
    Abraham, Ajith
    Snasel, Vaclav
    IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010, : 3166 - 3173
  • [4] A Combined Training Algorithm for RBF Neural Network Based on Particle Swarm Optimization and Gradient Descent
    Xu, Ming
    Chen, Hao
    Duan, Liwei
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 702 - 706
  • [5] Artificial neural networks training algorithm integrating invasive weed optimization with differential evolutionary model
    Movassagh, Ali Akbar
    Alzubi, Jafar A.
    Gheisari, Mehdi
    Rahimi, Mohamadtaghi
    Mohan, Senthilkumar
    Abbasi, Aaqif Afzaal
    Nabipour, Narjes
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (5) : 6017 - 6025
  • [6] Artificial neural networks training algorithm integrating invasive weed optimization with differential evolutionary model
    Ali Akbar Movassagh
    Jafar A. Alzubi
    Mehdi Gheisari
    Mohamadtaghi Rahimi
    Senthilkumar Mohan
    Aaqif Afzaal Abbasi
    Narjes Nabipour
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 6017 - 6025
  • [7] Algorithm for Tuning Fuzzy Network Attack Classifiers Based on Invasive Weed Optimization
    Anfilofiev, A. E.
    Hodashinsky, I. A.
    Evsutin, O. O.
    2014 DYNAMICS OF SYSTEMS, MECHANISMS AND MACHINES (DYNAMICS), 2014,
  • [8] Active Distribution Network Reconfiguration Based on Modified Invasive Weed Optimization Algorithm
    Shi J.
    Yuan D.
    Xue F.
    Ma L.
    Yang T.
    Shi, Jiying (eesjy@163.com), 2018, Tianjin University (51): : 786 - 796
  • [9] A neural network online training algorithm based on compound gradient vector
    Chen, ZP
    Li, J
    Yue, YJ
    Gao, QA
    Zhao, H
    Xu, ZL
    AL 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2002, 2557 : 374 - 384
  • [10] Automatic Clustering Based on Invasive Weed Optimization Algorithm
    Chowdhury, Aritra
    Bose, Sandip
    Das, Swagatam
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, PT II, 2011, 7077 : 105 - +