Activation functions in deep learning: A comprehensive survey and benchmark

被引:253
|
作者
Dubey, Shiv Ram [1 ,4 ]
Singh, Satish Kumar [1 ]
Chaudhuri, Bidyut Baran [2 ,3 ]
机构
[1] Indian Inst Informat Technol, Comp Vis & Biometr Lab, Allahabad, India
[2] Techno India Univ, Kolkata, India
[3] Indian Stat Inst, Kolkata, India
[4] Indian Inst Informat Technol Allahabad, Allahabad, India
关键词
Activation Functions; Neural networks; Convolutional neural networks; Deep learning; Overview; Recurrent Neural Networks; RECTIFIED LINEAR UNITS; NEURAL-NETWORKS;
D O I
10.1016/j.neucom.2022.06.111
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks have shown tremendous growth in recent years to solve numerous problems. Various types of neural networks have been introduced to deal with different types of problems. However, the main goal of any neural network is to transform the non-linearly separable input data into more linearly separable abstract features using a hierarchy of layers. These layers are combinations of linear and non-linear functions. The most popular and common non-linearity layers are activation functions (AFs), such as Logistic Sigmoid, Tanh, ReLU, ELU, Swish and Mish. In this paper, a comprehensive overview and sur-vey is presented for AFs in neural networks for deep learning. Different classes of AFs such as Logistic Sigmoid and Tanh based, ReLU based, ELU based, and Learning based are covered. Several characteristics of AFs such as output range, monotonicity, and smoothness are also pointed out. A performance compar-ison is also performed among 18 state-of-the-art AFs with different networks on different types of data. The insights of AFs are presented to benefit the researchers for doing further research and practitioners to select among different choices. The code used for experimental comparison is released at: https://github.-com/shivram1987/ActivationFunctions.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:92 / 108
页数:17
相关论文
共 50 条
  • [31] A Comprehensive Survey on Event Analysis Using Deep Learning
    Varshney, Abhilasha
    Lamba, Sonia
    Garg, Puneet
    [J]. Proceedings - 2022 5th International Conference on Computational Intelligence and Communication Technologies, CCICT 2022, 2022, : 146 - 150
  • [32] Convergence of Edge Computing and Deep Learning: A Comprehensive Survey
    Wang, Xiaofei
    Han, Yiwen
    Leung, Victor C. M.
    Niyato, Dusit
    Yan, Xueqiang
    Chen, Xu
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2020, 22 (02): : 869 - 904
  • [33] Deep Reinforcement Learning for Internet of Things: A Comprehensive Survey
    Chen, Wuhui
    Qiu, Xiaoyu
    Cai, Ting
    Dai, Hong-Ning
    Zheng, Zibin
    Zhang, Yan
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (03): : 1659 - 1692
  • [34] A comprehensive survey on optimizing deep learning models by metaheuristics
    Akay, Bahriye
    Karaboga, Dervis
    Akay, Rustu
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (02) : 829 - 894
  • [35] Deep learning for fake news detection: A comprehensive survey
    Hu, Linmei
    Wei, Siqi
    Zhao, Ziwang
    Wu, Bin
    [J]. AI Open, 2022, 3 : 133 - 155
  • [36] Comprehensive survey on image steganalysis using deep learning
    De La Croix, Ntivuguruzwa Jean
    Ahmad, Tohari
    Han, Fengling
    [J]. ARRAY, 2024, 22
  • [37] Deep Learning in Virtual Try-On: A Comprehensive Survey
    Islam, Tasin
    Miron, Alina
    Liu, Xiaohui
    Li, Yongmin
    [J]. IEEE ACCESS, 2024, 12 : 29475 - 29502
  • [38] Video description: A comprehensive survey of deep learning approaches
    Ghazala Rafiq
    Muhammad Rafiq
    Gyu Sang Choi
    [J]. Artificial Intelligence Review, 2023, 56 : 13293 - 13372
  • [39] A comprehensive survey on applications of transformers for deep learning tasks
    Islam, Saidul
    Elmekki, Hanae
    Elsebai, Ahmed
    Bentahar, Jamal
    Drawel, Nagat
    Rjoub, Gaith
    Pedrycz, Witold
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [40] Video restoration based on deep learning: a comprehensive survey
    Claudio Rota
    Marco Buzzelli
    Simone Bianco
    Raimondo Schettini
    [J]. Artificial Intelligence Review, 2023, 56 : 5317 - 5364