Activation functions in deep learning: A comprehensive survey and benchmark

Cited by: 253
Authors
Dubey, Shiv Ram [1, 4]
Singh, Satish Kumar [1]
Chaudhuri, Bidyut Baran [2, 3]
Affiliations
[1] Indian Inst Informat Technol, Comp Vis & Biometr Lab, Allahabad, India
[2] Techno India Univ, Kolkata, India
[3] Indian Stat Inst, Kolkata, India
[4] Indian Inst Informat Technol Allahabad, Allahabad, India
Keywords
Activation Functions; Neural networks; Convolutional neural networks; Deep learning; Overview; Recurrent Neural Networks; RECTIFIED LINEAR UNITS; NEURAL-NETWORKS;
DOI
10.1016/j.neucom.2022.06.111
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Neural networks have shown tremendous growth in recent years to solve numerous problems. Various types of neural networks have been introduced to deal with different types of problems. However, the main goal of any neural network is to transform the non-linearly separable input data into more linearly separable abstract features using a hierarchy of layers. These layers are combinations of linear and non-linear functions. The most popular and common non-linearity layers are activation functions (AFs), such as Logistic Sigmoid, Tanh, ReLU, ELU, Swish and Mish. In this paper, a comprehensive overview and survey is presented for AFs in neural networks for deep learning. Different classes of AFs such as Logistic Sigmoid and Tanh based, ReLU based, ELU based, and Learning based are covered. Several characteristics of AFs such as output range, monotonicity, and smoothness are also pointed out. A performance comparison is also performed among 18 state-of-the-art AFs with different networks on different types of data. The insights of AFs are presented to benefit the researchers for doing further research and practitioners to select among different choices. The code used for experimental comparison is released at: https://github.com/shivram1987/ActivationFunctions. (c) 2022 Elsevier B.V. All rights reserved.
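As a rough illustration of the AFs named in the abstract, the sketch below writes out their commonly used textbook definitions in plain Python. It is not taken from the authors' released code; the function names, the default `alpha` and `beta` values, and the scalar-only interface are assumptions made for this example.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic sigmoid, output range (0, 1); split by sign to avoid overflow."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def tanh(x: float) -> float:
    """Hyperbolic tangent, output range (-1, 1)."""
    return math.tanh(x)


def relu(x: float) -> float:
    """Rectified linear unit: zero for negative inputs, identity otherwise."""
    return max(0.0, x)


def elu(x: float, alpha: float = 1.0) -> float:
    """Exponential linear unit; alpha (assumed default 1.0) scales the negative side."""
    return x if x > 0 else alpha * math.expm1(x)


def softplus(x: float) -> float:
    """log(1 + exp(x)), written in a form that stays finite for large |x|."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def swish(x: float, beta: float = 1.0) -> float:
    """Swish: x * sigmoid(beta * x); beta (assumed default 1.0) may be learned."""
    return x * sigmoid(beta * x)


def mish(x: float) -> float:
    """Mish: x * tanh(softplus(x))."""
    return x * math.tanh(softplus(x))
```

Evaluating `swish(-5.0)`, `swish(-1.0)` and `swish(0.0)` in turn shows the dip for negative inputs that makes Swish non-monotonic, one of the characteristics (alongside output range and smoothness) that the abstract says the survey points out.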
Pages: 92-108
Number of pages: 17
Related papers
50 in total
  • [1] A comprehensive deep learning benchmark for IoT IDS
    Ahmad, Rasheed
    Alsmadi, Izzat
    Alhamdani, Wasim
    Tawalbeh, Lo'ai
    [J]. COMPUTERS & SECURITY, 2022, 114
  • [2] A Comprehensive Benchmark of Deep Learning Libraries on Mobile Devices
    Zhang, Qiyang
    Li, Xiang
    Che, Xiangying
    Ma, Xiao
    Zhou, Ao
    Xu, Mengwei
    Wang, Shangguang
    Ma, Yun
    Liu, Xuanzhe
    [J]. PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022 : 3298 - 3307
  • [3] A Comprehensive Survey on Geometric Deep Learning
    Cao, Wenming
    Yan, Zhiyue
    He, Zhiquan
    He, Zhihai
    [J]. IEEE ACCESS, 2020, 8 : 35929 - 35949
  • [4] The Deep Learning Compiler: A Comprehensive Survey
    Li, Mingzhen
    Liu, Yi
    Liu, Xiaoyan
    Sun, Qingxiao
    You, Xin
    Yang, Hailong
    Luan, Zhongzhi
    Gan, Lin
    Yang, Guangwen
    Qian, Depei
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (03) : 708 - 727
  • [5] A Comprehensive Deep Learning Library Benchmark and Optimal Library Selection
    Zhang, Qiyang
    Che, Xiangying
    Chen, Yijie
    Ma, Xiao
    Xu, Mengwei
    Dustdar, Schahram
    Liu, Xuanzhe
    Wang, Shangguang
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 5069 - 5082
  • [6] Diverse Activation Functions in Deep Learning
    Wang, Bin
    Li, Tianrui
    Huang, Yanyong
    Luo, Huaishao
    Guo, Dongming
    Horng, Shi-Jinn
    [J]. 2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017
  • [7] A Comprehensive Survey of Loss Functions in Machine Learning
    Wang Q.
    Ma Y.
    Zhao K.
    Tian Y.
    [J]. Annals of Data Science, 2022, 9 (02) : 187 - 212
  • [8] A Comprehensive Survey of Deep Learning for Image Captioning
    Hossain, Md Zakir
    Sohel, Ferdous
    Shiratuddin, Mohd Fairuz
    Laga, Hamid
    [J]. ACM COMPUTING SURVEYS, 2019, 51 (06)
  • [9] Deep Learning for Visual Tracking: A Comprehensive Survey
    Marvasti-Zadeh, Seyed Mojtaba
    Cheng, Li
    Ghanei-Yakhdan, Hossein
    Kasaei, Shohreh
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (05) : 3943 - 3968
  • [10] A Comprehensive Survey on Deep Graph Representation Learning
    Ju, Wei
    Fang, Zheng
    Gu, Yiyang
    Liu, Zequn
    Long, Qingqing
    Qiao, Ziyue
    Qin, Yifang
    Shen, Jianhao
    Sun, Fang
    Xiao, Zhiping
    Yang, Junwei
    Yuan, Jingyang
    Zhao, Yusheng
    Wang, Yifan
    Luo, Xiao
    Zhang, Ming
    [J]. NEURAL NETWORKS, 2024, 173