Model complexity of deep learning: a survey

被引:146
|
作者
Hu, Xia [1 ]
Chu, Lingyang [2 ]
Pei, Jian [1 ]
Liu, Weiqing [3 ]
Bian, Jiang [3 ]
机构
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC, Canada
[2] McMaster Univ, Dept Comp & Software, Hamilton, ON, Canada
[3] Microsoft Res, Beijing, Peoples R China
基金
加拿大自然科学与工程研究理事会;
关键词
Deep learning; Deep neural network; Model complexity; Expressive capacity; DECISION TREE COMPLEXITY; NEURAL-NETWORKS; VC-DIMENSION; BOUNDS; SELECTION; ACCURACY; TIME;
D O I
10.1007/s10115-021-01605-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Model complexity is a fundamental problem in deep learning. In this paper, we conduct a systematic overview of the latest studies on model complexity in deep learning. Model complexity of deep learning can be categorized into expressive capacity and effective model complexity. We review the existing studies on those two categories along four important factors, including model framework, model size, optimization process, and data complexity. We also discuss the applications of deep learning model complexity including understanding model generalization, model optimization, and model selection and design. We conclude by proposing several interesting future directions.
引用
收藏
页码:2585 / 2619
页数:35
相关论文
共 50 条
  • [1] Model complexity of deep learning: a survey
    Xia Hu
    Lingyang Chu
    Jian Pei
    Weiqing Liu
    Jiang Bian
    [J]. Knowledge and Information Systems, 2021, 63 : 2585 - 2619
  • [2] Deep Learning To Model The Complexity Of Algal Bloom
    Wu, Haoyu
    Lin, Zhibin
    Lin, Borong
    Li, Zhenhao
    Jin, Nanlin
    Zhu, Xiaohui
    [J]. 2022 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, CYBERC, 2022, : 114 - 122
  • [3] Deep Learning Model Selection of Suboptimal Complexity
    O. Yu. Bakhteev
    V. V. Strijov
    [J]. Automation and Remote Control, 2018, 79 : 1474 - 1488
  • [4] Deep Learning Model Selection of Suboptimal Complexity
    Bakhteev, O. Yu
    Strijov, V. V.
    [J]. AUTOMATION AND REMOTE CONTROL, 2018, 79 (08) : 1474 - 1488
  • [5] Survey of Deep Learning Model Compression and Acceleration
    Gao, Han
    Tian, Yu-Long
    Xu, Feng-Yuan
    Zhong, Sheng
    [J]. Ruan Jian Xue Bao/Journal of Software, 2021, 32 (01): : 68 - 92
  • [6] Survey of the VR Environment for Deep Learning Model Development
    Naraha, Taisei
    Akimoto, Kouta
    Yairi, Ikuko Eguchi
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 1423 : 154 - 164
  • [7] A Survey of Security Protection Methods for Deep Learning Model
    Peng, Haipeng
    Bao, Shuang
    Li, Lixiang
    [J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (04): : 1533 - 1553
  • [8] The intrinsic complexity of learning: A survey
    Jain, S
    [J]. FUNDAMENTA INFORMATICAE, 2003, 57 (01) : 17 - 37
  • [9] Complexity of Representations in Deep Learning
    Ho, Tin Kam
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2657 - 2663
  • [10] Methods for deep learning model failure detection and model adaption: A survey
    Wu, Xiaoyu
    Hu, Zheng
    Pei, Ke
    Song, Liyan
    Cao, Zhi
    Zhang, Shuyi
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2021), 2021, : 218 - 223