Revisiting spatial dropout for regularizing convolutional neural networks

被引:0
|
作者
Sanghun Lee
Chulhee Lee
机构
[1] Yonsei University,Department of Electrical and Electronic Engineering
来源
关键词
Network regularization; Convolutional neural network; Spatial dropout; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Overfitting is one of the most challenging problems in deep neural networks with a large number of trainable parameters. To prevent networks from overfitting, the dropout method, which is a strong regularization technique, has been widely used in fully-connected neural networks. In several state-of-the-art convolutional neural network architectures for object classification, however, dropout was partially or not even applied since its accuracy gain was relatively insignificant in most cases. Also, the batch normalization technique reduced the need for the dropout method because of its regularization effect. In this paper, we show that conventional element-wise dropout can be ineffective for convolutional layers. We found that dropout between channels in the CNNs can be functionally similar to dropout in the FCNNs, and spatial dropout can be an effective way to take advantage of the dropout technique for regularizing. To prove our points, we conducted several experiments using the CIFAR-10 and CIFAR-100 databases. For comparison, we only replaced the dropout layers with spatial dropout layers and kept all other hyperparameters and methods intact. DenseNet-BC with spatial dropout showed promising results (3.32% error rates with CIFAR-10, 3.0 M parameters) compared to other existing competitive methods.
引用
收藏
页码:34195 / 34207
页数:12
相关论文
共 50 条
  • [21] Normalization and dropout for stochastic computing-based deep convolutional neural networks
    Li, Ji
    Yuan, Zihao
    Li, Zhe
    Ren, Ao
    Ding, Caiwen
    Draper, Jeffrey
    Nazarian, Shahin
    Qiu, Qinru
    Yuan, Bo
    Wang, Yanzhi
    [J]. INTEGRATION-THE VLSI JOURNAL, 2019, 65 : 395 - 403
  • [22] DEEPCON: protein contact prediction using dilated convolutional neural networks with dropout
    Adhikari, Badri
    [J]. BIOINFORMATICS, 2020, 36 (02) : 470 - 477
  • [23] Hybridized sine cosine algorithm with convolutional neural networks dropout regularization application
    Bacanin, Nebojsa
    Zivkovic, Miodrag
    Al-Turjman, Fadi
    Venkatachalam, K.
    Trojovsky, Pavel
    Strumberger, Ivana
    Bezdan, Timea
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [24] Student dropout prediction in massive open online courses by convolutional neural networks
    Qiu, Lin
    Liu, Yanshen
    Hu, Quan
    Liu, Yi
    [J]. SOFT COMPUTING, 2019, 23 (20) : 10287 - 10301
  • [25] Student dropout prediction in massive open online courses by convolutional neural networks
    Lin Qiu
    Yanshen Liu
    Quan Hu
    Yi Liu
    [J]. Soft Computing, 2019, 23 : 10287 - 10301
  • [26] Hybridized sine cosine algorithm with convolutional neural networks dropout regularization application
    Nebojsa Bacanin
    Miodrag Zivkovic
    Fadi Al-Turjman
    K. Venkatachalam
    Pavel Trojovský
    Ivana Strumberger
    Timea Bezdan
    [J]. Scientific Reports, 12
  • [27] Revisiting Orthogonality Regularization: A Study for Convolutional Neural Networks in Image Classification
    Kim, Taehyeon
    Yun, Se-Young
    [J]. IEEE Access, 2022, 10 : 69741 - 69749
  • [28] Revisiting Orthogonality Regularization: A Study for Convolutional Neural Networks in Image Classification
    Kim, Taehyeon
    Yun, Se-Young
    [J]. IEEE ACCESS, 2022, 10 : 69741 - 69749
  • [29] Fault diagnosis of bearings based on deep separable convolutional neural network and spatial dropout
    Zhang, Jiqiang
    Kong, Xiangwei
    LI, Xueyi
    Hu, Zhiyong
    Cheng, Liu
    Yu, Mingzhu
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (10) : 301 - 312
  • [30] Spatial Decomposition and Aggregation for Attention in Convolutional Neural Networks
    Zhu, Meng
    Min, Weidong
    Xiang, Hongyue
    Zha, Cheng
    Huang, Zheng
    Li, Longfei
    Fu, Qiyan
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (01)