Residual Networks of Residual Networks: Multilevel Residual Networks

被引:205
|
作者
Zhang, Ke [1 ]
Sun, Miao [2 ]
Han, Tony X. [2 ]
Yuan, Xingfang [2 ]
Guo, Liru [1 ]
Liu, Tao [1 ]
机构
[1] North China Elect Power Univ, Dept Elect & Commun Engn, Baoding 071000, Hebei, Peoples R China
[2] Univ Missouri, Dept Elect & Comp Engn, Columiba, MO 65211 USA
基金
中国国家自然科学基金;
关键词
Image classification; ImageNet data set; residual networks; residual networks of residual networks (RoR); shortcut; stochastic depth (SD);
D O I
10.1109/TCSVT.2017.2654543
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A residual networks family with hundreds or even thousands of layers dominates major image recognition tasks, but building a network by simply stacking residual blocks inevitably limits its optimization ability. This paper proposes a novel residual network architecture, residual networks of residual networks (RoR), to dig the optimization ability of residual networks. RoR substitutes optimizing residual mapping of residual mapping for optimizing original residual mapping. In particular, RoR adds levelwise shortcut connections upon original residual networks to promote the learning capability of residual networks. More importantly, RoR can be applied to various kinds of residual networks (ResNets, Pre-ResNets, and WRN) and significantly boost their performance. Our experiments demonstrate the effectiveness and versatility of RoR, where it achieves the best performance in all residual-networklike structures. Our RoR-3-WRN58-4 + SD models achieve new state-of-the-art results on CIFAR-10, CIFAR-100, and SVHN, with the test errors of 3.77%, 19.73%, and 1.59%, respectively. RoR-3 models also achieve state-of-the-art results compared with ResNets on the ImageNet data set.
引用
收藏
页码:1303 / 1314
页数:12
相关论文
共 50 条
  • [31] Convergence analysis of deep residual networks
    Huang, Wentao
    Zhang, Haizhang
    [J]. ANALYSIS AND APPLICATIONS, 2024, 22 (02) : 351 - 382
  • [32] Residual Sum-Product Networks
    Ventola, Fabrizio
    Stelzner, Karl
    Molina, Alejandro
    Kersting, Kristian
    [J]. INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 138, 2020, 138 : 545 - 556
  • [33] Wide Residual Networks for Semantic Segmentation
    Nakayama, Yoshiki
    Lu, Huimin
    Li, Yujie
    Kim, Hyoungseop
    [J]. 2018 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2018, : 1476 - 1480
  • [34] Capsule Networks With Residual Pose Routing
    Liu, Yi
    Cheng, De
    Zhang, Dingwen
    Xu, Shoukun
    Han, Jungong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 14
  • [35] Deep Residual Networks for Plankton Classification
    Li, Xiu
    Cui, Zuoying
    [J]. OCEANS 2016 MTS/IEEE MONTEREY, 2016,
  • [36] Residual Hyperbolic Graph Convolution Networks
    Xue, Yangkai
    Dai, Jindou
    Lu, Zhipeng
    Wu, Yuwei
    Jia, Yunde
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16247 - 16254
  • [37] Robust Learning with Implicit Residual Networks
    Reshniak, Viktor
    Webster, Clayton G.
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2021, 3 (01): : 34 - 55
  • [38] Difference Residual Graph Neural Networks
    Yang, Liang
    Peng, Weihang
    Zhou, Wenmiao
    Niu, Bingxin
    Gu, Junhua
    Wang, Chuan
    Guo, Yuanfang
    He, Dongxiao
    Cao, Xiaochun
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3356 - 3364
  • [39] Revisiting Residual Networks for Adversarial Robustness
    Huang, Shihua
    Lu, Zhichao
    Deb, Kalyanmoy
    Boddeti, Vishnu Naresh
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8202 - 8211
  • [40] Graph Neural Networks with Adaptive Residual
    Liu, Xiaorui
    Ding, Jiayuan
    Jin, Wei
    Xu, Han
    Ma, Yao
    Liu, Zitao
    Tang, Jiliang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34