Residual Networks of Residual Networks: Multilevel Residual Networks

被引:205
|
作者
Zhang, Ke [1 ]
Sun, Miao [2 ]
Han, Tony X. [2 ]
Yuan, Xingfang [2 ]
Guo, Liru [1 ]
Liu, Tao [1 ]
机构
[1] North China Elect Power Univ, Dept Elect & Commun Engn, Baoding 071000, Hebei, Peoples R China
[2] Univ Missouri, Dept Elect & Comp Engn, Columiba, MO 65211 USA
基金
中国国家自然科学基金;
关键词
Image classification; ImageNet data set; residual networks; residual networks of residual networks (RoR); shortcut; stochastic depth (SD);
D O I
10.1109/TCSVT.2017.2654543
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A residual networks family with hundreds or even thousands of layers dominates major image recognition tasks, but building a network by simply stacking residual blocks inevitably limits its optimization ability. This paper proposes a novel residual network architecture, residual networks of residual networks (RoR), to dig the optimization ability of residual networks. RoR substitutes optimizing residual mapping of residual mapping for optimizing original residual mapping. In particular, RoR adds levelwise shortcut connections upon original residual networks to promote the learning capability of residual networks. More importantly, RoR can be applied to various kinds of residual networks (ResNets, Pre-ResNets, and WRN) and significantly boost their performance. Our experiments demonstrate the effectiveness and versatility of RoR, where it achieves the best performance in all residual-networklike structures. Our RoR-3-WRN58-4 + SD models achieve new state-of-the-art results on CIFAR-10, CIFAR-100, and SVHN, with the test errors of 3.77%, 19.73%, and 1.59%, respectively. RoR-3 models also achieve state-of-the-art results compared with ResNets on the ImageNet data set.
引用
收藏
页码:1303 / 1314
页数:12
相关论文
共 50 条
  • [1] Optimization Method of Residual Networks of Residual Networks for Image Classification
    Zhang, Ke
    Guo, Liru
    Gao, Ce
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 321 - 325
  • [2] Optimization Method of Residual Networks of Residual Networks for Image Classification
    Lin, Long
    Yuan, Hao
    Guo, Liru
    Kuang, Yingqun
    Zhang, Ke
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2018, PT III, 2018, 10956 : 212 - 222
  • [3] Deep Residual Networks of Residual Networks for Image Super-Resolution
    Wei, Xueqi
    Yang, Fumeng
    Wu, Congzhong
    [J]. LIDAR IMAGING DETECTION AND TARGET RECOGNITION 2017, 2017, 10605
  • [4] Wide deep residual networks in networks
    Hmidi Alaeddine
    Malek Jihene
    [J]. Multimedia Tools and Applications, 2023, 82 : 7889 - 7899
  • [5] Wide deep residual networks in networks
    Alaeddine, Hmidi
    Jihene, Malek
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (05) : 7889 - 7899
  • [6] Invertible Residual Networks
    Behrmann, Jens
    Grathwohl, Will
    Chen, Ricky T. Q.
    Duvenaud, David
    Jacobsen, Joern-Henrik
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [7] Residual Alignment: Uncovering the Mechanisms of Residual Networks
    Li, Jianing
    Papyan, Vardan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Dilated Residual Networks
    Yu, Fisher
    Koltun, Vladlen
    Funkhouser, Thomas
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 636 - 644
  • [9] Residual Alignment: Uncovering the Mechanisms of Residual Networks
    Li, Jianing
    Papyan, Vardan
    [J]. Advances in Neural Information Processing Systems, 2023, 36 : 57660 - 57712
  • [10] GLOBALLY CONVERGENT MULTILEVEL TRAINING OF DEEP RESIDUAL NETWORKS
    Kopanicakova, Alena
    Krause, Rolf
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2023, 45 (03): : S254 - S280