Res2Net: A New Multi-Scale Backbone Architecture

被引:1681
|
作者
Gao, Shang-Hua [1 ]
Cheng, Ming-Ming [1 ]
Zhao, Kai [1 ]
Zhang, Xin-Yu [2 ]
Yang, Ming-Hsuan [1 ]
Torr, Philip [3 ]
机构
[1] Nankai Univ, Coll Comp Sci, TKLNDST, Tianjin 300350, Peoples R China
[2] UC Merced, Merced, CA 95343 USA
[3] Univ Oxford, Oxford OX1 2JD, England
基金
英国工程与自然科学研究理事会;
关键词
Feature extraction; Task analysis; Object detection; Semantics; Computer architecture; Kernel; Convolution; Multi-scale; deep learning; SALIENT OBJECT DETECTION;
D O I
10.1109/TPAMI.2019.2938758
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Representing features at multiple scales is of great importance for numerous vision tasks. Recent advances in backbone convolutional neural networks (CNNs) continually demonstrate stronger multi-scale representation ability, leading to consistent performance gains on a wide range of applications. However, most existing methods represent the multi-scale features in a layer-wise manner. In this paper, we propose a novel building block for CNNs, namely Res2Net, by constructing hierarchical residual-like connections within one single residual block. The Res2Net represents multi-scale features at a granular level and increases the range of receptive fields for each network layer. The proposed Res2Net block can be plugged into the state-of-the-art backbone CNN models, e.g., ResNet, ResNeXt, and DLA. We evaluate the Res2Net block on all these models and demonstrate consistent performance gains over baseline models on widely-used datasets, e.g., CIFAR-100 and ImageNet. Further ablation studies and experimental results on representative computer vision tasks, i.e., object detection, class activation mapping, and salient object detection, further verify the superiority of the Res2Net over the state-of-the-art baseline methods. The source code and trained models are available on https://mmcheng.net/res2net/.
引用
收藏
页码:652 / 662
页数:11
相关论文
共 50 条
  • [1] Multi-Scale End-to-End Speaker Recognition System Based on Improved Res2Net
    Deng, Lihong
    Deng, Fei
    Zhang, Gexiang
    Yang, Qiang
    [J]. Computer Engineering and Applications, 2023, 59 (24) : 110 - 120
  • [2] A novel multi-scale facial expression recognition algorithm based on improved Res2Net for classroom scenes
    Gu, Meihua
    Feng, Jing
    Chu, Yalu
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (06) : 16525 - 16542
  • [3] A novel spectral-spatial multi-scale network for hyperspectral image classification with the Res2Net block
    Zhang, Zhongqiang
    Liu, Danhua
    Gao, Dahua
    Shi, Guangming
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (03) : 751 - 777
  • [4] Multi-Scale Sea-Land Segmentation Method for Remote Sensing Images Based on Res2Net
    Gao Hui
    Yan Xiaodong
    Zhang Heng
    Niu Yiting
    Wang Jiaqi
    [J]. ACTA OPTICA SINICA, 2022, 42 (18)
  • [5] Curvelet-based Image Super-Resolution via Res2Net Multi-Scale Network
    Wei, Wei
    Xin, Junchang
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021), 2021, 11878
  • [6] A novel multi-scale facial expression recognition algorithm based on improved Res2Net for classroom scenes
    Meihua Gu
    Jing Feng
    Yalu Chu
    [J]. Multimedia Tools and Applications, 2024, 83 : 16525 - 16542
  • [7] REPLAY AND SYNTHETIC SPEECH DETECTION WITH RES2NET ARCHITECTURE
    Li, Xu
    Li, Na
    Weng, Chao
    Liu, Xunying
    Su, Dan
    Yu, Dong
    Meng, Helen
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6354 - 6358
  • [8] MR2CPPIS: Accurate prediction of protein–protein interaction sites based on multi-scale Res2Net with coordinate attention mechanism
    Gong, Yinyin
    Li, Rui
    Liu, Yan
    Wang, Jilong
    Cao, Buwen
    Fu, Xiangzheng
    Li, Renfa
    Chen, Danny Z.
    [J]. Computers in Biology and Medicine, 2024, 176
  • [9] RESNEXT AND RES2NET STRUCTURES FOR SPEAKER VERIFICATION
    Zhou, Tianyan
    Zhao, Yong
    Wu, Jian
    [J]. 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 301 - 307
  • [10] Gated Res2Net for Multivariate Time Series Analysis
    Yang, Chao
    Jiang, Mingxing
    Guo, Zhongwen
    Liu, Yuan
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,