EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

被引:0
|
作者
Tan, Mingxing [1 ]
Le, Quoc V. [1 ]
机构
[1] Google Res, Brain Team, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available. In this paper, we systematically study model scaling and identify that carefully balancing network depth, width, and resolution can lead to better performance. Based on this observation, we propose a new scaling method that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient. We demonstrate the effectiveness of this method on scaling up MobileNets and ResNet. To go even further, we use neural architecture search to design a new baseline network and scale it up to obtain a family of models, called Efficient-Nets, which achieve much better accuracy and efficiency than previous ConvNets. In particular, our EfficientNet-B7 achieves state-of-the-art 84.4% top-1 / 97.1% top-5 accuracy on ImageNet, while being 8.4x smaller and 6.1x faster on inference than the best existing ConvNet. Our EfficientNets also transfer well and achieve state-of-the-art accuracy on CIFAR-100 (91.7%), Flowers (98.8%), and 3 other transfer learning datasets, with an order of magnitude fewer parameters.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] EfficientNet convolutional neural networks-based Android malware detection
    Yadav, Pooja
    Menon, Neeraj
    Ravi, Vinayakumar
    Vishvanathan, Sowmya
    Pham, Tuan D.
    [J]. COMPUTERS & SECURITY, 2020, 115
  • [2] Rethinking convolutional neural networks for trajectory refinement
    Yoon, Hanbit
    Ali, Usman
    Choi, Joonhee
    Park, Eunbyung
    [J]. PATTERN RECOGNITION, 2025, 157
  • [3] Scaling up the training of Convolutional Neural Networks
    Snir, Marc
    [J]. 2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 925 - 925
  • [4] Rethinking Automatic Chord Recognition with Convolutional Neural Networks
    Humphrey, Eric J.
    Bello, Juan P.
    [J]. 2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2, 2012, : 357 - 362
  • [5] Efficient and accurate compound scaling for convolutional neural networks
    Lin, Chengmin
    Yang, Pengfei
    Wang, Quan
    Qiu, Zeyu
    Lv, Wenkai
    Wang, Zhenyi
    [J]. NEURAL NETWORKS, 2023, 167 : 787 - 797
  • [6] Rethinking the Value of Local Feature Fusion in Convolutional Neural Networks
    Zhenyu Lou
    Xin Ye
    Luoming Zhang
    Weijia Wu
    Yefei He
    Hong Zhou
    [J]. Neural Processing Letters, 2023, 55 : 9085 - 9100
  • [7] Rethinking the Value of Local Feature Fusion in Convolutional Neural Networks
    Lou, Zhenyu
    Ye, Xin
    Zhang, Luoming
    Wu, Weijia
    He, Yefei
    Zhou, Hong
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (07) : 9085 - 9100
  • [8] Boosted EfficientNet: Detection of Lymph Node Metastases in Breast Cancer Using Convolutional Neural Networks
    Wang, Jun
    Liu, Qianying
    Xie, Haotian
    Yang, Zhaogang
    Zhou, Hefeng
    [J]. CANCERS, 2021, 13 (04) : 1 - 14
  • [9] Learning with rethinking: Recurrently improving convolutional neural networks through feedback
    Li, Xin
    Jie, Zequn
    Feng, Jiashi
    Liu, Changsong
    Yan, Shuicheng
    [J]. PATTERN RECOGNITION, 2018, 79 : 183 - 194
  • [10] Image-based malware representation approach with EfficientNet convolutional neural networks for effective malware classification
    Chaganti, Rajasekhar
    Ravi, Vinayakumar
    Pham, Tuan D.
    [J]. JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 69