Optimizing Convolutional Neural Network Architectures

Authors
Balderas, Luis [1, 2, 3, 4]
Lastra, Miguel [2, 3, 4, 5]
Benitez, Jose M. [1, 2, 3, 4]
Affiliations
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada 18071, Spain
[2] Univ Granada, Distributed Computat Intelligence & Time Series La, Granada 18071, Spain
[3] Univ Granada, Sport & Hlth Univ Res Inst, Granada 18071, Spain
[4] Univ Granada, Andalusian Res Inst Data Sci & Computat Intelligen, Granada 18071, Spain
[5] Univ Granada, Dept Software Engn, Granada 18071, Spain
Keywords
convolutional neural network simplification; neural network pruning; efficient machine learning; Green AI; LSTM
DOI
10.3390/math12193032
Chinese Library Classification (CLC) number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
Convolutional neural networks (CNNs) are commonly employed for demanding applications such as speech recognition, natural language processing, and computer vision. As CNN architectures become more complex, their computational demands grow, leading to substantial energy consumption and complicating their use on devices with limited resources (e.g., edge devices). At the same time, Green AI, a line of research seeking more sustainable approaches to Artificial Intelligence development, is drawing increasing attention. Motivated by an interest in optimizing Machine Learning models, in this paper we propose Optimizing Convolutional Neural Network Architectures (OCNNA), a novel pruning-based CNN optimization and construction method designed to establish the importance of convolutional layers. The proposal was evaluated through a thorough empirical study on well-known datasets (CIFAR-10, CIFAR-100, and ImageNet) and CNN architectures (VGG-16, ResNet-50, DenseNet-40, and MobileNet), using accuracy drop and remaining parameters ratio as objective metrics to compare OCNNA with other state-of-the-art approaches. Our method was compared with more than 20 convolutional neural network simplification algorithms, obtaining outstanding results. These results indicate that OCNNA is a competitive CNN construction method that could ease the deployment of neural networks on IoT or other resource-limited devices.
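The two objective metrics named in the abstract can be illustrated with a minimal sketch. The definitions below are the ones commonly used in the pruning literature and are an assumption here, since the abstract does not spell them out; the function names and the numbers in the usage lines are hypothetical and are not results from the paper.

```python
def remaining_parameters_ratio(original_params: int, pruned_params: int) -> float:
    """Fraction of the original model's parameters kept after pruning.

    Assumed definition: pruned parameter count / original parameter count.
    """
    if original_params <= 0:
        raise ValueError("original_params must be positive")
    if not 0 <= pruned_params <= original_params:
        raise ValueError("pruned_params must lie in [0, original_params]")
    return pruned_params / original_params


def accuracy_drop(baseline_accuracy: float, pruned_accuracy: float) -> float:
    """Accuracy lost by pruning.

    Assumed definition: baseline accuracy minus pruned-model accuracy,
    so a negative value means the pruned model is more accurate.
    """
    return baseline_accuracy - pruned_accuracy


# Hypothetical numbers, for illustration only.
ratio = remaining_parameters_ratio(original_params=1000, pruned_params=250)
drop = accuracy_drop(baseline_accuracy=0.93, pruned_accuracy=0.92)
print(f"remaining parameters ratio: {ratio:.2f}")
print(f"accuracy drop: {drop:.2f}")
```

Under these assumed definitions, a pruning method is preferable when both values are small: few parameters remain and little accuracy is lost.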
Pages: 19