Efficient memory reuse methodology for CNN-based real-time image processing in mobile-embedded systems

被引:0
|
作者
Zhao, Kairong [1 ]
Chang, Yinghui [2 ]
Wu, Weikang [2 ]
Luo, Hongyin [1 ]
Li, Zirun [1 ]
He, Shan [1 ]
Guo, Donghui [1 ]
机构
[1] Xiamen Univ, Nat Model Microelect Coll, R&D Ctr Integrated Circuit, Sch Elect Sci & Engn, Xiamen 361005, Peoples R China
[2] China Acad Network & Commun CETC, Shijiazhuang 050050, Peoples R China
关键词
Convolution neural networks; Edge computing; Mobile devices; Image processing; Memory reuse;
D O I
10.1007/s11554-023-01375-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-time image processing applications such as intelligent security and traffic management requires pattern recognition tasks, such face recognition, and license plate detection, to execute in mobile-embedded systems. These mobile-embedded applications employ the deep neural network (DNN), especially convolutional neural network (CNN), to complete the image classification. However, deploying CNN models on embedded platforms is challenging as memory-costly CNNs are in conflict with the highly limited memory budget. To address this challenge, a variety of CNN memory reduction methodologies have been proposed. Among these methodologies, CNN memory reuse has no influence on accuracy and throughput of CNN and is easy to realize, which is most suitable for embedded application. However, the existing memory reuse algorithms cannot achieve stable optimal solution. To solve the problem, we first improve an existing memory reuse algorithm. Compared with its original version, the improved algorithm provides 7-25% less memory consumption of intermediate results. We further propose a novel CNN memory reuse algorithm. In the new algorithm, we significantly make use of CNN structure to reuse memory and obtain optimal solution at most cases. Compared with two existing memory reuse algorithms, the new algorithm can reduce the memory footprint by an average of 20.3% and 9.4%.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Efficient memory reuse methodology for CNN-based real-time image processing in mobile-embedded systems
    Kairong Zhao
    Yinghui Chang
    Weikang Wu
    Hongyin Luo
    Zirun Li
    Shan He
    Donghui Guo
    [J]. Journal of Real-Time Image Processing, 2023, 20
  • [2] CNN-based Fisheye Image Real-Time Semantic Segmentation
    Saez, Alvaro
    Bergasa, Luis M.
    Romera, Eduardo
    Lopez, Elena
    Barea, Rafael
    Sanz, Rafael
    [J]. 2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1039 - 1044
  • [3] Special issue on real-time image and video processing in mobile embedded systems
    Awais Ahmad
    Marco Anisetti
    Ernesto Damiani
    Gwanggil Jeon
    [J]. Journal of Real-Time Image Processing, 2019, 16 : 1 - 4
  • [4] Special issue on real-time image and video processing in mobile embedded systems
    Ahmad, Awais
    Anisetti, Marco
    Damiani, Ernesto
    Jeon, Gwanggil
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2019, 16 (01) : 1 - 4
  • [5] A CNN-Based Real-Time Dense Stereo SLAM System on Embedded FPGA
    Huang, Qian
    Zhang, Yu
    Zheng, Jianing
    Shang, Gaoxing
    Chen, Gang
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 569 - 574
  • [6] A Real-Time CNN-Based Lightweight Mobile Masked Face Recognition System
    Kocacinar, Busra
    Tas, Bilal
    Akbulut, Fatma Patlar
    Catal, Cagatay
    Mishra, Deepti
    [J]. IEEE ACCESS, 2022, 10 : 63496 - 63507
  • [7] Real-time embedded image processing applications using the A(3) methodology
    Sorel, Y
    [J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL II, 1996, : 145 - 148
  • [8] Digital Architecture for Real-Time CNN-based Face Detection for Video Processing
    Bhattarai, Smrity
    Madanayake, Arjuna
    Cintra, Renato J.
    Duffner, Stefan
    Garcia, Christophe
    [J]. 2017 COGNITIVE COMMUNICATIONS FOR AEROSPACE APPLICATIONS WORKSHOP (CCAA), 2017,
  • [9] An efficient hardware implementation of CNN-based object trackers for real-time applications
    Al-Hussein A. El-Shafie
    Mohamed Zaki
    S. E. D. Habib
    [J]. Neural Computing and Applications, 2022, 34 : 19937 - 19952
  • [10] An efficient hardware implementation of CNN-based object trackers for real-time applications
    El-Shafie, Al-Hussein A.
    Zaki, Mohamed
    Habib, S.E.D.
    [J]. Neural Computing and Applications, 2022, 34 (22): : 19937 - 19952