PULP: A Ultra-Low Power Parallel Accelerator for Energy-Efficient and Flexible Embedded Vision

被引:47
|
作者
Conti, Francesco [1 ]
Rossi, Davide [1 ]
Pullini, Antonio [2 ]
Loi, Igor [1 ]
Benini, Luca [1 ,2 ]
机构
[1] Univ Bologna, Dept Elect Elect & Informat Engn, Bologna, Italy
[2] Swiss Fed Inst Technol, Integrated Syst Lab, Zurich, Switzerland
关键词
Ultra-Low Power; Embedded vision; Convolutional Neural Network; Optical flow; Motion estimation; FD-SOI; Multi-core; OpenRISC; MOTION ESTIMATION; ARCHITECTURE; PROCESSOR; EXPLORATION; MULTIMEDIA; CLUSTER; ENGINE; CORE; CMOS;
D O I
10.1007/s11265-015-1070-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Novel pervasive devices such as smart surveillance cameras and autonomous micro-UAVs could greatly benefit from the availability of a computing device supporting embedded computer vision at a very low power budget. To this end, we propose PULP (Parallel processing Ultra-Low Power platform), an architecture built on clusters of tightly-coupled OpenRISC ISA cores, with advanced techniques for fast performance and energy scalability that exploit the capabilities of the STMicroelectronics UTBB FD-SOI 28nm technology. We show that PULP performance can be scaled over a 1x-354x range, with a peak theoretical energy efficiency of 211 GOPS/W. We present performance results for several demanding kernels from the image processing and vision domain, with post-layout power modeling: a motion detection application that can run at an efficiency up to 192 GOPS/W (90 % of the theoretical peak); a ConvNet-based detector for smart surveillance that can be switched between 0.7 and 27fps operating modes, scaling energy consumption per frame between 1.2 and 12mJ on a 320 x240 image; and FAST + Lucas-Kanade optical flow on a 128 x128 image at the ultra-low energy budget of 14 mu J per frame at 60fps.
引用
收藏
页码:339 / 354
页数:16
相关论文
共 50 条
  • [31] Toward an Ultra-low Latency and Energy Efficient LoRaWAN
    Muthanna, Mohammed Saleh Ali
    Wang, Ping
    Wei, Min
    Ateya, Abdelhamied A.
    Muthanna, Ammar
    INTERNET OF THINGS, SMART SPACES, AND NEXT GENERATION NETWORKS AND SYSTEMS, NEW2AN 2019, RUSMART 2019, 2019, 11660 : 233 - 242
  • [32] Efficient and Sensitive Electrically Small Rectenna for Ultra-Low Power RF Energy Harvesting
    Assimonis, Stylianos D.
    Fusco, Vincent
    Georgiadis, Apostolos
    Samaras, Theodoros
    SCIENTIFIC REPORTS, 2018, 8
  • [33] Efficient and Sensitive Electrically Small Rectenna for Ultra-Low Power RF Energy Harvesting
    Stylianos D. Assimonis
    Vincent Fusco
    Apostolos Georgiadis
    Theodoros Samaras
    Scientific Reports, 8
  • [34] Minimum energy solution for ultra-low power applications
    Guduri, M.
    Dokania, V.
    Verma, R.
    Islam, A.
    MICROSYSTEM TECHNOLOGIES-MICRO-AND NANOSYSTEMS-INFORMATION STORAGE AND PROCESSING SYSTEMS, 2019, 25 (05): : 1823 - 1831
  • [35] Minimum energy solution for ultra-low power applications
    M. Guduri
    V. Dokania
    R. Verma
    A. Islam
    Microsystem Technologies, 2019, 25 : 1823 - 1831
  • [36] Energy-efficient AES SubBytes transformation circuit using asynchronous circuits for ultra-low voltage operation
    Shizuku, Yuzuru
    Hirose, Tetsuya
    Kuroki, Nobutaka
    Numa, Masahiro
    Okada, Mitsuji
    IEICE ELECTRONICS EXPRESS, 2015, 12 (04):
  • [37] An Energy-Efficient 24T Flip-Flop Consisting of Standard CMOS Gates for Ultra-Low Power Digital VLSIs
    Shizuku, Yuzuru
    Hirose, Tetsuya
    Kuroki, Nobutaka
    Numa, Masahiro
    Okada, Mitsuji
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2015, E98A (12) : 2600 - 2606
  • [38] Maximize Energy Utilization for Ultra-Low Energy Harvesting Powered Embedded Systems
    Pan, Chen
    Xie, Mimi
    Hu, Jingtong
    2017 IEEE 23RD INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA), 2017,
  • [39] Design and Implementation of a Lightweight and Energy-Efficient Semantic Segmentation Accelerator for Embedded Platforms
    Li, Hui
    Li, Jinyi
    Li, Bowen
    Miao, Zhengqian
    Lu, Shengli
    MICROMACHINES, 2025, 16 (03)
  • [40] Energy-Efficient Reconfigurable Cache Architectures for Accelerator-Enabled Embedded Systems
    Farmahini-Farahani, Amin
    Kim, Nam Sung
    Morrow, Katherine
    2014 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2014, : 211 - 220