Quantization tolerant network design and performance estimation of computation-in-memory for energy-efficient 3D object detection inference

被引:0
|
作者
Nagai, Ayumu [1 ]
Ichikawa, Yuya [1 ]
Matsui, Chihiro [1 ]
Takeuchi, Ken [1 ]
机构
[1] Univ Tokyo, Dept Elect Engn & Informat Syst, Bunkyo, Tokyo 1138656, Japan
关键词
computation-in-memory; 3D object detection; quantization tolerance; edge AI; energy efficient CiM; depth completion; LiDAR;
D O I
10.35848/1347-4065/ada9f5
中图分类号
O59 [应用物理学];
学科分类号
摘要
This work discusses network architecture, network training method, and a quantization method for achieving a low-energy and high-energy-efficient system, aiming at equipping robots and automobiles such as Autonomous Mobile Robot (AMR) with Computation-in-Memory (CiM) and performing 3D object detection with low energy consumption. Augmented Point Cloud VoxelNet (APCVN) is a network that improves inference accuracy by allowing a slight increase in computational complexity. Multi-Stage Quantization Aware Training (MSQAT) and U-Quantization (UQ) are a learning method and a quantization strategy, respectively, that improve the quantization tolerance of APCVN. Furthermore, quantitative calculations are conducted to estimate ADC energy consumption per inference, operations per second, energy efficiency, memory array area, memory capacity, and latency per inference in the assumed CiM system. Results show that by applying proposed methods, ADC energy consumption per inference is reduced by 8.7% and energy efficiency is improved by 1.6 times while maintaining high inference accuracy.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] LIORAT: NN Layer I/O Range Training for Area/Energy-Efficient Low-Bit A/D Conversion System Design in Error-Tolerant Computation-in-Memory
    Yamada, Ayumu
    Misawa, Naoko
    Matsui, Chihiro
    Takeuchi, Ken
    2023 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2023,
  • [2] A Monolithic 3D Hybrid Architecture for Energy-Efficient Computation
    Yu, Ye
    Jha, Niraj K.
    IEEE TRANSACTIONS ON MULTI-SCALE COMPUTING SYSTEMS, 2018, 4 (04): : 533 - 547
  • [3] Quantization to accelerate inference in multi-modal 3D object detection
    Geerhart, Billy
    Dasari, Venkat R.
    Rapp, Brian
    Wang, Peng
    Wang, Ju
    Payne, Christopher X.
    DISRUPTIVE TECHNOLOGIES IN INFORMATION SCIENCES VIII, 2024, 13058
  • [4] Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
    Rizvi, Shahriyar Masud
    Ab Rahman, Ab Al-Hadi
    Sheikh, Usman Ullah
    Fuad, Kazi Ahmed Asif
    Shehzad, Hafiz Muhammad Faisal
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4499 - 4523
  • [5] Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference
    Shahriyar Masud Rizvi
    Ab Al-Hadi Ab Rahman
    Usman Ullah Sheikh
    Kazi Ahmed Asif Fuad
    Hafiz Muhammad Faisal Shehzad
    Applied Intelligence, 2023, 53 : 4499 - 4523
  • [6] Skew-CIM: Process-Variation-Resilient and Energy-Efficient Computation-in-Memory Design Technique With Skewed Weights
    Yi, Donghyeon
    Lee, Seoyoung
    Choi, Injun
    Yun, Gichan
    Choi, Edward Jongyoon
    Park, Jonghee
    Kwak, Jonghoon
    Jang, Sung-Joon
    Ha, Sohmyung
    Chang, Ik-Joon
    Je, Minkyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, 71 (05) : 2067 - 2078
  • [7] Performance and Thermal Tradeoffs for Energy-Efficient Monolithic 3D Network-on-Chip
    Lee, Dongjin
    Das, Sourav
    Doppa, Janardhan Rao
    Pane, Partha Pratim
    Chakrabarty, Krishnendu
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2018, 23 (05)
  • [8] An Accurate, Error-Tolerant, and Energy-Efficient Neural Network Inference Engine Based on SONOS Analog Memory
    Xiao, T. Patrick
    Feinberg, Ben
    Bennett, Christopher H.
    Agrawal, Vineet
    Saxena, Prashant
    Prabhakar, Venkatraman
    Ramkumar, Krishnaswamy
    Medu, Harsha
    Raghavan, Vijay
    Chettuvetty, Ramesh
    Agarwal, Sapan
    Marinella, Matthew J.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (04) : 1480 - 1493
  • [9] Energy-Efficient Event Detection in 3D Wireless Sensor Networks
    Toriumi, Susumu
    Sei, Yuichi
    Honiden, Shinichi
    2008 1ST IFIP WIRELESS DAYS (WD), 2008, : 370 - +
  • [10] Efficient Uncertainty Estimation for Monocular 3D Object Detection in Autonomous Driving
    Liu, Zechen
    Han, Zhihua
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2711 - 2718