A processing-in-pixel-in-memory paradigm for resource-constrained TinyML applications

Cited by: 9
Authors
Datta, Gourav [1]
Kundu, Souvik [1]
Yin, Zihan [1]
Lakkireddy, Ravi Teja [1]
Mathai, Joe [2]
Jacob, Ajey P. [2]
Beerel, Peter A. [1]
Jaiswal, Akhilesh R. [1,2]
Affiliations
[1] Univ Southern Calif, Ming Hsieh Dept Elect & Comp Engn, Los Angeles, CA 90007 USA
[2] Univ Southern Calif, Informat Sci Inst, Los Angeles, CA USA
Keywords
IMAGE; VISION; FUTURE;
DOI
10.1038/s41598-022-17934-1
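Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes
07; 0710; 09;
Abstract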
The demand to process vast amounts of data generated from state-of-the-art high-resolution cameras has motivated novel energy-efficient on-device AI solutions. Visual data in such cameras are usually captured as analog voltages by a sensor pixel array, and then converted to the digital domain for subsequent AI processing using analog-to-digital converters (ADCs). Recent research has tried to take advantage of massively parallel low-power analog/digital computing in the form of near- and in-sensor processing, in which the AI computation is performed partly in the periphery of the pixel array and partly in a separate on-board CPU/accelerator. Unfortunately, high-resolution input images still need to be streamed between the camera and the AI processing unit, frame by frame, causing energy, bandwidth, and security bottlenecks. To mitigate this problem, we propose a novel Processing-in-Pixel-in-Memory (P2M) paradigm that customizes the pixel array by adding support for analog multi-channel, multi-bit convolution, batch normalization, and Rectified Linear Units (ReLU). Our solution includes a holistic algorithm-circuit co-design approach, and the resulting P2M paradigm can be used as a drop-in replacement for embedding the memory-intensive first few layers of convolutional neural network (CNN) models within foundry-manufacturable CMOS image sensor platforms. Our experimental results indicate that P2M reduces the data transfer bandwidth from sensors and the number of analog-to-digital conversions by ~21x, and the energy-delay product (EDP) incurred in processing a MobileNetV2 model on a TinyML use case for the Visual Wake Words (VWW) dataset by up to ~11x compared to standard near-processing or in-sensor implementations, without any significant drop in test accuracy.
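The abstract names three operations that the pixel array is extended to support: convolution, batch normalization, and ReLU. As a purely digital illustration of how these three steps compose, and of the standard identity that lets batch normalization be folded into the convolution weights and a bias, here is a minimal pure-Python sketch. It is not the paper's analog circuit or training procedure; every function name, the 4x4 image, the 2x2 kernel, the stride, and the batch-norm parameters are hypothetical values chosen only for illustration.

```python
import math

def conv2d(image, kernel, stride):
    """Valid (no padding) single-channel 2D cross-correlation on square inputs."""
    k = len(kernel)
    out_size = (len(image) - k) // stride + 1
    out = []
    for i in range(out_size):
        row = []
        for j in range(out_size):
            acc = 0.0
            for di in range(k):
                for dj in range(k):
                    acc += image[i * stride + di][j * stride + dj] * kernel[di][dj]
            row.append(acc)
        out.append(row)
    return out

def batch_norm(x, gamma, beta, mean, var, eps=1e-5):
    """Inference-time batch normalization with fixed statistics."""
    scale = gamma / math.sqrt(var + eps)
    return [[scale * (v - mean) + beta for v in row] for row in x]

def relu(x):
    """Rectified Linear Unit: clamp negative values to zero."""
    return [[max(0.0, v) for v in row] for row in x]

def fold_bn(kernel, gamma, beta, mean, var, eps=1e-5):
    """Fold batch-norm parameters into the kernel and a scalar bias."""
    scale = gamma / math.sqrt(var + eps)
    folded_kernel = [[scale * w for w in row] for row in kernel]
    folded_bias = beta - scale * mean
    return folded_kernel, folded_bias

# Hypothetical inputs (assumptions, not values from the paper).
image = [[float((r * 4 + c) % 7) for c in range(4)] for r in range(4)]
kernel = [[0.5, -1.0], [0.25, 1.0]]
gamma, beta, mean, var = 1.5, -0.2, 0.3, 2.0

# Reference path: convolution, then batch norm, then ReLU.
reference = relu(batch_norm(conv2d(image, kernel, stride=2), gamma, beta, mean, var))

# Fused path: convolution with folded weights, add folded bias, then ReLU.
fk, fb = fold_bn(kernel, gamma, beta, mean, var)
fused = relu([[v + fb for v in row] for row in conv2d(image, fk, stride=2)])
```

Under these assumptions the two paths agree up to floating-point rounding, and the strided convolution shrinks the 4x4 input to a 2x2 output, which is the general mechanism by which computing the first layers near the pixels can reduce the amount of data that has to leave the sensor.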
Pages: 16
Related papers
50 in total
  • [1] A processing-in-pixel-in-memory paradigm for resource-constrained TinyML applications
    Gourav Datta
    Souvik Kundu
    Zihan Yin
    Ravi Teja Lakkireddy
    Joe Mathai
    Ajey P. Jacob
    Peter A. Beerel
    Akhilesh R. Jaiswal
    Scientific Reports, 12
  • [2] Reducing Memory Requirements of Resource-Constrained Applications
    Unnikrishnan, P.
    Chen, G.
    Kandemir, M.
    Karakoy, M.
    Kolcu, I.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2009, 8 (03)
  • [3] Neuromorphic-P2M: processing-in-pixel-in-memory paradigm for neuromorphic image sensors
    Abdullah-Al Kaiser, Md
    Datta, Gourav
    Wang, Zixu
    Jacob, Ajey P.
    Beerel, Peter A.
    Jaiswal, Akhilesh R.
    FRONTIERS IN NEUROINFORMATICS, 2023, 17
  • [4] Resource-Constrained Intelligent Trap: Fruit Flies Surveillance Framework with TinyML Integration
    Quan Minh Nguyen
    Vu Thanh Le
    Minh Nhat Lai
    Hien Bich Vo
    2024 IEEE TENTH INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, ICCE 2024, 2024, : 415 - 420
  • [5] High Accuracy and Low Latency Mixed Precision Neural Network Acceleration for TinyML Applications on Resource-Constrained FPGAs
    Ng, Wei Soon
    Goh, Wang Ling
    Gao, Yuan
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [6] Hyperspectral Video Processing on Resource-Constrained Platforms
    Li, Honglei
    Pan, Lei
    Lee, Eung Joo
    Li, Zhu
    Hoffman, Matthew J.
    Vodacek, Anthony
    Bhattacharyya, Shuvra S.
    2019 10TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING - EVOLUTION IN REMOTE SENSING (WHISPERS), 2019,
  • [7] Resource-constrained signal processing, communications, and networking
    Zhao, Qing
    Manohar, Rajit
    Ulman, Robert
    Veeravalli, Venugopal V.
    IEEE SIGNAL PROCESSING MAGAZINE, 2007, 24 (03) : 12 - +
  • [8] Resource-Constrained Signal Processing in Sensor Networks
    Sun, Shuli
    Xiao, Wendong
    Yao, Kung
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [9] TinyML optimization for activity classification on the resource-constrained body sensor BI-Vital
    Penner, Kevin
    Wittenfeld, Felix
    Steinhagen, Bastian
    Hesse, Marc
    Rueckert, Ulrich
    2023 IEEE 19TH INTERNATIONAL CONFERENCE ON BODY SENSOR NETWORKS, BSN, 2023,
  • [10] A Model-Specific End-to-End Design Methodology for Resource-Constrained TinyML Hardware
    Dong, Yanchi
    Jia, Tianyu
    Du, Kaixuan
    Jing, Yiqi
    Wang, Qijun
    Zhan, Pixian
    Zhang, Yadong
    Yan, Fengyun
    Ma, Yufei
    Liang, Yun
    Ye, Le
    Huang, Ru
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,