An energy-efficient programmable manycore accelerator for personalized biomedical applications

Cited by: 0
Authors
Kulkarni A. [1 ]
Page A. [1 ]
Attaran N. [1 ]
Jafari A. [1 ]
Malik M. [2 ]
Homayoun H. [2 ]
Mohsenin T. [1 ]
Affiliations
[1] Department of Computer Science and Electrical Engineering, University of Maryland at Baltimore, Baltimore, 21250, MD
[2] Electrical and Computer Engineering Department, George Mason University, Fairfax, 22030, VA
Funding
National Science Foundation (USA)
Keywords
Low-power manycore accelerator; Personalized biomedical applications; Seizure detection; Stress detection; Tongue drive system (TDS)
DOI
10.1109/tvlsi.2017.2754272
Abstract
Wearable personalized health monitoring systems can offer a cost-effective solution for human health care. These systems must constantly monitor patients' physiological signals and process and deliver the resulting large volume of data quickly and accurately within a limited power and area footprint. These personalized biomedical applications require sampling and processing multiple streams of physiological signals with a varying number of channels and sampling rates. The processing typically consists of feature extraction, data fusion, and classification stages that require a large number of digital signal processing (DSP) and machine learning (ML) kernels. In response to these requirements, this paper proposes a tiny, energy-efficient, domain-specific manycore accelerator, referred to as power-efficient nanoclusters (PENC), to map and execute the kernels of these applications. Simulation results show that the PENC is able to reduce energy consumption by up to 80% and 25% for DSP and ML kernels, respectively, when optimally parallelized. In addition, we fully implemented three compute-intensive personalized biomedical applications, namely, multichannel seizure detection, multiphysiological stress detection, and standalone tongue drive system (sTDS), to evaluate the performance of the proposed manycore relative to commodity embedded CPU, graphics processing unit (GPU), and field-programmable gate array (FPGA)-based implementations. For these three case studies, the energy consumption and performance of the proposed PENC manycore, acting as an accelerator alongside an Intel Atom host processor, are compared with existing commercial off-the-shelf general-purpose, customizable, and programmable embedded platforms, including the Intel Atom, the Xilinx Artix-7 FPGA, and the NVIDIA TK1 system on a chip with its ARM A15 CPU and K1 GPU.
For these applications, the PENC manycore is able to improve throughput and energy efficiency by up to 1872× and 276×, respectively. For the most computationally intensive application, seizure detection, the PENC manycore achieves a throughput of 15.22 giga-operations per second (GOPs), a 14× improvement in throughput over the custom FPGA solution. For stress detection, the PENC achieves a throughput of 21.36 GOPs and an energy efficiency of 4.23 GOP/J, which are 14.87× and 2.28× better, respectively, than the FPGA implementation. For the sTDS application, the PENC improves throughput by 5.45× and energy efficiency by 2.37× over the FPGA implementation. 1063-8210 © 2017 IEEE.
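The stress-detection figures in the abstract can be turned into implied baseline values with simple arithmetic. The sketch below is only an illustration: the variable and function names are hypothetical, the FPGA numbers are back-computed from the quoted ratios rather than reported in the abstract, and the average-power estimate assumes that throughput and energy efficiency were measured over the same system boundary, which the abstract does not state.

```python
# Figures quoted in the abstract for the stress-detection case study.
penc_throughput_gops = 21.36       # giga-operations per second
penc_efficiency_gop_per_j = 4.23   # giga-operations per joule
throughput_gain_vs_fpga = 14.87    # PENC throughput / FPGA throughput
efficiency_gain_vs_fpga = 2.28     # PENC efficiency / FPGA efficiency


def implied_baseline(value: float, gain: float) -> float:
    """Back out the baseline figure from an improved value and its gain factor."""
    return value / gain


# Implied (not reported) FPGA figures, derived only from the ratios above.
fpga_throughput_gops = implied_baseline(penc_throughput_gops, throughput_gain_vs_fpga)
fpga_efficiency_gop_per_j = implied_baseline(
    penc_efficiency_gop_per_j, efficiency_gain_vs_fpga
)

# (GOP/s) / (GOP/J) = J/s = W. This is an assumption-laden estimate of average
# power, valid only if both figures cover the same hardware and workload.
penc_power_w = penc_throughput_gops / penc_efficiency_gop_per_j

print(f"Implied FPGA throughput: {fpga_throughput_gops:.2f} GOPs")
print(f"Implied FPGA efficiency: {fpga_efficiency_gop_per_j:.2f} GOP/J")
print(f"Implied PENC-side average power: {penc_power_w:.2f} W")
```

The same division applies to the seizure-detection and sTDS ratios, although the abstract gives absolute PENC figures only for throughput in the seizure-detection case.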
Pages: 96-109
Number of pages: 13