An energy-efficient programmable manycore accelerator for personalized biomedical applications

被引:0
|
作者
Kulkarni A. [1 ]
Page A. [1 ]
Attaran N. [1 ]
Jafari A. [1 ]
Malik M. [2 ]
Homayoun H. [2 ]
Mohsenin T. [1 ]
机构
[1] Department of Computer Science and Electrical Engineering, University of Maryland at Baltimore, Baltimore, 21250, MD
[2] Electrical and Computer Engineering Department, George Mason University, Fairfax, 22030, VA
基金
美国国家科学基金会;
关键词
Low-power manycore accelerator; Personalized biomedical applications; Seizure detection; Stress detection; Tongue drive system (tds);
D O I
10.1109/tvlsi.2017.2754272
中图分类号
学科分类号
摘要
Wearable personalized health monitoring systems can offer a cost-effective solution for human health care. These systems must constantly monitor patients' physiological signals and provide highly accurate, and quick processing and delivery of the vast amount of data within a limited power and area footprint. These personalized biomedical applications require sampling and processing multiple streams of physiological signals with a varying number of channels and sampling rates. The processing typically consists of feature extraction, data fusion, and classification stages that require a large number of digital signal processing (DSP) and machine learning (ML) kernels. In response to these requirements, in this paper, a tiny, energyefficient, and domain-specific manycore accelerator referred to as power-efficient nanoclusters (PENC) is proposed to map and execute the kernels of these applications. Simulation results show that the PENC is able to reduce energy consumption by up to 80% and 25% for DSP and ML kernels, respectively, when optimally parallelized. In addition, we fully implemented three compute-intensive personalized biomedical applications, namely, multichannel seizure detection, multiphysiological stress detection, and standalone tongue drive system (sTDS), to evaluate the proposed manycore performance relative to commodity embedded CPU, graphical processing unit (GPU), and fieldprogrammable gate array (FPGA)-based implementations. For these three case studies, the energy consumption and the performance of the proposed PENC manycore, when acting as an accelerator along with an Intel Atom processor as a host, are compared with the existing commercial off-The-shelf generalpurpose, customizable, and programmable embedded platforms, including Intel Atom, Xilinx Artix-7 FPGA, and NVIDIA TK1 advanced RISC machine -A15 and K1 GPU system on a chip. For these applications, the PENC manycore is able to significantly improve throughput and energy efficiency by up to 1872× and 276×, respectively. For the most computational intensive application of seizure detection, the PENC manycore is able to achieve a throughput of 15.22 giga-operations-per-second (GOPs), which is a 14× improvement in throughput over custom FPGA solution. For stress detection, the PENC achieves a throughput of 21.36 GOPs and an energy efficiency of 4.23 GOP/J, which is 14.87× and 2.28× better over FPGA implementation, respec-Tively. For the sTDS application, the PENC improves a throughput by 5.45× and an energy efficiency by 2.37× over FPGA implementation. 1063-8210 © 2017 IEEE.
引用
收藏
页码:96 / 109
页数:13
相关论文
共 50 条
  • [41] An Energy-Efficient Reconfigurable LSTM Accelerator for Natural Language Processing
    Azari, Elham
    Vrudhula, Sarma
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4450 - 4459
  • [42] Runtime Reconfigurable Hardware Accelerator for Energy-Efficient Transposed Convolutions
    Marrazzo, Emanuel
    Spagnolo, Fanny
    Perri, Stefania
    PRIME 2022: 17TH INTERNATIONAL CONFERENCE ON PHD RESEARCH IN MICROELECTRONICS AND ELECTRONICS, 2022, : 49 - 52
  • [43] ITA: An Energy-Efficient Attention and Softmax Accelerator for Quantized Transformers
    Islamoglu, Gamze
    Scherer, Moritz
    Paulin, Gianna
    Fischer, Tim
    Jung, Victor J. B.
    Garofalo, Angelo
    Benini, Luca
    2023 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, ISLPED, 2023,
  • [44] Domino: Graph Processing Services on Energy-efficient Hardware Accelerator
    Xu, Chongchong
    Wang, Chao
    Gong, Lei
    Jin, Lihui
    Li, Xi
    Zhou, Xuehai
    2018 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2018), 2018, : 274 - 281
  • [45] An Energy-Efficient In-Memory Accelerator for Graph Construction and Updating
    Chen, Mingkai
    Liu, Cheng
    Liang, Shengwen
    He, Lei
    Wang, Ying
    Zhang, Lei
    Li, Huawei
    Li, Xiaowei
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (06) : 1781 - 1793
  • [46] An Energy-Efficient YOLO Accelerator Optimizing Filter Switching Activity
    Lim, Kyeongjong
    Kim, Gyuri
    Park, Taehyung
    Nguyen, Xuan Truong
    Lee, Hyuk-Jae
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 2472 - 2476
  • [47] Lithography with MeV Energy Ions for Biomedical Applications: Accelerator Considerations
    Sangyuenyongpipat, S.
    Whitlow, H. J.
    Nakagawa, S. T.
    Yoshida, E.
    APPLICATION OF ACCELERATORS IN RESEARCH AND INDUSTRY, 2009, 1099 : 282 - +
  • [48] An Energy-Efficient Accelerator for Hybrid Bit-width DNNs
    Liu, Bo
    Ruan, Xing
    Xia, Mengwen
    Gong, Yu
    Yang, Jinjiang
    Ge, Wei
    Yang, Jun
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 3306 - 3313
  • [49] Demonstration of a Distributed Accelerator Framework for Energy-efficient ML Processing
    Steinert, Fritjof
    Knapheide, Justin
    Stabernack, Benno
    2021 31ST INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL 2021), 2021, : 386 - 386
  • [50] An FPGA-Based Energy-Efficient Reconfigurable Convolutional Neural Network Accelerator for Object Recognition Applications
    Li, Jixuan
    Un, Ka-Fai
    Yu, Wei-Han
    Mak, Pui-In
    Martins, Rui P.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (09) : 3143 - 3147