An energy-efficient programmable manycore accelerator for personalized biomedical applications

被引:0
|
作者
Kulkarni A. [1 ]
Page A. [1 ]
Attaran N. [1 ]
Jafari A. [1 ]
Malik M. [2 ]
Homayoun H. [2 ]
Mohsenin T. [1 ]
机构
[1] Department of Computer Science and Electrical Engineering, University of Maryland at Baltimore, Baltimore, 21250, MD
[2] Electrical and Computer Engineering Department, George Mason University, Fairfax, 22030, VA
基金
美国国家科学基金会;
关键词
Low-power manycore accelerator; Personalized biomedical applications; Seizure detection; Stress detection; Tongue drive system (tds);
D O I
10.1109/tvlsi.2017.2754272
中图分类号
学科分类号
摘要
Wearable personalized health monitoring systems can offer a cost-effective solution for human health care. These systems must constantly monitor patients' physiological signals and provide highly accurate, and quick processing and delivery of the vast amount of data within a limited power and area footprint. These personalized biomedical applications require sampling and processing multiple streams of physiological signals with a varying number of channels and sampling rates. The processing typically consists of feature extraction, data fusion, and classification stages that require a large number of digital signal processing (DSP) and machine learning (ML) kernels. In response to these requirements, in this paper, a tiny, energyefficient, and domain-specific manycore accelerator referred to as power-efficient nanoclusters (PENC) is proposed to map and execute the kernels of these applications. Simulation results show that the PENC is able to reduce energy consumption by up to 80% and 25% for DSP and ML kernels, respectively, when optimally parallelized. In addition, we fully implemented three compute-intensive personalized biomedical applications, namely, multichannel seizure detection, multiphysiological stress detection, and standalone tongue drive system (sTDS), to evaluate the proposed manycore performance relative to commodity embedded CPU, graphical processing unit (GPU), and fieldprogrammable gate array (FPGA)-based implementations. For these three case studies, the energy consumption and the performance of the proposed PENC manycore, when acting as an accelerator along with an Intel Atom processor as a host, are compared with the existing commercial off-The-shelf generalpurpose, customizable, and programmable embedded platforms, including Intel Atom, Xilinx Artix-7 FPGA, and NVIDIA TK1 advanced RISC machine -A15 and K1 GPU system on a chip. For these applications, the PENC manycore is able to significantly improve throughput and energy efficiency by up to 1872× and 276×, respectively. For the most computational intensive application of seizure detection, the PENC manycore is able to achieve a throughput of 15.22 giga-operations-per-second (GOPs), which is a 14× improvement in throughput over custom FPGA solution. For stress detection, the PENC achieves a throughput of 21.36 GOPs and an energy efficiency of 4.23 GOP/J, which is 14.87× and 2.28× better over FPGA implementation, respec-Tively. For the sTDS application, the PENC improves a throughput by 5.45× and an energy efficiency by 2.37× over FPGA implementation. 1063-8210 © 2017 IEEE.
引用
收藏
页码:96 / 109
页数:13
相关论文
共 50 条
  • [21] A highly energy-efficient, area-efficient switching scheme for SAR ADC in biomedical applications
    Chen, Yushi
    Zhuang, Yiqi
    Tang, Hualian
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2019, 101 (01) : 133 - 143
  • [22] Spara: An Energy-Efficient ReRAM-Based Accelerator for Sparse Graph Analytics Applications
    Zheng, Long
    Zhao, Jieshan
    Huang, Yu
    Wang, Qinggang
    Zeng, Zhen
    Xue, Jingling
    Liao, Xiaofei
    Jin, Hai
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, 2020, : 696 - 707
  • [23] An Energy-Efficient Biomedical Signal Processing Platform
    Kwong, Joyce
    Chandrakasan, Anantha P.
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2011, 46 (07) : 1742 - 1753
  • [24] Nanomaterials for Energy-Efficient Applications
    Huang, Xiaohu
    Xing, Guozhong
    Li, Yongfeng
    Nannen, Ekaterina
    JOURNAL OF NANOMATERIALS, 2015, 2015
  • [25] An Energy-Efficient Integrated Programmable Array Accelerator and Compilation Flow for Near-Sensor Ultralow Power Processing
    Das, Satyajit
    Martin, Kevin J. M.
    Rossi, Davide
    Coussy, Philippe
    Benini, Luca
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2019, 38 (06) : 1095 - 1108
  • [26] An energy-efficient reconfigurable viterbi decoder on a programmable multiprocessor
    Zhong, Guichang
    Willson, Alan N., Jr.
    2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 1565 - 1568
  • [27] A Database Accelerator for Energy-Efficient Query Processing and Optimization
    Haas, Sebastian
    Arnold, Oliver
    Scholze, Stefan
    Hoeppner, Sebastian
    Ellguth, Georg
    Dixius, Andreas
    Ungethuem, Annett
    Mier, Eric
    Noethen, Benedikt
    Matus, Emil
    Schiefer, Stefan
    Cederstroem, Love
    Pilz, Fabian
    Mayr, Christian
    Schueffny, Rene
    Lehner, Wolfgang
    Fettweis, Gerhard P.
    2016 2ND IEEE NORDIC CIRCUITS AND SYSTEMS CONFERENCE (NORCAS), 2016,
  • [28] An Energy-Efficient Deep Neural Network Accelerator Design
    Jung, Jueun
    Lee, Kyuho Jason
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 272 - 276
  • [29] An Asynchronous Energy-Efficient CNN Accelerator with Reconfigurable Architecture
    Chen, Weijia
    Wu, Hui
    Wei, Shaojun
    He, Anping
    Chen, Hong
    2018 IEEE ASIAN SOLID-STATE CIRCUITS CONFERENCE (A-SSCC): PROCEEDINGS OF TECHNICAL PAPERS, 2018, : 51 - 54
  • [30] A Massively Parallel, Energy Efficient Programmable Accelerator for Learning and Classification
    Majumdar, Abhinandan
    Cadambi, Srihari
    Becchi, Michela
    Chakradhar, Srimat T.
    Graf, Hans Peter
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2012, 9 (01)