Design of an Efficient CNN-Based Cough Detection System on Lightweight FPGA

被引:4
|
作者
Peng, Peng [1 ]
Jiang, Kai [1 ]
You, Mingyu [1 ]
Xie, Jialin [2 ]
Zhou, Hongjun [1 ]
Xu, Weisheng [1 ]
Lu, Jicheng [3 ]
Li, Xiayu [3 ]
Xu, Yun [3 ]
机构
[1] Tongji Univ, Coll Elect & Informat Engn, Control Sci & Engn, Shanghai 201804, Peoples R China
[2] Fudan Univ, Sch Microelect, Shanghai 200433, Peoples R China
[3] Shanghai Fudan Microelect Grp Co Ltd, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
Field programmable gate arrays; Convolutional neural networks; Spectrogram; Hidden Markov models; Mel frequency cepstral coefficient; Feature extraction; Computational modeling; Cough Detection; CNN; deep learning; FPGA; hardware acceleration; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.1109/TBCAS.2023.3236976
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Precisely and automatically detecting the cough sound is of vital clinical importance. Nevertheless, due to privacy protection considerations, transmitting the raw audio data to the cloud is not permitted, and therefore there is a great demand for an efficient, accurate, and low-cost solution at the edge device. To address this challenge, we propose a semi-custom software-hardware co-design methodology to help build the cough detection system. Specifically, we first design a scalable and compact convolutional neural network (CNN) structure that generates many network instances. Second, we develop a dedicated hardware accelerator to perform the inference computation efficiently, and then we find the optimal network instance by applying network design space exploration. Finally, we compile the optimal network and let it run on the hardware accelerator. The experimental results demonstrate that our model achieves 88.8% classification accuracy, 91.2% sensitivity, 86.5% specificity, and 86.5% precision, while the computation complexity is only 1.09 M multiply-accumulation (MAC). Additionally, when implemented on a lightweight field programmable gate array (FPGA), the complete cough detection system only occupies 7.9 K lookup tables (LUTs), 12.9 K flip-flops (FFs), and 41 digital signal processing (DSP) slices, providing 8.3 GOP/s actual inference throughput and total power dissipation of 0.93 W. This framework meets the needs of partial application and can be easily extended or integrated into other healthcare applications.
引用
收藏
页码:116 / 128
页数:13
相关论文
共 50 条
  • [1] A Reconfigurable CNN-Based Accelerator Design for Fast and Energy-Efficient Object Detection System on Mobile FPGA
    Kim, Victoria Heekyung
    Choi, Kyuwon Ken
    [J]. IEEE ACCESS, 2023, 11 : 59438 - 59445
  • [2] Lightweight CNN-Based Low-Light-Image Enhancement System on FPGA Platform
    Wang, Wei
    Xu, Xin
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (06) : 8023 - 8039
  • [3] Lightweight CNN-Based Low-Light-Image Enhancement System on FPGA Platform
    Wei Wang
    Xin Xu
    [J]. Neural Processing Letters, 2023, 55 : 8023 - 8039
  • [4] Lightweight CNN-Based Method for Spacecraft Component Detection
    Liu, Yuepeng
    Zhou, Xingyu
    Han, Hongwei
    [J]. AEROSPACE, 2022, 9 (12)
  • [5] Design and Implementation of a Lightweight Deep CNN-Based Plant Biometric Authentication System
    Yan, Wenqing
    Tang, Jingwei
    Stucki, Sandro
    [J]. IEEE ACCESS, 2023, 11 : 79984 - 79993
  • [6] CNN-Based Lightweight Flame Detection Method in Complex Scenes
    Li X.
    Zhang D.
    Sun L.
    Xu Y.
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (05): : 415 - 422
  • [7] Detection of Malicious FPGA Bitstreams using CNN-Based Learning
    Chaudhuri, Jayeeta
    Chakrabarty, Krishnendu
    [J]. 2022 IEEE EUROPEAN TEST SYMPOSIUM (ETS 2022), 2022,
  • [8] Low Power FPGA-SoC Design Techniques for CNN-based Object Detection Accelerator
    Kim, Heekyung
    Choi, Ken
    [J]. 2019 IEEE 10TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2019, : 1130 - 1134
  • [9] High Power-Efficient and Performance-Density FPGA Accelerator for CNN-Based Object Detection
    Zhang, Gang
    Zhang, Chaofan
    Wang, Fan
    Tang, Fulin
    Wu, Yihong
    Yang, Xuezhi
    Liu, Yong
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 117 - 128
  • [10] The Design of Lightweight and Multi Parallel CNN Accelerator Based on FPGA
    Li Zong-ling
    Wang Lu-yuan
    Yu Ji-yang
    Cheng Bo-wen
    Hao Liang
    [J]. PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 1521 - 1528