Efficient Real-Time Smart Keyword Spotting Using Spectrogram-Based Hybrid CNN-LSTM for Edge System

Cited by: 0
Authors
Syafalni, Infall [1, 2, 3]
Amadeus, Clarence [1]
Sutisna, Nana [1, 3]
Adiono, Trio [1]
Affiliations
[1] Bandung Inst Technol, Sch Elect Engn & Informat, Bandung 40132, Indonesia
[2] Bandung Inst Technol, Univ Ctr Excellence Microelect, Bandung 40132, Indonesia
[3] Interuniv Microelect Ctr IMEC, B-3001 Leuven, Belgium
Keywords
Edge computing; hybrid CNN-LSTM; keyword spotting; real-time; embedded devices
DOI
10.1109/ACCESS.2024.3380350
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Keyword spotting (KWS) is the task of recognizing spoken command words from a database. With recent applications in human-machine interaction, KWS systems require real-time performance, for which edge computing is a preferable option. For a KWS system to run fast and in real time, a low-complexity yet highly accurate AI model is mandatory. In this paper, we propose a comprehensive voice command recognition system design and its hardware implementation. The AI model used in this system is SpectroNet-based: an efficient hybrid CNN-LSTM architecture with low complexity. Jetson Xavier NX is chosen as the edge device because of its strong computational power for an embedded device. The implementation results show that the proposed method achieves good accuracy, with no accuracy drop between the model implemented on a PC and on the Jetson Xavier. However, the inference time is quite high, at 180 ms/step. To improve the speed of the system, the TensorRT library is used to further optimize the model. The optimization is effective: it removes 59.35% of the total operations performed in SpectroNet when FP32 precision is used, and 59.63% when FP16 precision is used. The model is also sped up by 45% in FP32 precision mode and by 62% in FP16 precision mode. However, there is a slight accuracy drop of 2.68% in FP32 precision mode and 4.84% in FP16 precision mode. This drop is considered negligible compared to the performance boost that TensorRT gives. The work is useful for intelligent control systems such as smart vehicles, smartphones, computers, and smart communications.
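The abstract reports a 180 ms/step inference time and speed-ups of 45% (FP32) and 62% (FP16), but it does not say whether "sped up by p%" means a p% shorter time per step or p% more steps per second. The following is a minimal sketch that works out the per-step latency under both readings. It assumes the 180 ms/step figure is the pre-TensorRT baseline; the function names are hypothetical and nothing here is taken from the paper beyond the three quoted numbers.

```python
# Assumption: 180 ms/step is the baseline latency before TensorRT optimization.
BASELINE_MS = 180.0
# Speed-up percentages quoted in the abstract, keyed by precision mode.
SPEEDUPS = {"FP32": 45.0, "FP16": 62.0}


def latency_if_time_reduced(baseline_ms: float, percent: float) -> float:
    """Reading A: the time per step shrinks by `percent` percent."""
    return baseline_ms * (1.0 - percent / 100.0)


def latency_if_throughput_increased(baseline_ms: float, percent: float) -> float:
    """Reading B: the number of steps per second grows by `percent` percent."""
    return baseline_ms / (1.0 + percent / 100.0)


def summarize() -> dict:
    """Return {mode: (latency under reading A, latency under reading B)} in ms."""
    return {
        mode: (
            round(latency_if_time_reduced(BASELINE_MS, pct), 2),
            round(latency_if_throughput_increased(BASELINE_MS, pct), 2),
        )
        for mode, pct in SPEEDUPS.items()
    }


if __name__ == "__main__":
    for mode, (reading_a, reading_b) in summarize().items():
        print(f"{mode}: {reading_a} ms/step (time reduced), "
              f"{reading_b} ms/step (throughput increased)")
```

The two readings give noticeably different latencies, so the full paper should be consulted before quoting a post-optimization inference time.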
Pages: 43109 - 43125
Page count: 17
Related papers
50 items in total
  • [21] EEG-fNIRS-based hybrid image construction and classification using CNN-LSTM
    Mughal, Nabeeha Ehsan
    Khan, Muhammad Jawad
    Khalil, Khurram
    Javed, Kashif
    Sajid, Hasan
    Naseer, Noman
    Ghafoor, Usman
    Hong, Keum-Shik
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [22] A real-time transformer discharge pattern recognition method based on CNN-LSTM driven by few-shot learning
    Zheng, Qinghe
    Wang, Ruoyu
    Tian, Xinyu
    Yu, Zhiguo
    Wang, Hongjun
    Elhanashi, Abdussalam
    Saponara, Sergio
    ELECTRIC POWER SYSTEMS RESEARCH, 2023, 219
  • [23] FPGA-accelerated hybrid CNN-LSTM system for efficient EEG-based drowsiness recognition
    Rama Muni Reddy Yanamala
    Muralidhar Pullakandam
    The Journal of Supercomputing, 81 (3)
  • [24] RETRACTED: CNN-LSTM Hybrid Real-Time IoT-Based Cognitive Approaches for ISLR with WebRTC: Auditory Impaired Assistive Technology (Retracted Article)
    Gupta, Meenu
    Thakur, Narina
    Bansal, Dhruvi
    Chaudhary, Gopal
    Davaasambuu, Battulga
    Hua, Qiaozhi
    JOURNAL OF HEALTHCARE ENGINEERING, 2022, 2022
  • [25] A Real-Time Keyword Spotting System Based on an End-To-End Binary Convolutional Neural Network in FPGA
    Yoon, Jinsung
    Lee, Donghyun
    Kim, Neungyun
    Lee, Su-Jung
    Kwak, Gil-Ho
    Kim, Tae-Hwan
    2023 IEEE SYMPOSIUM IN LOW-POWER AND HIGH-SPEED CHIPS, COOL CHIPS, 2023
  • [26] Fault Detection and Classification in Ring Power System With DG Penetration Using Hybrid CNN-LSTM
    Alhanaf, Ahmed Sami
    Farsadi, Murtaza
    Balik, Hasan Huseyin
    IEEE ACCESS, 2024, 12 : 59953 - 59975
  • [27] Efficient prediction of runway visual range by using a hybrid CNN-LSTM network architecture for aviation services
    Shankar, Anand
    Sahana, Bikash Chandra
    THEORETICAL AND APPLIED CLIMATOLOGY, 2024, 155 (03) : 2215 - 2232
  • [28] CardiacRT-NN: Real-Time Detection of Cardiovascular Disease Using Self-attention CNN-LSTM for Embedded Systems
    Li, Yixin
    Sui, Ning
    Gehi, Anil
    Guo, Chengan
    Guo, Zhishan
    ADVANCES IN NEURAL NETWORKS-ISNN 2024, 2024, 14827 : 610 - 621
  • [29] Optimized Hybrid CNN-LSTM Based Islanding Detection of Solar-Wind Power System
    Vadlamudi, Bindu
    Anuradha, T.
    ELECTRIC POWER COMPONENTS AND SYSTEMS, 2024, 52 (03) : 337 - 355
  • [30] Efficient prediction of runway visual range by using a hybrid CNN-LSTM network architecture for aviation services
    Anand Shankar
    Bikash Chandra Sahana
    Theoretical and Applied Climatology, 2024, 155 : 2215 - 2232