Classical and Deep Learning Methods for Speech Command Recognition

被引:2
|
作者
Xie, Jie [1 ]
Li, Qijing [1 ]
Hu, Kai [1 ]
Zhu, Mingying [2 ]
机构
[1] Jiangnan Univ, Sch Internet Things Engn, Wuxi, Jiangsu, Peoples R China
[2] Nanjing Univ, Sch Econ, Wuxi, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
speech command recognition; convolutional neural networks; acoustic feature;
D O I
10.1109/ICICN52636.2021.9673813
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an application area of speech command recognition, smart home has provided people a convenient way to communicate with various digital devices. In this study, we aim to investigate both machine learning and deep learning architectures for improved speaker-independent speech command recognition. First, we extract statistical MFCCs vectors to train classical machine learning models: KNN, SVM, and RF. Second, we trained deep learning models using two end-to-end architectures with different inputs. Experimental results indicate that our presented method achieved the highest accuracy and F1 score of 0.846 +/- 0.148 and 0.84 +/- 0.157 on the private dataset.
引用
收藏
页码:41 / 45
页数:5
相关论文
共 50 条
  • [1] Speech Command Recognition Using Deep Learning
    Ayache, Mohammad
    Kanaan, Hussien
    Kassir, Kawthar
    Kassir, Yasser
    [J]. 2021 SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN BIOMEDICAL ENGINEERING (ICABME), 2021, : 24 - 29
  • [2] Application of Bernstein and pattern recognition methods for speech command recognition
    Department of Computer Education, Gazi University, 06500 Ankara, Turkey
    [J]. J. Appl. Sci., 2007, 20 (3063-3068):
  • [3] Yi Language Speech Recognition using Deep Learning Methods
    Chen, Ziyan
    Yang, Hongwu
    [J]. PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 1064 - 1068
  • [4] Deep Learning Enabled High-Performance Speech Command Recognition on Graphene Flexible Microphones
    Zhang, Xin-Yu
    Liu, Hang
    Ma, Xiang-Yu
    Wang, Zi-Cheng
    Li, Guo-Peng
    Han, Lei
    Sun, Kuan
    Yang, Qi-Sheng
    Ji, Shou-Rui
    Yu, Du-Li
    Li, Yu-Tao
    Ren, Tian-Ling
    [J]. ACS APPLIED ELECTRONIC MATERIALS, 2022, 4 (05) : 2306 - 2312
  • [5] Pattern recognition system: from classical methods to deep learning techniques
    Bendjenna, Hakim
    Meraoumia, Abdallah
    Chergui, Othaila
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (03)
  • [6] A Speech Command Control-Based Recognition System for Dysarthric Patients Based on Deep Learning Technology
    Lin, Yu-Yi
    Zheng, Wei-Zhong
    Chu, Wei Chung
    Han, Ji-Yan
    Hung, Ying-Hsiu
    Ho, Guan-Min
    Chang, Chia-Yuan
    Lai, Ying-Hui
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (06):
  • [7] Speech Emotion Recognition with Deep Learning
    Harar, Pavol
    Burget, Radim
    Dutta, Malay Kishore
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 137 - 140
  • [8] Speech Recognition using Deep Learning
    Lakkhanawannakun, Phoemporn
    Noyunsan, Chaluemwut
    [J]. 2019 34TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2019), 2019, : 514 - 517
  • [9] Deep Learning for Emotional Speech Recognition
    Sanchez-Gutierrez, Maximo E.
    Marcelo Albornoz, E.
    Martinez-Licona, Fabiola
    Leonardo Rufiner, H.
    Goddard, John
    [J]. PATTERN RECOGNITION, MCPR 2014, 2014, 8495 : 311 - +
  • [10] Deep Learning for Emotional Speech Recognition
    Alhamada, M., I
    Khalifa, O. O.
    Abdalla, A. H.
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA2020), 2020, 2306