Classical and Deep Learning Methods for Speech Command Recognition

被引:2
|
作者
Xie, Jie [1 ]
Li, Qijing [1 ]
Hu, Kai [1 ]
Zhu, Mingying [2 ]
机构
[1] Jiangnan Univ, Sch Internet Things Engn, Wuxi, Jiangsu, Peoples R China
[2] Nanjing Univ, Sch Econ, Wuxi, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
speech command recognition; convolutional neural networks; acoustic feature;
D O I
10.1109/ICICN52636.2021.9673813
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an application area of speech command recognition, smart home has provided people a convenient way to communicate with various digital devices. In this study, we aim to investigate both machine learning and deep learning architectures for improved speaker-independent speech command recognition. First, we extract statistical MFCCs vectors to train classical machine learning models: KNN, SVM, and RF. Second, we trained deep learning models using two end-to-end architectures with different inputs. Experimental results indicate that our presented method achieved the highest accuracy and F1 score of 0.846 +/- 0.148 and 0.84 +/- 0.157 on the private dataset.
引用
下载
收藏
页码:41 / 45
页数:5
相关论文
共 50 条
  • [21] EFFICIENT DEEP LEARNING FOR PATHOLOGICAL SPEECH RECOGNITION
    Pham, Tuan D.
    2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 103 - 104
  • [22] Persian speech recognition using deep learning
    Hadi Veisi
    Armita Haji Mani
    International Journal of Speech Technology, 2020, 23 : 893 - 905
  • [23] Deep learning for Depression Recognition from Speech
    Tian, Han
    Zhu, Zhang
    Jing, Xu
    MOBILE NETWORKS & APPLICATIONS, 2023, 29 (4): : 1212 - 1227
  • [24] Fake Speech Recognition Using Deep Learning
    Camacho, Steven
    Maria Ballesteros, Dora
    Renza, Diego
    APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2021, 2021, 1431 : 38 - 48
  • [25] Deep fusion framework for speech command recognition using acoustic and linguistic features
    Mehra, Sunakshi
    Susan, Seba
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (25) : 38667 - 38691
  • [26] Comparison of the performance of innovative deep learning and classical methods of machine learning to solve industrial recognition tasks
    Anding, K.
    Haar, L.
    Polte, G.
    Walz, J.
    Notni, G.
    PHOTONICS AND EDUCATION IN MEASUREMENT SCIENCE, 2019, 11144
  • [27] Deep fusion framework for speech command recognition using acoustic and linguistic features
    Sunakshi Mehra
    Seba Susan
    Multimedia Tools and Applications, 2023, 82 : 38667 - 38691
  • [28] A Review of Recent Advances on Deep Learning Methods for Audio-Visual Speech Recognition
    Ivanko, Denis
    Ryumin, Dmitry
    Karpov, Alexey
    MATHEMATICS, 2023, 11 (12)
  • [29] SPEECH EMOTION RECOGNITION WITH ENSEMBLE LEARNING METHODS
    Shih, Po-Yuan
    Chen, Chia-Ping
    Wu, Chung-Hsien
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2756 - 2760
  • [30] STATISTICAL-METHODS FOR LEARNING OF SPEECH RECOGNITION
    SUZUKI, H
    OIZUMI, J
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1964, 10 (04) : 403 - &