Classical and Deep Learning Methods for Speech Command Recognition

被引：2

作者：

Xie, Jie ^{[1
]}

Li, Qijing ^{[1
]}

Hu, Kai ^{[1
]}

Zhu, Mingying ^{[2
]}

机构：

[1] Jiangnan Univ, Sch Internet Things Engn, Wuxi, Jiangsu, Peoples R China

[2] Nanjing Univ, Sch Econ, Wuxi, Jiangsu, Peoples R China

来源：

2021 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2021) | 2021年

基金：

中国国家自然科学基金;

关键词：

speech command recognition; convolutional neural networks; acoustic feature;

D O I：

10.1109/ICICN52636.2021.9673813

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

As an application area of speech command recognition, smart home has provided people a convenient way to communicate with various digital devices. In this study, we aim to investigate both machine learning and deep learning architectures for improved speaker-independent speech command recognition. First, we extract statistical MFCCs vectors to train classical machine learning models: KNN, SVM, and RF. Second, we trained deep learning models using two end-to-end architectures with different inputs. Experimental results indicate that our presented method achieved the highest accuracy and F1 score of 0.846 +/- 0.148 and 0.84 +/- 0.157 on the private dataset.

引用

页码：41 / 45

页数：5

共 50 条

[1] Speech Command Recognition Using Deep Learning
Ayache, Mohammad
Kanaan, Hussien
Kassir, Kawthar
Kassir, Yasser
[J]. 2021 SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN BIOMEDICAL ENGINEERING (ICABME), 2021, : 24 - 29
[2] Application of Bernstein and pattern recognition methods for speech command recognition
Department of Computer Education, Gazi University, 06500 Ankara, Turkey
[J]. J. Appl. Sci., 2007, 20 (3063-3068):
[3] Yi Language Speech Recognition using Deep Learning Methods
Chen, Ziyan
Yang, Hongwu
[J]. PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 1064 - 1068
[4] Deep Learning Enabled High-Performance Speech Command Recognition on Graphene Flexible Microphones
Zhang, Xin-Yu
Liu, Hang
Ma, Xiang-Yu
Wang, Zi-Cheng
Li, Guo-Peng
Han, Lei
Sun, Kuan
Yang, Qi-Sheng
Ji, Shou-Rui
Yu, Du-Li
Li, Yu-Tao
Ren, Tian-Ling
[J]. ACS APPLIED ELECTRONIC MATERIALS, 2022, 4 (05) : 2306 - 2312
[5] Pattern recognition system: from classical methods to deep learning techniques
Bendjenna, Hakim
Meraoumia, Abdallah
Chergui, Othaila
[J]. JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (03)
[6] A Speech Command Control-Based Recognition System for Dysarthric Patients Based on Deep Learning Technology
Lin, Yu-Yi
Zheng, Wei-Zhong
Chu, Wei Chung
Han, Ji-Yan
Hung, Ying-Hsiu
Ho, Guan-Min
Chang, Chia-Yuan
Lai, Ying-Hui
[J]. APPLIED SCIENCES-BASEL, 2021, 11 (06):
[7] Speech Emotion Recognition with Deep Learning
Harar, Pavol
Burget, Radim
Dutta, Malay Kishore
[J]. 2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 137 - 140
[8] Speech Recognition using Deep Learning
Lakkhanawannakun, Phoemporn
Noyunsan, Chaluemwut
[J]. 2019 34TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2019), 2019, : 514 - 517
[9] Deep Learning for Emotional Speech Recognition
Sanchez-Gutierrez, Maximo E.
Marcelo Albornoz, E.
Martinez-Licona, Fabiola
Leonardo Rufiner, H.
Goddard, John
[J]. PATTERN RECOGNITION, MCPR 2014, 2014, 8495 : 311 - +
[10] Deep Learning for Emotional Speech Recognition
Alhamada, M., I
Khalifa, O. O.
Abdalla, A. H.
[J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA2020), 2020, 2306

← 1 2 3 4 5 →