Performance Optimization of Speech Recognition System with Deep Neural Network Model

被引：0

作者：

Wei Guan ^{[1
]}

机构：

[1] College of Modern Science and Technology, China Jiliang University, HangzhouZhejiang

来源：

Optical Memory and Neural Networks | 2018年 / 27卷 / 4期

关键词：

acoustic model; deep neural network; discriminative training; performance optimization; speech recognition;

D O I：

10.3103/S1060992X18040094

中图分类号：

学科分类号：

摘要：

Abstract: With the development of internet, man-machine interaction has tended to be more important. Precise speech recognition has become an important means to achieve man-machine interaction. In this study, deep neural network model was used to enhance speech recognition performance. Feedforward fully connected deep neural network, time-delay neural network, convolutional neural network and feedforward sequence memory neural network were studied, and their speech recognition performance was studied by comparing their acoustic models. Moreover, the recognition performance of the model after adding different dimension human voice features was tested. The results showed that the performance of the speech recognition system could be improved effectively by using the deep neural network model, and the performance of feedforward sequence memory neural network was the best, followed by deep neural network, time-delay neural network and convolutional neural network. Different extraction features had different improvement effects on model performance. The performance of the model which was added with Fbank extraction features was superior to that added with Mel-frequency cepstrum coefficient (MFCC) extraction feature. The model performance improved after the addition of vocal characteristics. Different models had different vocal characteristic dimensions. © 2018, Allerton Press, Inc.

引用

页码：272 / 282

页数：10

共 50 条

[41] Hierarchical Expert Neural Network System for Speech Recognition
Rocha, Priscila
Silva, Washington
Barros, Allan
JOURNAL OF CONTROL AUTOMATION AND ELECTRICAL SYSTEMS, 2019, 30 (03) : 347 - 359
[42] Deep Neural Network-based Speech Separation Combining with MVDR Beamformer for Automatic Speech Recognition System
Lee, Bong-Ki
Jeong, Jaewoong
2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2019,
[43] A Fuzzy Neural Network Applied in the Speech Recognition System
Zhang, Xueying
Wang, Peng
Li, Gaoyun
Hou, Wenjun
ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2008, : 14 - +
[44] Performance Evaluation of an Accessory Category Recognition System Using Deep Neural Network
Sakai, Yuki
Oda, Tetsuya
Ikeda, Makoto
Barolli, Leonard
PROCEEDINGS OF 2016 19TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS), 2016, : 437 - 441
[45] An optimization method for speech enhancement based on deep neural network
Sun, Haixia
Li, Sikun
3RD INTERNATIONAL CONFERENCE ON ADVANCES IN ENERGY, ENVIRONMENT AND CHEMICAL ENGINEERING, 2017, 69
[46] Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition
Zhang, Hua
Gou, Ruoyun
Shang, Jili
Shen, Fangyao
Wu, Yifan
Dai, Guojun
FRONTIERS IN PHYSIOLOGY, 2021, 12
[47] ACCELERATING RECURRENT NEURAL NETWORK LANGUAGE MODEL BASED ONLINE SPEECH RECOGNITION SYSTEM
Lee, Kyungmin
Park, Chiyoun
Kim, Namhoon
Lee, Jaewon
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5904 - 5908
[48] Optimization of Deep Neural Network for Neuromorphic System
Lee, Jae Eun
Lee, Chul Jun
Lee, Dae Seok
Kim, Dong Wook
Seo, Young Ho
2019 34TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2019), 2019, : 430 - 431
[49] LOCAL TRAJECTORY BASED SPEECH ENHANCEMENT FOR ROBUST SPEECH RECOGNITION WITH DEEP NEURAL NETWORK
You, Yongbin
Qian, Yanmin
Yu, Kai
2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 5 - 9
[50] Performance Evaluation of Deep Autoencoder Network for Speech Emotion Recognition
AndleebSiddiqui, Maria
Hussain, Wajahat
Ali, Syed Abbas
Danish-ur-Rehman
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (02) : 606 - 611

← 1 2 3 4 5 →