Network Oral English Teaching System Based on Speech Recognition Technology and Deep Neural Network

被引：0

作者：

He, Na ^{[1
]}

Liu, Weihua ^{[2
]}

机构：

[1] Pingxiang Univ, Sch Foreign Languages, Pingxiang 337000, Peoples R China

[2] Jiangxi Telecom Co, Pingxiang Branch, Pingxiang 337000, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2023年 / 14卷 / 12期

关键词：

Deep neural network; Markov model; voice design technology; Viterbi algorithm; oral English teaching;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

With the development of computer technology, computer-aided instruction is being used more and more widely in the field of education. Based on speech recognition technology and deep neural network, this paper proposes an online oral English teaching system. Firstly, the speech recognition technology is introduced and its feature extraction is elaborated in detail. Then, three basic problems and three basic algorithms that need to be solved in speech recognition system using Markov model are discussed. The application of HMM technology in speech recognition system is studied, and some algorithms are optimized. The logarithmic processing of Viterbi algorithm, compared with the traditional algorithm, greatly reduces the amount of computation and solves the overflow problem in the operation process. By combining deep network with HMM, continuous speech signal modeling is realized. According to the characteristics of the DNN-HMM model, it is proposed that the model cannot model the long-term dependence of speech signals and train complex problems. Based on Kaldi, the model training comparison experiments of monophonon model, triphonon model and adding feature transformation technology are carried out to continuously improve the model performance. Finally, through simulation experiments, it is found that the recognition rate of the optimized DNN-HMM mixed model proposed in this paper is the highest, reaching 97.5%, followed by the HMM model, which is 95.4%, and the lowest recognition rate is the PNN model, which is 90.1%.

引用

页码：829 / 839

页数：11

共 50 条

[11] Deep Neural Network Based Speech Separation for Robust Speech Recognition
Tu Yanhui
Jun, Du
Xu Yong
Dai Lirong
Chin-Hui, Lee
[J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 532 - 536
[12] Design of English teaching speech recognition system based on LSTM network and feature extraction
Geng, Yanmei
[J]. SOFT COMPUTING, 2023,
[13] Application of sensor network and speech recognition system in online english teaching
Ding, Xiaolong
[J]. INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2023,
[14] Application of speech software based on mobile network technology in oral English teaching and classroom feedback
Yan, Kefei
[J]. SOFT COMPUTING, 2023,
[15] A Study on Speech Recognition by a Neural Network Based on English Speech Feature Parameters
Mao, Congmin
Liu, Sujing
[J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (03) : 679 - 684
[16] Audiovisual speech recognition based on a deep convolutional neural network
Rudregowda S.
Patilkulkarni S.
Ravi V.
H.L. G.
Krichen M.
[J]. Data Science and Management, 2024, 7 (01): : 25 - 34
[17] Deep Convolution Neural Network Based Speech Recognition for Chhattisgarhi
Londhe, Narendra D.
Kshirsagar, Ghanahshyam B.
Tekchandani, Hitesh
[J]. 2018 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2018, : 667 - 671
[18] Stimulated Deep Neural Network for Speech Recognition
Wu, Chunyang
Karanasou, Penny
Gales, Mark J. F.
Sim, Khe Chai
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 400 - 404
[19] Oral English Speech Recognition Based on Enhanced Temporal Convolutional Network
Wu, Hao
Sangaiah, Arun Kumar
[J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2021, 28 (01): : 121 - 132
[20] Language Model Optimization for a Deep Neural Network Based Speech Recognition System for Serbian
Pakoci, Edvin
Popovic, Branislav
Pekar, Darko
[J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 483 - 492

← 1 2 3 4 5 →