Network Oral English Teaching System Based on Speech Recognition Technology and Deep Neural Network

Authors
He, Na [1]
Liu, Weihua [2]
Affiliations
[1] Pingxiang Univ, Sch Foreign Languages, Pingxiang 337000, Peoples R China
[2] Jiangxi Telecom Co, Pingxiang Branch, Pingxiang 337000, Peoples R China
Keywords
Deep neural network; Markov model; voice design technology; Viterbi algorithm; oral English teaching
DOI
Not available
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
With the development of computer technology, computer-aided instruction is increasingly widely used in education. This paper proposes an online oral English teaching system based on speech recognition technology and deep neural networks. First, speech recognition technology is introduced and its feature extraction is described in detail. Then, the three basic problems that a speech recognition system built on the hidden Markov model (HMM) must solve, and the three basic algorithms that address them, are discussed. The application of HMM technology in speech recognition systems is studied, and some algorithms are optimized. Compared with the traditional algorithm, the logarithmic form of the Viterbi algorithm greatly reduces the amount of computation and solves the overflow problem that arises during computation. Combining a deep neural network (DNN) with the HMM enables modeling of continuous speech signals. An analysis of the characteristics of the DNN-HMM model shows that it cannot model long-term dependencies in speech signals and that it is complex to train. Based on Kaldi, comparative training experiments with a monophone model, a triphone model, and added feature transformation techniques are carried out to progressively improve model performance. Finally, simulation experiments show that the optimized DNN-HMM hybrid model proposed in this paper achieves the highest recognition rate, 97.5%, followed by the HMM model at 95.4%; the PNN model has the lowest recognition rate, 90.1%.
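The logarithmic form of the Viterbi algorithm mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, which the abstract does not show; it is a generic log-domain Viterbi decoder for a discrete HMM. The function names `safe_log` and `viterbi_log`, the parameter layout, and the toy two-state model in the usage note are all assumptions made for illustration. Taking logarithms turns products of probabilities into sums, so long observation sequences no longer drive the path score out of floating-point range.

```python
import math

NEG_INF = float("-inf")


def safe_log(p):
    """Return log(p), mapping a zero probability to -inf."""
    return math.log(p) if p > 0.0 else NEG_INF


def viterbi_log(obs, start_p, trans_p, emit_p):
    """Most likely state path for an observation sequence, in the log domain.

    obs     : list of observation indices
    start_p : start_p[i]    = P(first state = i)
    trans_p : trans_p[i][j] = P(next state = j | current state = i)
    emit_p  : emit_p[i][k]  = P(observation = k | state = i)
    Returns (best_log_prob, best_path).
    """
    n_states = len(start_p)
    log_start = [safe_log(p) for p in start_p]
    log_trans = [[safe_log(p) for p in row] for row in trans_p]
    log_emit = [[safe_log(p) for p in row] for row in emit_p]

    # delta[i]: best log-probability of any path that ends in state i.
    delta = [log_start[i] + log_emit[i][obs[0]] for i in range(n_states)]
    backptr = []

    for o in obs[1:]:
        new_delta = []
        pointers = []
        for j in range(n_states):
            # Products of probabilities become sums of log-probabilities.
            best_i = max(range(n_states),
                         key=lambda i: delta[i] + log_trans[i][j])
            new_delta.append(delta[best_i] + log_trans[best_i][j] + log_emit[j][o])
            pointers.append(best_i)
        delta = new_delta
        backptr.append(pointers)

    # Backtrack from the best final state to recover the full path.
    last = max(range(n_states), key=lambda i: delta[i])
    path = [last]
    for pointers in reversed(backptr):
        path.append(pointers[path[-1]])
    path.reverse()
    return delta[last], path
```

As a usage example under the same assumptions, a hypothetical two-state model with `start_p = [0.6, 0.4]`, `trans_p = [[0.7, 0.3], [0.4, 0.6]]` and `emit_p = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]` decodes the observation sequence `[0, 1, 2]` to the state path `[0, 0, 1]`. For a sequence of several thousand observations the returned log-probability stays finite, whereas the equivalent product of raw probabilities would round to zero.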
Pages: 829-839
Page count: 11