Isolated Word Speech Recognition System Using Deep Neural Networks

Cited by: 8
Authors
Dhanashri, Dhavale [1]
Dhonde, S. B. [1]
Affiliation
[1] AISSMS IOIT, Dept Elect, Pune, Maharashtra, India
Keywords
Isolated word speech recognition; Deep neural networks; Deep belief networks; Acoustic model;
DOI
10.1007/978-981-10-1675-2_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Speech recognition is the process of converting speech signals into words. For acoustic modeling, HMM-GMM has been used for many years. A GMM, however, requires assumptions about the data distribution in order to compute probabilities. To remove this limitation, the GMM is replaced by a DNN in the acoustic model. Deep neural networks are feed-forward neural networks with more than one layer of hidden units. In this work, we present an isolated word speech recognition system that uses an HMM-DNN acoustic model. We use the Deep Belief Network (DBN) pre-training algorithm to initialize the deep neural network. A DBN is a multilayer generative probabilistic model with a large number of stochastic binary units. The features used are mel-frequency cepstral coefficients (MFCC). Experimental results are reported on the TI digits database, on which the proposed system achieves 86.06% accuracy. System accuracy can be further increased by increasing the number of hidden units.
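As a rough illustration of the feed-forward DNN described in the abstract, the following minimal sketch passes one feature frame through two sigmoid hidden layers and a softmax output layer. It is not the authors' implementation: the layer sizes (13 inputs, two hidden layers of 32 units, 11 output classes), the function names, and the random weight initialization are all assumptions chosen for illustration, and the random initialization merely stands in for the DBN pre-training step, which is not shown.

```python
import math
import random


def sigmoid(x):
    """Logistic activation used for the hidden units."""
    return 1.0 / (1.0 + math.exp(-x))


def softmax(scores):
    """Turn output scores into a probability distribution."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def affine(weights, biases, inputs):
    """Compute W x + b, where each row of `weights` feeds one output unit."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def init_layer(n_in, n_out, rng):
    """Small random weights; a placeholder for DBN pre-training (assumption)."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def forward(layers, features):
    """Sigmoid hidden layers followed by a softmax output layer."""
    activation = features
    for weights, biases in layers[:-1]:
        activation = [sigmoid(z) for z in affine(weights, biases, activation)]
    weights, biases = layers[-1]
    return softmax(affine(weights, biases, activation))


rng = random.Random(0)
sizes = [13, 32, 32, 11]  # hypothetical: 13 MFCC-like inputs, 11 output classes
layers = [init_layer(n_in, n_out, rng) for n_in, n_out in zip(sizes, sizes[1:])]
frame = [rng.uniform(-1.0, 1.0) for _ in range(sizes[0])]  # stand-in feature frame
posteriors = forward(layers, frame)
```

In an HMM-DNN hybrid, output values of this kind would typically be interpreted as per-frame posteriors over HMM states and combined with the HMM during decoding; that step is outside the scope of this sketch.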
Pages: 9-17
Number of pages: 9